TOP ACTIVATIONS
MAX = 10.682

worry , this is R uf us . He 's very
, " O w , ow , that hurts !" She
, " O w , ow , ow ! Sara ,
O w , ow , ow ! Sara , you hurt
O w , ow , ow . Stop it . That
, " O w , ow , ow . Stop it
O w , ow , ow !" Ben cried . "
" O w , ow , ow !" Ben cried
O w , ow , ow ! The dog bit me
" O w , ow , ow ! The dog
play - date with R uf us now ." El le
, " O w , ow , ow ! Please stop
O w , ow , ow ! Please stop , please
O w , ow , ow !" Tom screamed . "
" O w , ow , ow !" Tom screamed
" O w , ow , ow !" Lily screamed
" O w , ow , ow ! That goose
O w , ow , ow !" Lily screamed . "
the ladder , then R uf us can come over ."
O w , ow , ow ! That goose is mean

MIN ACTIVATIONS
MIN = -9.951

a girl who loved sun sets . Every day she would
bear . He loved sun sets and he wanted to take
The people , the sun sets and the b owing all
big mountains and beautiful sun sets . He felt happy and
big mountains and beautiful sun sets . But one day ,
. She watched the sun sets and fireworks for hours .
adm iring the beautiful sun sets from afar . <|endoftext|> Once
could look at the sun sets pieces every day and remember
friends who play drums and keyboards . They wear colorful clothes
marching and shouting , waving banners . He thought it was
. His wife hung happy banners and spark ly stream ers
Everyone had fun painting the banners and marching around .
living room with balloons and banners . They had a big
all the shiny , colourful guitars and explained to his mother
with her hands on her hips . She said ,
put her hands on her hips and said , " E
. Tim my saw many guitars and he didn 't know
with his hands on his hips , waiting for the k
! They had drums and guitars and a singer who sang
put his hands on his hips , and roared like a

INTERVAL 5.524 - 10.682
CONTAINS 0.844%

. It is an animal that lives in the snow .
then Mom said , " Oh no , the car is
and comfortable home . But that tiger always smiled when it
It was going so fast that she couldn 't keep up
, they did not know that there were big , mean

INTERVAL 0.366 - 5.524
CONTAINS 40.938%

the chim ney and rescued the k itty . The k
can do it if you keep trying ." The
happy and together they collected the blue shells which the her
Bounce cried . Yoga felt sad because Bounce got hurt .
clock !" Lily said , as the clock started to tick

INTERVAL -4.792 - 0.366
CONTAINS 56.083%

Ann was also busy with her operation , carrying food back
and all the magic that came with it . One day
play with their toy boats in the pool . They had
smiled and showed it to Pam . But Pam frowned when
day , James wanted to record something on his camera .

INTERVAL -9.951 - -4.792
CONTAINS 2.135%

said he would tell Tom 's mom . Tom felt ashamed
They shared their stories and laughed together . When they arrived
. Come on . Let 's have fun ."
<|endoftext|> Once upon a time there was a lazy cat who
my replied , " That 's okay , Mom my .
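
The four INTERVAL blocks above appear to split the range from MIN to MAX into equal-width bins, with each CONTAINS figure giving the share of sampled tokens that falls in that bin. The Python sketch below checks that reading against the rounded values printed above. The function name `equal_width_edges` and the four-bin assumption are illustrative and are not taken from the dashboard itself. Because MIN and MAX are shown rounded to three decimals, the recomputed edges can differ from the printed ones in the last digit.

```python
# Sketch: recompute the interval edges from the printed MIN and MAX,
# assuming (hypothetically) four equal-width bins.

def equal_width_edges(lo, hi, bins):
    """Return bins + 1 edges that split [lo, hi] into equal-width intervals."""
    width = (hi - lo) / bins
    return [lo + i * width for i in range(bins + 1)]

MIN_ACT = -9.951   # "MIN = -9.951" in the dump
MAX_ACT = 10.682   # "MAX = 10.682" in the dump

edges = equal_width_edges(MIN_ACT, MAX_ACT, 4)

# CONTAINS percentages, listed from the lowest interval to the highest.
shares = [2.135, 56.083, 40.938, 0.844]

for left, right, share in zip(edges[:-1], edges[1:], shares):
    print(f"{left:8.3f} to {right:8.3f}: {share:6.3f}%")

print(f"total share: {sum(shares):.3f}%")
```

Under this reading the four shares add up to 100%, and almost all sampled tokens (about 97%) sit in the two middle intervals, so both the strongly positive and the strongly negative examples are rare.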
Token worry
Token ID5490
Feature activation+4.02e-01

Pos logit contributions
lets+0.050
packs+0.049
 names+0.049
 mattered+0.048
 interests+0.047
Neg logit contributions
 beast-0.070
 crow-0.067
 hospital-0.065
 ig-0.063
 desert-0.062
Token,
Token ID11
Feature activation-2.01e-01

Pos logit contributions
 beast+0.022
 crow+0.022
 hospital+0.021
 ig+0.020
 desert+0.020
Neg logit contributions
packs-0.018
aces-0.017
lets-0.017
 names-0.016
suits-0.016
Token this
Token ID428
Feature activation+6.77e-01

Pos logit contributions
packs+0.009
lets+0.008
aces+0.008
 names+0.008
suits+0.008
Neg logit contributions
 beast-0.010
 hospital-0.010
 crow-0.010
 ig-0.010
 desert-0.009
Token is
Token ID318
Feature activation+5.01e-02

Pos logit contributions
 beast+0.036
 crow+0.035
 hospital+0.035
 ig+0.033
 unfortunate+0.032
Neg logit contributions
packs-0.031
aces-0.030
lets-0.029
suits-0.028
 names-0.027
Token R
Token ID371
Feature activation+3.98e+00

Pos logit contributions
 beast+0.003
 crow+0.003
 hospital+0.003
 desert+0.002
 ig+0.002
Neg logit contributions
packs-0.002
 names-0.002
lets-0.002
aces-0.002
 mattered-0.002
Tokenuf
Token ID3046
Feature activation+1.07e+01

Pos logit contributions
 beast+0.178
 crow+0.172
 hospital+0.170
 ig+0.161
 unfortunate+0.156
Neg logit contributions
packs-0.216
aces-0.209
lets-0.203
suits-0.198
 names-0.195
Tokenus
Token ID385
Feature activation+5.94e+00

Pos logit contributions
 beast+0.375
 crow+0.359
 hospital+0.359
 ig+0.332
 unfortunate+0.321
Neg logit contributions
packs-0.678
aces-0.659
lets-0.645
suits-0.631
 names-0.622
Token.
Token ID13
Feature activation+5.81e-01

Pos logit contributions
 beast+0.279
 crow+0.273
 hospital+0.256
 that+0.245
 ig+0.241
Neg logit contributions
packs-0.310
aces-0.292
 journals-0.283
lets-0.279
suits-0.278
Token He
Token ID679
Feature activation+2.20e+00

Pos logit contributions
 beast+0.030
 crow+0.029
 hospital+0.029
 ig+0.027
 unfortunate+0.026
Neg logit contributions
packs-0.028
aces-0.026
lets-0.026
suits-0.025
 names-0.025
Token's
Token ID338
Feature activation-6.18e+00

Pos logit contributions
 beast+0.137
 crow+0.134
 hospital+0.132
 ig+0.126
 unfortunate+0.123
Neg logit contributions
packs-0.082
aces-0.077
lets-0.073
suits-0.072
 names-0.068
Token very
Token ID845
Feature activation+4.14e+00

Pos logit contributions
packs+0.291
 names+0.287
aces+0.275
lets+0.269
 interests+0.259
Neg logit contributions
 beast-0.304
 crow-0.290
 hospital-0.281
 cliff-0.276
andel-0.273
Token,
Token ID11
Feature activation-1.97e-01

Pos logit contributions
 beast+0.153
 crow+0.151
 hospital+0.147
 ig+0.138
 desert+0.133
Neg logit contributions
packs-0.159
aces-0.151
lets-0.147
suits-0.143
 journals-0.139
Token "
Token ID366
Feature activation-1.64e+00

Pos logit contributions
aces+0.007
packs+0.007
lets+0.006
 hips+0.006
 names+0.006
Neg logit contributions
 beast-0.012
 ig-0.012
 hospital-0.012
 crow-0.012
 unfortunate-0.011
TokenO
Token ID46
Feature activation+4.66e+00

Pos logit contributions
packs+0.079
lets+0.076
aces+0.075
 names+0.074
 mattered+0.074
Neg logit contributions
 ig-0.073
 beast-0.073
 hospital-0.071
 desert-0.066
 crow-0.065
Tokenw
Token ID86
Feature activation-5.21e-01

Pos logit contributions
 hospital+0.287
 beast+0.284
 crow+0.276
 ig+0.269
 desert+0.264
Neg logit contributions
packs-0.174
aces-0.161
suits-0.160
lets-0.160
 names-0.141
Token,
Token ID11
Feature activation-2.43e-01

Pos logit contributions
packs+0.026
aces+0.025
lets+0.025
suits+0.024
 names+0.024
Neg logit contributions
 beast-0.025
 hospital-0.025
 crow-0.024
 ig-0.023
 desert-0.022
Token ow
Token ID12334
Feature activation+1.04e+01

Pos logit contributions
 are+0.013
 hands+0.013
 yes+0.013
're+0.012
 names+0.012
Neg logit contributions
 clearing-0.010
 depths-0.009
rig-0.009
 alley-0.009
 cottage-0.008
Token,
Token ID11
Feature activation-2.40e-01

Pos logit contributions
 hospital+0.542
 beast+0.539
 crow+0.524
 unfortunate+0.495
 desert+0.494
Neg logit contributions
packs-0.482
aces-0.451
lets-0.449
suits-0.437
 names-0.416
Token that
Token ID326
Feature activation+6.18e+00

Pos logit contributions
 are+0.014
 yes+0.013
 hands+0.013
're+0.012
 names+0.012
Neg logit contributions
 clearing-0.009
 depths-0.009
rig-0.009
 cottage-0.008
 beast-0.008
Token hurts
Token ID20406
Feature activation+3.15e-01

Pos logit contributions
 beast+0.273
 hospital+0.266
 ig+0.262
 crow+0.254
andel+0.250
Neg logit contributions
packs-0.308
aces-0.297
lets-0.295
 names-0.294
 mattered-0.289
Token!"
Token ID2474
Feature activation-4.66e-01

Pos logit contributions
 beast+0.015
 hospital+0.015
 desert+0.015
 crow+0.015
 cliff+0.015
Neg logit contributions
packs-0.014
lets-0.014
 names-0.014
aces-0.013
suits-0.013
Token She
Token ID1375
Feature activation+2.66e+00

Pos logit contributions
packs+0.017
aces+0.017
 names+0.016
 shirts+0.016
suits+0.015
Neg logit contributions
 beast-0.029
 ig-0.027
 crow-0.027
 hospital-0.026
 unfortunate-0.026
Token,
Token ID11
Feature activation-3.08e-01

Pos logit contributions
 beast+0.145
 crow+0.142
 hospital+0.141
 ig+0.132
 unfortunate+0.128
Neg logit contributions
packs-0.154
aces-0.147
lets-0.144
suits-0.140
 names-0.136
Token "
Token ID366
Feature activation-1.79e+00

Pos logit contributions
aces+0.011
packs+0.011
lets+0.011
 names+0.010
suits+0.010
Neg logit contributions
 beast-0.019
 crow-0.018
 hospital-0.018
 ig-0.017
 desert-0.017
TokenO
Token ID46
Feature activation+4.65e+00

Pos logit contributions
aces+0.089
packs+0.088
lets+0.085
 shirts+0.084
 mattered+0.084
Neg logit contributions
 beast-0.083
 ig-0.081
 crow-0.076
 desert-0.075
 hospital-0.075
Tokenw
Token ID86
Feature activation-6.03e-01

Pos logit contributions
 beast+0.279
 hospital+0.275
 crow+0.275
 ig+0.263
 desert+0.261
Neg logit contributions
packs-0.181
aces-0.171
lets-0.165
suits-0.164
 names-0.148
Token,
Token ID11
Feature activation-3.10e-01

Pos logit contributions
packs+0.033
aces+0.032
lets+0.031
suits+0.030
 names+0.030
Neg logit contributions
 beast-0.027
 crow-0.026
 hospital-0.026
 ig-0.024
 desert-0.023
Token ow
Token ID12334
Feature activation+1.04e+01

Pos logit contributions
 are+0.018
 dolls+0.017
aces+0.017
 hands+0.017
 names+0.017
Neg logit contributions
 clearing-0.011
 depths-0.011
 beast-0.011
 cottage-0.010
 desert-0.010
Token,
Token ID11
Feature activation-2.79e-01

Pos logit contributions
 beast+0.512
 crow+0.499
 hospital+0.499
 ig+0.466
 desert+0.464
Neg logit contributions
packs-0.516
aces-0.491
lets-0.483
suits-0.467
 names-0.452
Token ow
Token ID12334
Feature activation+1.04e+01

Pos logit contributions
 are+0.016
aces+0.016
 dolls+0.016
 hands+0.016
 shirts+0.016
Neg logit contributions
 clearing-0.010
 beast-0.010
 depths-0.009
 desert-0.009
 cottage-0.009
Token!
Token ID0
Feature activation+2.89e+00

Pos logit contributions
 beast+0.466
 hospital+0.465
 crow+0.451
 desert+0.437
 ig+0.430
Neg logit contributions
packs-0.538
aces-0.516
lets-0.511
suits-0.492
 mattered-0.479
Token Sara
Token ID24799
Feature activation+2.79e+00

Pos logit contributions
 beast+0.160
 crow+0.157
 hospital+0.155
 ig+0.147
 unfortunate+0.143
Neg logit contributions
packs-0.127
aces-0.120
lets-0.117
suits-0.114
 names-0.110
Token,
Token ID11
Feature activation-2.60e-01

Pos logit contributions
 beast+0.144
 hospital+0.140
 desert+0.140
 crow+0.140
 ig+0.134
Neg logit contributions
packs-0.125
aces-0.123
lets-0.118
suits-0.110
 pads-0.109
TokenO
Token ID46
Feature activation+4.65e+00

Pos logit contributions
aces+0.089
packs+0.088
lets+0.085
 shirts+0.084
 mattered+0.084
Neg logit contributions
 beast-0.083
 ig-0.081
 crow-0.076
 desert-0.075
 hospital-0.075
Tokenw
Token ID86
Feature activation-6.03e-01

Pos logit contributions
 beast+0.279
 hospital+0.275
 crow+0.275
 ig+0.263
 desert+0.261
Neg logit contributions
packs-0.181
aces-0.171
lets-0.165
suits-0.164
 names-0.148
Token,
Token ID11
Feature activation-3.10e-01

Pos logit contributions
packs+0.033
aces+0.032
lets+0.031
suits+0.030
 names+0.030
Neg logit contributions
 beast-0.027
 crow-0.026
 hospital-0.026
 ig-0.024
 desert-0.023
Token ow
Token ID12334
Feature activation+1.04e+01

Pos logit contributions
 are+0.018
 dolls+0.017
aces+0.017
 hands+0.017
 names+0.017
Neg logit contributions
 clearing-0.011
 depths-0.011
 beast-0.011
 cottage-0.010
 desert-0.010
Token,
Token ID11
Feature activation-2.79e-01

Pos logit contributions
 beast+0.512
 crow+0.499
 hospital+0.499
 ig+0.466
 desert+0.464
Neg logit contributions
packs-0.516
aces-0.491
lets-0.483
suits-0.467
 names-0.452
Token ow
Token ID12334
Feature activation+1.04e+01

Pos logit contributions
 are+0.016
aces+0.016
 dolls+0.016
 hands+0.016
 shirts+0.016
Neg logit contributions
 clearing-0.010
 beast-0.010
 depths-0.009
 desert-0.009
 cottage-0.009
Token!
Token ID0
Feature activation+2.89e+00

Pos logit contributions
 beast+0.466
 hospital+0.465
 crow+0.451
 desert+0.437
 ig+0.430
Neg logit contributions
packs-0.538
aces-0.516
lets-0.511
suits-0.492
 mattered-0.479
Token Sara
Token ID24799
Feature activation+2.79e+00

Pos logit contributions
 beast+0.160
 crow+0.157
 hospital+0.155
 ig+0.147
 unfortunate+0.143
Neg logit contributions
packs-0.127
aces-0.120
lets-0.117
suits-0.114
 names-0.110
Token,
Token ID11
Feature activation-2.60e-01

Pos logit contributions
 beast+0.144
 hospital+0.140
 desert+0.140
 crow+0.140
 ig+0.134
Neg logit contributions
packs-0.125
aces-0.123
lets-0.118
suits-0.110
 pads-0.109
Token you
Token ID345
Feature activation+4.97e-01

Pos logit contributions
aces+0.010
 are+0.010
 hands+0.010
packs+0.009
 shirts+0.009
Neg logit contributions
 beast-0.015
 clearing-0.014
 depths-0.014
 desert-0.014
 crow-0.013
Token hurt
Token ID5938
Feature activation+1.95e+00

Pos logit contributions
 beast+0.033
 crow+0.033
 hospital+0.032
 ig+0.031
 desert+0.030
Neg logit contributions
packs-0.016
aces-0.015
lets-0.014
suits-0.014
 names-0.013
TokenO
Token ID46
Feature activation+4.63e+00

Pos logit contributions
packs+0.077
lets+0.074
aces+0.073
 names+0.072
suits+0.070
Neg logit contributions
 beast-0.085
 hospital-0.081
 crow-0.080
 ig-0.079
 unfortunate-0.076
Tokenw
Token ID86
Feature activation-5.24e-01

Pos logit contributions
 beast+0.281
 crow+0.276
 hospital+0.276
 desert+0.257
 ig+0.256
Neg logit contributions
packs-0.182
aces-0.169
lets-0.166
suits-0.164
 names-0.146
Token,
Token ID11
Feature activation-2.57e-01

Pos logit contributions
packs+0.026
aces+0.025
lets+0.025
suits+0.024
 names+0.023
Neg logit contributions
 beast-0.026
 crow-0.025
 hospital-0.025
 ig-0.023
 desert-0.023
Token ow
Token ID12334
Feature activation+1.04e+01

Pos logit contributions
 are+0.014
 hands+0.012
 dolls+0.012
 names+0.011
're+0.011
Neg logit contributions
 depths-0.013
 clearing-0.013
rig-0.012
 cottage-0.011
 beast-0.011
Token,
Token ID11
Feature activation-2.33e-01

Pos logit contributions
 beast+0.604
 crow+0.591
 hospital+0.589
 desert+0.553
 unfortunate+0.550
Neg logit contributions
packs-0.428
aces-0.392
lets-0.389
suits-0.378
 names-0.352
Token ow
Token ID12334
Feature activation+1.04e+01

Pos logit contributions
 are+0.012
 hands+0.011
 dolls+0.010
 yes+0.010
're+0.010
Neg logit contributions
 clearing-0.011
 depths-0.011
 cottage-0.011
 beast-0.011
rig-0.010
Token.
Token ID13
Feature activation+9.20e-01

Pos logit contributions
 hospital+0.531
 beast+0.526
 crow+0.511
 desert+0.494
 unfortunate+0.482
Neg logit contributions
packs-0.471
lets-0.440
aces-0.434
suits-0.428
es-0.418
Token Stop
Token ID13707
Feature activation+2.81e+00

Pos logit contributions
 beast+0.051
 crow+0.050
 hospital+0.050
 ig+0.047
 unfortunate+0.046
Neg logit contributions
packs-0.040
aces-0.038
lets-0.037
suits-0.035
 names-0.035
Token it
Token ID340
Feature activation-3.70e-01

Pos logit contributions
 beast+0.144
 ig+0.137
 clearing+0.136
 unfortunate+0.136
 hospital+0.133
Neg logit contributions
lets-0.111
 hands-0.110
 hips-0.109
 dolls-0.108
 names-0.107
Token.
Token ID13
Feature activation+8.89e-01

Pos logit contributions
packs+0.018
aces+0.017
lets+0.016
suits+0.016
 names+0.016
Neg logit contributions
 beast-0.019
 crow-0.018
 hospital-0.017
 desert-0.017
 ig-0.017
Token That
Token ID1320
Feature activation+2.11e-02

Pos logit contributions
 beast+0.048
 crow+0.048
 hospital+0.047
 ig+0.044
 unfortunate+0.043
Neg logit contributions
packs-0.040
aces-0.037
lets-0.036
suits-0.036
 journals-0.034
Token,
Token ID11
Feature activation-2.30e-01

Pos logit contributions
 that+0.018
 crow+0.018
 hospital+0.017
 beast+0.017
 ig+0.016
Neg logit contributions
packs-0.030
suits-0.029
lets-0.029
aces-0.029
 pads-0.028
Token "
Token ID366
Feature activation-1.66e+00

Pos logit contributions
packs+0.009
aces+0.009
lets+0.009
suits+0.008
 names+0.008
Neg logit contributions
 beast-0.013
 crow-0.013
 hospital-0.013
 ig-0.012
 unfortunate-0.012
TokenO
Token ID46
Feature activation+4.63e+00

Pos logit contributions
packs+0.077
lets+0.074
aces+0.073
 names+0.072
suits+0.070
Neg logit contributions
 beast-0.085
 hospital-0.081
 crow-0.080
 ig-0.079
 unfortunate-0.076
Tokenw
Token ID86
Feature activation-5.24e-01

Pos logit contributions
 beast+0.281
 crow+0.276
 hospital+0.276
 desert+0.257
 ig+0.256
Neg logit contributions
packs-0.182
aces-0.169
lets-0.166
suits-0.164
 names-0.146
Token,
Token ID11
Feature activation-2.57e-01

Pos logit contributions
packs+0.026
aces+0.025
lets+0.025
suits+0.024
 names+0.023
Neg logit contributions
 beast-0.026
 crow-0.025
 hospital-0.025
 ig-0.023
 desert-0.023
Token ow
Token ID12334
Feature activation+1.04e+01

Pos logit contributions
 are+0.014
 hands+0.012
 dolls+0.012
 names+0.011
're+0.011
Neg logit contributions
 depths-0.013
 clearing-0.013
rig-0.012
 cottage-0.011
 beast-0.011
Token,
Token ID11
Feature activation-2.33e-01

Pos logit contributions
 beast+0.604
 crow+0.591
 hospital+0.589
 desert+0.553
 unfortunate+0.550
Neg logit contributions
packs-0.428
aces-0.392
lets-0.389
suits-0.378
 names-0.352
Token ow
Token ID12334
Feature activation+1.04e+01

Pos logit contributions
 are+0.012
 hands+0.011
 dolls+0.010
 yes+0.010
're+0.010
Neg logit contributions
 clearing-0.011
 depths-0.011
 cottage-0.011
 beast-0.011
rig-0.010
Token.
Token ID13
Feature activation+9.20e-01

Pos logit contributions
 hospital+0.531
 beast+0.526
 crow+0.511
 desert+0.494
 unfortunate+0.482
Neg logit contributions
packs-0.471
lets-0.440
aces-0.434
suits-0.428
es-0.418
Token Stop
Token ID13707
Feature activation+2.81e+00

Pos logit contributions
 beast+0.051
 crow+0.050
 hospital+0.050
 ig+0.047
 unfortunate+0.046
Neg logit contributions
packs-0.040
aces-0.038
lets-0.037
suits-0.035
 names-0.035
Token it
Token ID340
Feature activation-3.70e-01

Pos logit contributions
 beast+0.144
 ig+0.137
 clearing+0.136
 unfortunate+0.136
 hospital+0.133
Neg logit contributions
lets-0.111
 hands-0.110
 hips-0.109
 dolls-0.108
 names-0.107
TokenO
Token ID46
Feature activation+4.60e+00

Pos logit contributions
packs+0.235
aces+0.228
lets+0.223
suits+0.212
 names+0.208
Neg logit contributions
 crow-0.155
 beast-0.153
 hospital-0.145
 ig-0.131
 unfortunate-0.129
Tokenw
Token ID86
Feature activation-5.38e-01

Pos logit contributions
 beast+0.277
 hospital+0.274
 crow+0.270
 ig+0.259
 desert+0.255
Neg logit contributions
packs-0.178
aces-0.166
suits-0.164
lets-0.162
 names-0.146
Token,
Token ID11
Feature activation-2.08e-01

Pos logit contributions
packs+0.025
aces+0.025
lets+0.024
suits+0.023
 names+0.023
Neg logit contributions
 beast-0.027
 hospital-0.027
 crow-0.027
 ig-0.025
 desert-0.025
Token ow
Token ID12334
Feature activation+1.04e+01

Pos logit contributions
 are+0.012
 hands+0.011
 names+0.011
 dolls+0.011
 forks+0.011
Neg logit contributions
 clearing-0.008
 beast-0.007
 depths-0.007
rig-0.007
 alley-0.007
Token,
Token ID11
Feature activation-1.90e-01

Pos logit contributions
 beast+0.587
 hospital+0.577
 crow+0.572
 unfortunate+0.536
 desert+0.535
Neg logit contributions
packs-0.442
aces-0.409
lets-0.404
suits-0.395
 names-0.373
Token ow
Token ID12334
Feature activation+1.04e+01

Pos logit contributions
 are+0.011
 hands+0.011
 names+0.010
 dolls+0.010
 shirts+0.010
Neg logit contributions
 beast-0.007
 clearing-0.007
 depths-0.006
 desert-0.006
rig-0.006
Token!"
Token ID2474
Feature activation+4.25e-01

Pos logit contributions
 hospital+0.563
 beast+0.546
 crow+0.527
 desert+0.519
 unfortunate+0.513
Neg logit contributions
packs-0.450
lets-0.418
suits-0.412
aces-0.410
 pads-0.376
Token Ben
Token ID3932
Feature activation+2.86e-01

Pos logit contributions
 beast+0.031
 eagle+0.029
 crow+0.029
 ig+0.029
asis+0.029
Neg logit contributions
packs-0.011
 shirts-0.009
 names-0.009
 hands-0.009
 hips-0.009
Token cried
Token ID16896
Feature activation+1.52e+00

Pos logit contributions
 beast+0.016
 hospital+0.015
 crow+0.015
 ig+0.015
 desert+0.015
Neg logit contributions
packs-0.012
aces-0.012
suits-0.011
lets-0.011
 names-0.011
Token.
Token ID13
Feature activation+1.10e+00

Pos logit contributions
 beast+0.080
 crow+0.078
 hospital+0.076
 ig+0.073
 desert+0.071
Neg logit contributions
packs-0.071
aces-0.068
lets-0.066
suits-0.064
 names-0.063
Token "
Token ID366
Feature activation-1.62e+00

Pos logit contributions
 beast+0.069
 crow+0.067
 hospital+0.066
 ig+0.066
 unfortunate+0.064
Neg logit contributions
packs-0.038
aces-0.037
 names-0.036
suits-0.035
lets-0.033
Token\n
Token ID198
Feature activation+9.04e-01

Pos logit contributions
 beast+0.067
 crow+0.064
 eagle+0.062
 hospital+0.060
 hawk+0.060
Neg logit contributions
packs-0.029
 names-0.026
suits-0.024
aces-0.022
 interests-0.022
Token"
Token ID1
Feature activation-3.82e+00

Pos logit contributions
 beast+0.054
 crow+0.052
 hospital+0.051
 ig+0.050
 eagle+0.049
Neg logit contributions
packs-0.037
suits-0.034
lets-0.033
aces-0.033
 names-0.032
TokenO
Token ID46
Feature activation+4.60e+00

Pos logit contributions
packs+0.235
aces+0.228
lets+0.223
suits+0.212
 names+0.208
Neg logit contributions
 crow-0.155
 beast-0.153
 hospital-0.145
 ig-0.131
 unfortunate-0.129
Tokenw
Token ID86
Feature activation-5.38e-01

Pos logit contributions
 beast+0.277
 hospital+0.274
 crow+0.270
 ig+0.259
 desert+0.255
Neg logit contributions
packs-0.178
aces-0.166
suits-0.164
lets-0.162
 names-0.146
Token,
Token ID11
Feature activation-2.08e-01

Pos logit contributions
packs+0.025
aces+0.025
lets+0.024
suits+0.023
 names+0.023
Neg logit contributions
 beast-0.027
 hospital-0.027
 crow-0.027
 ig-0.025
 desert-0.025
Token ow
Token ID12334
Feature activation+1.04e+01

Pos logit contributions
 are+0.012
 hands+0.011
 names+0.011
 dolls+0.011
 forks+0.011
Neg logit contributions
 clearing-0.008
 beast-0.007
 depths-0.007
rig-0.007
 alley-0.007
Token,
Token ID11
Feature activation-1.90e-01

Pos logit contributions
 beast+0.587
 hospital+0.577
 crow+0.572
 unfortunate+0.536
 desert+0.535
Neg logit contributions
packs-0.442
aces-0.409
lets-0.404
suits-0.395
 names-0.373
Token ow
Token ID12334
Feature activation+1.04e+01

Pos logit contributions
 are+0.011
 hands+0.011
 names+0.010
 dolls+0.010
 shirts+0.010
Neg logit contributions
 beast-0.007
 clearing-0.007
 depths-0.006
 desert-0.006
rig-0.006
Token!"
Token ID2474
Feature activation+4.25e-01

Pos logit contributions
 hospital+0.563
 beast+0.546
 crow+0.527
 desert+0.519
 unfortunate+0.513
Neg logit contributions
packs-0.450
lets-0.418
suits-0.412
aces-0.410
 pads-0.376
Token Ben
Token ID3932
Feature activation+2.86e-01

Pos logit contributions
 beast+0.031
 eagle+0.029
 crow+0.029
 ig+0.029
asis+0.029
Neg logit contributions
packs-0.011
 shirts-0.009
 names-0.009
 hands-0.009
 hips-0.009
Token cried
Token ID16896
Feature activation+1.52e+00

Pos logit contributions
 beast+0.016
 hospital+0.015
 crow+0.015
 ig+0.015
 desert+0.015
Neg logit contributions
packs-0.012
aces-0.012
suits-0.011
lets-0.011
 names-0.011
TokenO
Token ID46
Feature activation+4.64e+00

Pos logit contributions
packs+0.211
aces+0.205
lets+0.200
suits+0.196
 names+0.196
Neg logit contributions
 beast-0.159
 hospital-0.154
 crow-0.150
 ig-0.149
 desert-0.143
Tokenw
Token ID86
Feature activation-5.57e-01

Pos logit contributions
 hospital+0.292
 beast+0.287
 crow+0.282
 ig+0.277
 desert+0.274
Neg logit contributions
packs-0.163
aces-0.157
suits-0.151
lets-0.150
 pads-0.133
Token,
Token ID11
Feature activation-1.95e-01

Pos logit contributions
packs+0.026
aces+0.025
lets+0.025
suits+0.023
 names+0.023
Neg logit contributions
 hospital-0.029
 beast-0.029
 crow-0.028
 ig-0.027
 desert-0.027
Token ow
Token ID12334
Feature activation+1.04e+01

Pos logit contributions
 are+0.011
 hands+0.010
 dolls+0.010
 yes+0.010
aces+0.010
Neg logit contributions
 clearing-0.008
 depths-0.008
rig-0.008
 cottage-0.007
 desert-0.007
Token,
Token ID11
Feature activation-1.76e-01

Pos logit contributions
 hospital+0.606
 beast+0.605
 crow+0.592
 desert+0.568
 unfortunate+0.565
Neg logit contributions
packs-0.407
aces-0.379
lets-0.370
suits-0.358
 pads-0.335
Token ow
Token ID12334
Feature activation+1.04e+01

Pos logit contributions
 are+0.010
 hands+0.010
 yes+0.009
aces+0.009
 dolls+0.009
Neg logit contributions
 clearing-0.007
 depths-0.007
 cottage-0.007
 desert-0.006
rig-0.006
Token!
Token ID0
Feature activation+3.08e+00

Pos logit contributions
 hospital+0.550
 desert+0.518
 beast+0.514
 crow+0.500
 unfortunate+0.498
Neg logit contributions
packs-0.433
aces-0.417
lets-0.411
es-0.401
suits-0.395
Token The
Token ID383
Feature activation+1.02e+00

Pos logit contributions
 beast+0.184
 hospital+0.180
 crow+0.178
 desert+0.174
 ig+0.172
Neg logit contributions
aces-0.114
packs-0.112
lets-0.106
 names-0.105
suits-0.097
Token dog
Token ID3290
Feature activation+3.21e+00

Pos logit contributions
 beast+0.049
 ig+0.047
 crow+0.047
 hospital+0.046
 desert+0.046
Neg logit contributions
 names-0.046
packs-0.045
 shirts-0.044
aces-0.044
suits-0.043
Token bit
Token ID1643
Feature activation+3.22e+00

Pos logit contributions
 desert+0.198
andel+0.193
 beast+0.193
 hospital+0.192
 ig+0.190
Neg logit contributions
aces-0.100
packs-0.099
lets-0.096
suits-0.088
 names-0.088
Token me
Token ID502
Feature activation+4.94e+00

Pos logit contributions
 beast+0.159
 crow+0.156
 paramed+0.145
 hospital+0.145
 eagle+0.143
Neg logit contributions
 hands-0.158
packs-0.145
 shirts-0.144
 forks-0.139
 skirts-0.135
Token\n
Token ID198
Feature activation+7.71e-01

Pos logit contributions
 beast+0.061
 crow+0.058
 eagle+0.058
 hawk+0.055
 hospital+0.055
Neg logit contributions
packs-0.022
 names-0.019
suits-0.019
 interests-0.016
aces-0.016
Token"
Token ID1
Feature activation-3.82e+00

Pos logit contributions
 beast+0.043
 crow+0.040
 hospital+0.039
 ig+0.039
 eagle+0.039
Neg logit contributions
packs-0.034
suits-0.033
lets-0.032
aces-0.031
 pads-0.030
TokenO
Token ID46
Feature activation+4.64e+00

Pos logit contributions
packs+0.211
aces+0.205
lets+0.200
suits+0.196
 names+0.196
Neg logit contributions
 beast-0.159
 hospital-0.154
 crow-0.150
 ig-0.149
 desert-0.143
Tokenw
Token ID86
Feature activation-5.57e-01

Pos logit contributions
 hospital+0.292
 beast+0.287
 crow+0.282
 ig+0.277
 desert+0.274
Neg logit contributions
packs-0.163
aces-0.157
suits-0.151
lets-0.150
 pads-0.133
Token,
Token ID11
Feature activation-1.95e-01

Pos logit contributions
packs+0.026
aces+0.025
lets+0.025
suits+0.023
 names+0.023
Neg logit contributions
 hospital-0.029
 beast-0.029
 crow-0.028
 ig-0.027
 desert-0.027
Token ow
Token ID12334
Feature activation+1.04e+01

Pos logit contributions
 are+0.011
 hands+0.010
 dolls+0.010
 yes+0.010
aces+0.010
Neg logit contributions
 clearing-0.008
 depths-0.008
rig-0.008
 cottage-0.007
 desert-0.007
Token,
Token ID11
Feature activation-1.76e-01

Pos logit contributions
 hospital+0.606
 beast+0.605
 crow+0.592
 desert+0.568
 unfortunate+0.565
Neg logit contributions
packs-0.407
aces-0.379
lets-0.370
suits-0.358
 pads-0.335
Token ow
Token ID12334
Feature activation+1.04e+01

Pos logit contributions
 are+0.010
 hands+0.010
 yes+0.009
aces+0.009
 dolls+0.009
Neg logit contributions
 clearing-0.007
 depths-0.007
 cottage-0.007
 desert-0.006
rig-0.006
Token!
Token ID0
Feature activation+3.08e+00

Pos logit contributions
 hospital+0.550
 desert+0.518
 beast+0.514
 crow+0.500
 unfortunate+0.498
Neg logit contributions
packs-0.433
aces-0.417
lets-0.411
es-0.401
suits-0.395
Token The
Token ID383
Feature activation+1.02e+00

Pos logit contributions
 beast+0.184
 hospital+0.180
 crow+0.178
 desert+0.174
 ig+0.172
Neg logit contributions
aces-0.114
packs-0.112
lets-0.106
 names-0.105
suits-0.097
Token dog
Token ID3290
Feature activation+3.21e+00

Pos logit contributions
 beast+0.049
 ig+0.047
 crow+0.047
 hospital+0.046
 desert+0.046
Neg logit contributions
 names-0.046
packs-0.045
 shirts-0.044
aces-0.044
suits-0.043
Token play
Token ID711
Feature activation-4.72e-01

Pos logit contributions
packs+0.027
aces+0.026
lets+0.025
 names+0.024
suits+0.024
Neg logit contributions
 beast-0.032
 crow-0.031
 hospital-0.030
 ig-0.029
 desert-0.028
Token-
Token ID12
Feature activation+3.90e+00

Pos logit contributions
packs+0.021
aces+0.021
lets+0.019
suits+0.019
 names+0.019
Neg logit contributions
 beast-0.025
 crow-0.025
 hospital-0.023
 unfortunate-0.023
 ig-0.023
Tokendate
Token ID4475
Feature activation-2.68e-01

Pos logit contributions
 beast+0.194
 crow+0.188
 hospital+0.186
 ig+0.177
 desert+0.171
Neg logit contributions
packs-0.192
aces-0.186
lets-0.181
suits-0.175
 names-0.173
Token with
Token ID351
Feature activation-6.66e-01

Pos logit contributions
packs+0.015
aces+0.014
lets+0.014
suits+0.014
 names+0.014
Neg logit contributions
 beast-0.011
 hospital-0.011
 crow-0.011
 ig-0.010
 unfortunate-0.010
Token R
Token ID371
Feature activation+3.73e+00

Pos logit contributions
packs+0.036
aces+0.035
 names+0.034
lets+0.034
 shirts+0.034
Neg logit contributions
 beast-0.029
 crow-0.028
 hospital-0.027
 ig-0.025
 unfortunate-0.025
Tokenuf
Token ID3046
Feature activation+1.04e+01

Pos logit contributions
 hospital+0.110
 beast+0.109
 crow+0.105
 ig+0.098
 unfortunate+0.097
Neg logit contributions
aces-0.253
packs-0.249
lets-0.240
 names-0.237
suits-0.234
Tokenus
Token ID385
Feature activation+5.60e+00

Pos logit contributions
 beast+0.313
 hospital+0.313
 crow+0.301
 ig+0.278
 unfortunate+0.273
Neg logit contributions
packs-0.705
aces-0.686
lets-0.672
suits-0.655
 names-0.641
Token now
Token ID783
Feature activation+3.53e+00

Pos logit contributions
 beast+0.259
 crow+0.251
 hospital+0.243
 ig+0.233
 that+0.225
Neg logit contributions
packs-0.286
aces-0.269
suits-0.261
lets-0.261
 journals-0.256
Token."
Token ID526
Feature activation-2.15e+00

Pos logit contributions
 beast+0.169
 crow+0.165
 hospital+0.161
 ig+0.154
 desert+0.149
Neg logit contributions
packs-0.179
aces-0.173
lets-0.168
suits-0.163
 names-0.161
Token El
Token ID2574
Feature activation+3.87e+00

Pos logit contributions
packs+0.095
aces+0.091
 names+0.090
lets+0.089
suits+0.086
Neg logit contributions
 beast-0.120
 crow-0.116
 hospital-0.115
 unfortunate-0.111
 ig-0.108
Tokenle
Token ID293
Feature activation-1.44e+00

Pos logit contributions
 beast+0.208
 crow+0.205
 hospital+0.204
 ig+0.194
 unfortunate+0.187
Neg logit contributions
packs-0.171
aces-0.165
lets-0.159
suits-0.157
 names-0.151
Token,
Token ID11
Feature activation-1.53e-01

Pos logit contributions
 that+0.019
 crow+0.017
 hospital+0.017
 beast+0.017
 ig+0.015
Neg logit contributions
packs-0.031
aces-0.029
suits-0.029
lets-0.029
 pads-0.028
Token "
Token ID366
Feature activation-1.77e+00

Pos logit contributions
packs+0.007
aces+0.007
lets+0.007
suits+0.006
 journals+0.006
Neg logit contributions
 beast-0.008
 crow-0.008
 hospital-0.008
 ig-0.007
 eagle-0.007
TokenO
Token ID46
Feature activation+4.69e+00

Pos logit contributions
packs+0.076
lets+0.071
aces+0.071
suits+0.069
 names+0.068
Neg logit contributions
 beast-0.096
 hospital-0.092
 crow-0.091
 ig-0.091
 desert-0.088
Tokenw
Token ID86
Feature activation-4.15e-01

Pos logit contributions
 beast+0.278
 crow+0.273
 hospital+0.271
 ig+0.256
 desert+0.254
Neg logit contributions
packs-0.189
aces-0.176
lets-0.173
suits-0.170
 names-0.158
Token,
Token ID11
Feature activation-2.10e-01

Pos logit contributions
packs+0.020
aces+0.019
lets+0.019
suits+0.018
 names+0.018
Neg logit contributions
 beast-0.021
 crow-0.020
 hospital-0.020
 ig-0.019
 desert-0.019
Token ow
Token ID12334
Feature activation+1.04e+01

Pos logit contributions
 are+0.011
 hands+0.011
 names+0.011
 dolls+0.010
 yes+0.010
Neg logit contributions
 clearing-0.009
 depths-0.009
 beast-0.008
 desert-0.008
andel-0.008
Token,
Token ID11
Feature activation-1.98e-01

Pos logit contributions
 beast+0.547
 crow+0.537
 hospital+0.533
 desert+0.496
 ig+0.494
Neg logit contributions
packs-0.481
aces-0.450
lets-0.443
suits-0.430
 names-0.414
Token ow
Token ID12334
Feature activation+1.04e+01

Pos logit contributions
 are+0.010
 yes+0.010
 hands+0.009
 names+0.009
 dolls+0.009
Neg logit contributions
 clearing-0.009
 depths-0.008
 beast-0.008
 desert-0.008
andel-0.008
Token!
Token ID0
Feature activation+2.70e+00

Pos logit contributions
 hospital+0.437
 beast+0.435
 crow+0.428
 desert+0.403
 unfortunate+0.393
Neg logit contributions
packs-0.569
lets-0.533
aces-0.530
suits-0.520
 mattered-0.494
Token Please
Token ID4222
Feature activation+1.34e+00

Pos logit contributions
 beast+0.151
 crow+0.147
 hospital+0.146
 ig+0.139
 unfortunate+0.134
Neg logit contributions
packs-0.118
aces-0.112
lets-0.108
suits-0.106
 names-0.102
Token stop
Token ID2245
Feature activation+2.64e+00

Pos logit contributions
 beast+0.070
 crow+0.068
 hospital+0.066
 ig+0.063
 desert+0.063
Neg logit contributions
packs-0.062
aces-0.059
lets-0.059
 names-0.057
suits-0.057
TokenO
Token ID46
Feature activation+4.69e+00

Pos logit contributions
packs+0.076
lets+0.071
aces+0.071
suits+0.069
 names+0.068
Neg logit contributions
 beast-0.096
 hospital-0.092
 crow-0.091
 ig-0.091
 desert-0.088
Tokenw
Token ID86
Feature activation-4.15e-01

Pos logit contributions
 beast+0.278
 crow+0.273
 hospital+0.271
 ig+0.256
 desert+0.254
Neg logit contributions
packs-0.189
aces-0.176
lets-0.173
suits-0.170
 names-0.158
Token,
Token ID11
Feature activation-2.10e-01

Pos logit contributions
packs+0.020
aces+0.019
lets+0.019
suits+0.018
 names+0.018
Neg logit contributions
 beast-0.021
 crow-0.020
 hospital-0.020
 ig-0.019
 desert-0.019
Token ow
Token ID12334
Feature activation+1.04e+01

Pos logit contributions
 are+0.011
 hands+0.011
 names+0.011
 dolls+0.010
 yes+0.010
Neg logit contributions
 clearing-0.009
 depths-0.009
 beast-0.008
 desert-0.008
andel-0.008
Token,
Token ID11
Feature activation-1.98e-01

Pos logit contributions
 beast+0.547
 crow+0.537
 hospital+0.533
 desert+0.496
 ig+0.494
Neg logit contributions
packs-0.481
aces-0.450
lets-0.443
suits-0.430
 names-0.414
Token ow
Token ID12334
Feature activation+1.04e+01

Pos logit contributions
 are+0.010
 yes+0.010
 hands+0.009
 names+0.009
 dolls+0.009
Neg logit contributions
 clearing-0.009
 depths-0.008
 beast-0.008
 desert-0.008
andel-0.008
Token!
Token ID0
Feature activation+2.70e+00

Pos logit contributions
 hospital+0.437
 beast+0.435
 crow+0.428
 desert+0.403
 unfortunate+0.393
Neg logit contributions
packs-0.569
lets-0.533
aces-0.530
suits-0.520
 mattered-0.494
Token Please
Token ID4222
Feature activation+1.34e+00

Pos logit contributions
 beast+0.151
 crow+0.147
 hospital+0.146
 ig+0.139
 unfortunate+0.134
Neg logit contributions
packs-0.118
aces-0.112
lets-0.108
suits-0.106
 names-0.102
Token stop
Token ID2245
Feature activation+2.64e+00

Pos logit contributions
 beast+0.070
 crow+0.068
 hospital+0.066
 ig+0.063
 desert+0.063
Neg logit contributions
packs-0.062
aces-0.059
lets-0.059
 names-0.057
suits-0.057
Token,
Token ID11
Feature activation-1.34e-01

Pos logit contributions
 beast+0.134
 crow+0.131
 unfortunate+0.127
andel+0.125
 hospital+0.125
Neg logit contributions
packs-0.114
 hands-0.112
lets-0.109
 names-0.108
 hips-0.106
Token please
Token ID3387
Feature activation+1.32e+00

Pos logit contributions
 yes+0.005
 names+0.005
 are+0.005
 swap+0.005
 hands+0.004
Neg logit contributions
 desert-0.007
 depths-0.007
 beast-0.007
 clearing-0.007
andel-0.007
TokenO
Token ID46
Feature activation+4.54e+00

Pos logit contributions
packs+0.226
aces+0.215
suits+0.211
lets+0.210
 names+0.206
Neg logit contributions
 beast-0.151
 hospital-0.145
 crow-0.141
 ig-0.140
 desert-0.136
Tokenw
Token ID86
Feature activation-6.16e-01

Pos logit contributions
 hospital+0.278
 beast+0.277
 crow+0.273
 ig+0.265
 desert+0.262
Neg logit contributions
packs-0.165
aces-0.157
suits-0.155
lets-0.151
 pads-0.135
Token,
Token ID11
Feature activation-2.36e-01

Pos logit contributions
packs+0.028
aces+0.028
lets+0.027
suits+0.025
 names+0.025
Neg logit contributions
 hospital-0.032
 beast-0.032
 crow-0.031
 desert-0.030
 ig-0.030
Token ow
Token ID12334
Feature activation+1.03e+01

Pos logit contributions
 are+0.014
 hands+0.013
 dolls+0.012
 yes+0.012
 names+0.012
Neg logit contributions
 depths-0.009
rig-0.009
 clearing-0.009
 desert-0.008
andel-0.008
Token,
Token ID11
Feature activation-2.34e-01

Pos logit contributions
 hospital+0.601
 beast+0.600
 crow+0.587
 desert+0.559
 unfortunate+0.558
Neg logit contributions
packs-0.413
aces-0.378
lets-0.376
suits-0.367
 names-0.334
Token ow
Token ID12334
Feature activation+1.03e+01

Pos logit contributions
 are+0.014
 hands+0.013
 yes+0.012
 dolls+0.012
're+0.012
Neg logit contributions
 depths-0.009
 clearing-0.009
 desert-0.008
rig-0.008
 cottage-0.008
Token!"
Token ID2474
Feature activation+1.00e-01

Pos logit contributions
 hospital+0.556
 beast+0.529
 desert+0.516
 crow+0.516
 unfortunate+0.505
Neg logit contributions
packs-0.437
lets-0.412
aces-0.406
suits-0.404
es-0.375
Token Tom
Token ID4186
Feature activation-1.73e-01

Pos logit contributions
 beast+0.007
 eagle+0.007
asis+0.007
 ig+0.007
 crow+0.007
Neg logit contributions
packs-0.002
 shirts-0.002
aces-0.002
 names-0.002
suits-0.002
Token screamed
Token ID25421
Feature activation+5.65e-01

Pos logit contributions
aces+0.007
packs+0.006
lets+0.006
suits+0.006
 pads+0.006
Neg logit contributions
 beast-0.010
 hospital-0.010
 desert-0.010
 ig-0.010
 cliff-0.010
Token.
Token ID13
Feature activation+1.11e+00

Pos logit contributions
 beast+0.033
 crow+0.032
 hospital+0.031
 desert+0.030
 ig+0.030
Neg logit contributions
packs-0.023
aces-0.022
lets-0.021
suits-0.021
 names-0.021
Token "
Token ID366
Feature activation-1.64e+00

Pos logit contributions
 beast+0.072
 ig+0.070
 crow+0.069
 desert+0.068
 hospital+0.068
Neg logit contributions
aces-0.035
packs-0.033
 names-0.032
suits-0.032
 hips-0.030
Token\n
Token ID198
Feature activation+6.76e-01

Pos logit contributions
 beast+0.050
 crow+0.048
 eagle+0.047
 hospital+0.046
 desert+0.045
Neg logit contributions
packs-0.022
 names-0.019
suits-0.019
aces-0.018
 interests-0.017
Token"
Token ID1
Feature activation-3.92e+00

Pos logit contributions
 beast+0.038
 crow+0.036
 hospital+0.035
 ig+0.034
 eagle+0.034
Neg logit contributions
packs-0.030
suits-0.029
lets-0.028
aces-0.028
 names-0.026
TokenO
Token ID46
Feature activation+4.54e+00

Pos logit contributions
packs+0.226
aces+0.215
suits+0.211
lets+0.210
 names+0.206
Neg logit contributions
 beast-0.151
 hospital-0.145
 crow-0.141
 ig-0.140
 desert-0.136
Tokenw
Token ID86
Feature activation-6.16e-01

Pos logit contributions
 hospital+0.278
 beast+0.277
 crow+0.273
 ig+0.265
 desert+0.262
Neg logit contributions
packs-0.165
aces-0.157
suits-0.155
lets-0.151
 pads-0.135
Token,
Token ID11
Feature activation-2.36e-01

Pos logit contributions
packs+0.028
aces+0.028
lets+0.027
suits+0.025
 names+0.025
Neg logit contributions
 hospital-0.032
 beast-0.032
 crow-0.031
 desert-0.030
 ig-0.030
Token ow
Token ID12334
Feature activation+1.03e+01

Pos logit contributions
 are+0.014
 hands+0.013
 dolls+0.012
 yes+0.012
 names+0.012
Neg logit contributions
 depths-0.009
rig-0.009
 clearing-0.009
 desert-0.008
andel-0.008
Token,
Token ID11
Feature activation-2.34e-01

Pos logit contributions
 hospital+0.601
 beast+0.600
 crow+0.587
 desert+0.559
 unfortunate+0.558
Neg logit contributions
packs-0.413
aces-0.378
lets-0.376
suits-0.367
 names-0.334
Token ow
Token ID12334
Feature activation+1.03e+01

Pos logit contributions
 are+0.014
 hands+0.013
 yes+0.012
 dolls+0.012
're+0.012
Neg logit contributions
 depths-0.009
 clearing-0.009
 desert-0.008
rig-0.008
 cottage-0.008
Token!"
Token ID2474
Feature activation+1.00e-01

Pos logit contributions
 hospital+0.556
 beast+0.529
 desert+0.516
 crow+0.516
 unfortunate+0.505
Neg logit contributions
packs-0.437
lets-0.412
aces-0.406
suits-0.404
es-0.375
Token Tom
Token ID4186
Feature activation-1.73e-01

Pos logit contributions
 beast+0.007
 eagle+0.007
asis+0.007
 ig+0.007
 crow+0.007
Neg logit contributions
packs-0.002
 shirts-0.002
aces-0.002
 names-0.002
suits-0.002
Token screamed
Token ID25421
Feature activation+5.65e-01

Pos logit contributions
aces+0.007
packs+0.006
lets+0.006
suits+0.006
 pads+0.006
Neg logit contributions
 beast-0.010
 hospital-0.010
 desert-0.010
 ig-0.010
 cliff-0.010
Token\n
Token ID198
Feature activation+7.20e-01

Pos logit contributions
 beast+0.054
 crow+0.051
 eagle+0.049
 hospital+0.049
 ig+0.049
Neg logit contributions
packs-0.025
suits-0.021
 names-0.021
aces-0.020
lets-0.019
Token"
Token ID1
Feature activation-3.91e+00

Pos logit contributions
 beast+0.040
 crow+0.038
 hospital+0.038
 ig+0.037
 desert+0.035
Neg logit contributions
packs-0.032
aces-0.031
lets-0.030
suits-0.030
 names-0.028
TokenO
Token ID46
Feature activation+4.55e+00

Pos logit contributions
packs+0.236
aces+0.228
lets+0.223
suits+0.218
 names+0.215
Neg logit contributions
 beast-0.154
 crow-0.150
 hospital-0.146
 ig-0.136
 unfortunate-0.131
Tokenw
Token ID86
Feature activation-5.65e-01

Pos logit contributions
 beast+0.276
 hospital+0.275
 crow+0.268
 ig+0.261
 desert+0.257
Neg logit contributions
packs-0.172
aces-0.164
lets-0.161
suits-0.160
 pads-0.141
Token,
Token ID11
Feature activation-1.80e-01

Pos logit contributions
packs+0.027
aces+0.027
lets+0.026
suits+0.025
 names+0.025
Neg logit contributions
 beast-0.028
 hospital-0.028
 crow-0.027
 ig-0.026
 desert-0.026
Token ow
Token ID12334
Feature activation+1.03e+01

Pos logit contributions
 are+0.010
 hands+0.010
 names+0.010
 dolls+0.009
 yes+0.009
Neg logit contributions
 depths-0.007
 clearing-0.007
rig-0.006
 beast-0.006
 desert-0.006
Token,
Token ID11
Feature activation-1.88e-01

Pos logit contributions
 beast+0.602
 hospital+0.594
 crow+0.583
 unfortunate+0.554
 desert+0.553
Neg logit contributions
packs-0.418
aces-0.389
lets-0.385
suits-0.373
 names-0.348
Token ow
Token ID12334
Feature activation+1.03e+01

Pos logit contributions
 are+0.011
 hands+0.010
 yes+0.010
aces+0.010
 names+0.010
Neg logit contributions
 depths-0.007
 clearing-0.007
 beast-0.007
 desert-0.006
 cottage-0.006
Token!"
Token ID2474
Feature activation+1.10e-01

Pos logit contributions
 hospital+0.578
 beast+0.563
 desert+0.540
 crow+0.537
 unfortunate+0.531
Neg logit contributions
packs-0.412
lets-0.395
aces-0.389
suits-0.378
es-0.350
Token Lily
Token ID20037
Feature activation-4.61e-01

Pos logit contributions
 beast+0.008
 ig+0.008
 crow+0.007
asis+0.007
 eagle+0.007
Neg logit contributions
aces-0.003
packs-0.003
 shirts-0.002
 names-0.002
 hands-0.002
Token screamed
Token ID25421
Feature activation+6.25e-01

Pos logit contributions
packs+0.020
aces+0.020
lets+0.019
suits+0.018
 names+0.018
Neg logit contributions
 beast-0.025
 hospital-0.024
 crow-0.024
 ig-0.024
 desert-0.023
Token\n
Token ID198
Feature activation+3.09e-01

Pos logit contributions
 beast+0.021
 crow+0.021
 eagle+0.020
 hospital+0.020
 desert+0.020
Neg logit contributions
packs-0.012
 names-0.011
suits-0.010
aces-0.010
lets-0.010
Token"
Token ID1
Feature activation-4.22e+00

Pos logit contributions
 beast+0.017
 crow+0.016
 ig+0.016
 desert+0.016
 eagle+0.016
Neg logit contributions
packs-0.012
aces-0.012
suits-0.012
 names-0.012
lets-0.011
TokenO
Token ID46
Feature activation+4.49e+00

Pos logit contributions
packs+0.201
aces+0.191
lets+0.184
 mattered+0.183
 names+0.182
Neg logit contributions
 beast-0.194
 hospital-0.186
 ig-0.185
 crow-0.178
 desert-0.176
Tokenw
Token ID86
Feature activation-5.89e-01

Pos logit contributions
 hospital+0.282
 beast+0.280
 crow+0.269
 ig+0.269
 desert+0.263
Neg logit contributions
packs-0.153
lets-0.146
aces-0.146
suits-0.143
 mattered-0.128
Token,
Token ID11
Feature activation-4.95e-01

Pos logit contributions
packs+0.027
aces+0.027
lets+0.026
suits+0.024
 names+0.024
Neg logit contributions
 beast-0.031
 hospital-0.030
 crow-0.030
 desert-0.028
 ig-0.028
Token ow
Token ID12334
Feature activation+1.03e+01

Pos logit contributions
 are+0.027
 hands+0.026
 dolls+0.025
 yes+0.025
 names+0.024
Neg logit contributions
 clearing-0.020
 depths-0.020
rig-0.020
 cottage-0.018
 desert-0.018
Token,
Token ID11
Feature activation-4.80e-01

Pos logit contributions
 hospital+0.597
 beast+0.593
 crow+0.570
 desert+0.552
 unfortunate+0.552
Neg logit contributions
packs-0.410
lets-0.382
aces-0.380
suits-0.367
 mattered-0.340
Token ow
Token ID12334
Feature activation+1.03e+01

Pos logit contributions
 are+0.027
 hands+0.026
 yes+0.025
're+0.024
 names+0.024
Neg logit contributions
 clearing-0.020
 depths-0.019
 cottage-0.018
rig-0.018
 desert-0.017
Token!
Token ID0
Feature activation+2.56e+00

Pos logit contributions
 hospital+0.573
 beast+0.542
 desert+0.525
 unfortunate+0.517
 ig+0.513
Neg logit contributions
packs-0.408
lets-0.397
aces-0.381
suits-0.379
es-0.361
Token That
Token ID1320
Feature activation-6.79e-02

Pos logit contributions
 beast+0.143
 crow+0.138
 hospital+0.138
 desert+0.135
 ig+0.133
Neg logit contributions
packs-0.102
aces-0.102
lets-0.099
 names-0.096
suits-0.092
Token goose
Token ID37246
Feature activation+3.67e+00

Pos logit contributions
lets+0.003
 names+0.003
packs+0.003
 mattered+0.002
aces+0.002
Neg logit contributions
 beast-0.004
andel-0.004
 hospital-0.004
 desert-0.003
 ig-0.003
TokenO
Token ID46
Feature activation+4.55e+00

Pos logit contributions
packs+0.236
aces+0.228
lets+0.223
suits+0.218
 names+0.215
Neg logit contributions
 beast-0.154
 crow-0.150
 hospital-0.146
 ig-0.136
 unfortunate-0.131
Tokenw
Token ID86
Feature activation-5.65e-01

Pos logit contributions
 beast+0.276
 hospital+0.275
 crow+0.268
 ig+0.261
 desert+0.257
Neg logit contributions
packs-0.172
aces-0.164
lets-0.161
suits-0.160
 pads-0.141
Token,
Token ID11
Feature activation-1.80e-01

Pos logit contributions
packs+0.027
aces+0.027
lets+0.026
suits+0.025
 names+0.025
Neg logit contributions
 beast-0.028
 hospital-0.028
 crow-0.027
 ig-0.026
 desert-0.026
Token ow
Token ID12334
Feature activation+1.03e+01

Pos logit contributions
 are+0.010
 hands+0.010
 names+0.010
 dolls+0.009
 yes+0.009
Neg logit contributions
 depths-0.007
 clearing-0.007
rig-0.006
 beast-0.006
 desert-0.006
Token,
Token ID11
Feature activation-1.88e-01

Pos logit contributions
 beast+0.602
 hospital+0.594
 crow+0.583
 unfortunate+0.554
 desert+0.553
Neg logit contributions
packs-0.418
aces-0.389
lets-0.385
suits-0.373
 names-0.348
Token ow
Token ID12334
Feature activation+1.03e+01

Pos logit contributions
 are+0.011
 hands+0.010
 yes+0.010
aces+0.010
 names+0.010
Neg logit contributions
 depths-0.007
 clearing-0.007
 beast-0.007
 desert-0.006
 cottage-0.006
Token!"
Token ID2474
Feature activation+1.10e-01

Pos logit contributions
 hospital+0.578
 beast+0.563
 desert+0.540
 crow+0.537
 unfortunate+0.531
Neg logit contributions
packs-0.412
lets-0.395
aces-0.389
suits-0.378
es-0.350
Token Lily
Token ID20037
Feature activation-4.61e-01

Pos logit contributions
 beast+0.008
 ig+0.008
 crow+0.007
asis+0.007
 eagle+0.007
Neg logit contributions
aces-0.003
packs-0.003
 shirts-0.002
 names-0.002
 hands-0.002
Token screamed
Token ID25421
Feature activation+6.25e-01

Pos logit contributions
packs+0.020
aces+0.020
lets+0.019
suits+0.018
 names+0.018
Neg logit contributions
 beast-0.025
 hospital-0.024
 crow-0.024
 ig-0.024
 desert-0.023
Token.
Token ID13
Feature activation+1.09e+00

Pos logit contributions
 beast+0.035
 crow+0.034
 hospital+0.033
 ig+0.032
 desert+0.032
Neg logit contributions
packs-0.027
aces-0.026
lets-0.025
suits-0.024
 names-0.024
Token "
Token ID366
Feature activation-1.60e+00

Pos logit contributions
 beast+0.063
 crow+0.061
 hospital+0.060
 ig+0.058
 unfortunate+0.056
Neg logit contributions
packs-0.045
aces-0.044
lets-0.042
suits-0.041
 names-0.040
Token the
Token ID262
Feature activation+1.08e+00

Pos logit contributions
packs+0.041
aces+0.040
lets+0.039
suits+0.038
 names+0.037
Neg logit contributions
 beast-0.032
 crow-0.031
 hospital-0.030
 ig-0.028
 desert-0.028
Token ladder
Token ID18002
Feature activation+1.99e+00

Pos logit contributions
 beast+0.055
 crow+0.054
 hospital+0.051
 ig+0.049
 unfortunate+0.048
Neg logit contributions
aces-0.050
packs-0.050
 names-0.048
suits-0.046
 shirts-0.045
Token,
Token ID11
Feature activation-1.85e-01

Pos logit contributions
 beast+0.099
 crow+0.097
 hospital+0.095
 ig+0.090
 unfortunate+0.088
Neg logit contributions
packs-0.097
aces-0.093
lets-0.091
suits-0.088
 names-0.087
Token then
Token ID788
Feature activation+2.38e+00

Pos logit contributions
packs+0.009
aces+0.008
lets+0.008
 names+0.008
 shirts+0.008
Neg logit contributions
 beast-0.009
 crow-0.009
 hospital-0.009
 ig-0.009
 desert-0.008
Token R
Token ID371
Feature activation+3.76e+00

Pos logit contributions
 beast+0.111
 crow+0.108
 hospital+0.105
 ig+0.101
 desert+0.098
Neg logit contributions
packs-0.124
aces-0.118
lets-0.117
 names-0.113
suits-0.111
Tokenuf
Token ID3046
Feature activation+1.03e+01

Pos logit contributions
 hospital+0.102
 beast+0.100
 unfortunate+0.094
 cob+0.094
 crow+0.093
Neg logit contributions
aces-0.261
packs-0.255
lets-0.248
 names-0.240
suits-0.239
Tokenus
Token ID385
Feature activation+5.66e+00

Pos logit contributions
 hospital+0.351
 beast+0.345
 crow+0.329
 ig+0.327
 unfortunate+0.325
Neg logit contributions
packs-0.653
aces-0.628
lets-0.625
suits-0.604
ver-0.591
Token can
Token ID460
Feature activation-1.48e+00

Pos logit contributions
 beast+0.307
 crow+0.301
 hospital+0.294
 ig+0.280
 that+0.271
Neg logit contributions
packs-0.247
aces-0.230
lets-0.220
 journals-0.219
suits-0.218
Token come
Token ID1282
Feature activation+6.58e-01

Pos logit contributions
aces+0.059
 names+0.056
packs+0.056
lets+0.055
 chooses+0.055
Neg logit contributions
 beast-0.082
 crow-0.079
 desert-0.078
 ig-0.077
 hospital-0.076
Token over
Token ID625
Feature activation-2.27e+00

Pos logit contributions
 beast+0.039
 crow+0.038
 hospital+0.037
 unfortunate+0.035
 ig+0.035
Neg logit contributions
packs-0.028
lets-0.026
aces-0.025
 names-0.024
suits-0.024
Token."
Token ID526
Feature activation-2.14e+00

Pos logit contributions
packs+0.105
aces+0.097
lets+0.094
 names+0.092
suits+0.092
Neg logit contributions
 beast-0.121
 crow-0.119
 hospital-0.115
 ig-0.110
 unfortunate-0.109
TokenO
Token ID46
Feature activation+4.49e+00

Pos logit contributions
packs+0.201
aces+0.191
lets+0.184
 mattered+0.183
 names+0.182
Neg logit contributions
 beast-0.194
 hospital-0.186
 ig-0.185
 crow-0.178
 desert-0.176
Tokenw
Token ID86
Feature activation-5.89e-01

Pos logit contributions
 hospital+0.282
 beast+0.280
 crow+0.269
 ig+0.269
 desert+0.263
Neg logit contributions
packs-0.153
lets-0.146
aces-0.146
suits-0.143
 mattered-0.128
Token,
Token ID11
Feature activation-4.95e-01

Pos logit contributions
packs+0.027
aces+0.027
lets+0.026
suits+0.024
 names+0.024
Neg logit contributions
 beast-0.031
 hospital-0.030
 crow-0.030
 desert-0.028
 ig-0.028
Token ow
Token ID12334
Feature activation+1.03e+01

Pos logit contributions
 are+0.027
 hands+0.026
 dolls+0.025
 yes+0.025
 names+0.024
Neg logit contributions
 clearing-0.020
 depths-0.020
rig-0.020
 cottage-0.018
 desert-0.018
Token,
Token ID11
Feature activation-4.80e-01

Pos logit contributions
 hospital+0.597
 beast+0.593
 crow+0.570
 desert+0.552
 unfortunate+0.552
Neg logit contributions
packs-0.410
lets-0.382
aces-0.380
suits-0.367
 mattered-0.340
Token ow
Token ID12334
Feature activation+1.03e+01

Pos logit contributions
 are+0.027
 hands+0.026
 yes+0.025
're+0.024
 names+0.024
Neg logit contributions
 clearing-0.020
 depths-0.019
 cottage-0.018
rig-0.018
 desert-0.017
Token!
Token ID0
Feature activation+2.56e+00

Pos logit contributions
 hospital+0.573
 beast+0.542
 desert+0.525
 unfortunate+0.517
 ig+0.513
Neg logit contributions
packs-0.408
lets-0.397
aces-0.381
suits-0.379
es-0.361
Token That
Token ID1320
Feature activation-6.79e-02

Pos logit contributions
 beast+0.143
 crow+0.138
 hospital+0.138
 desert+0.135
 ig+0.133
Neg logit contributions
packs-0.102
aces-0.102
lets-0.099
 names-0.096
suits-0.092
Token goose
Token ID37246
Feature activation+3.67e+00

Pos logit contributions
lets+0.003
 names+0.003
packs+0.003
 mattered+0.002
aces+0.002
Neg logit contributions
 beast-0.004
andel-0.004
 hospital-0.004
 desert-0.003
 ig-0.003
Token is
Token ID318
Feature activation+9.22e-02

Pos logit contributions
 beast+0.208
 crow+0.204
 hospital+0.204
 desert+0.202
 ig+0.199
Neg logit contributions
packs-0.138
aces-0.137
lets-0.132
suits-0.125
 names-0.124
Token mean
Token ID1612
Feature activation+4.99e-01

Pos logit contributions
 beast+0.004
 desert+0.004
 cliff+0.004
andel+0.004
 ig+0.004
Neg logit contributions
aces-0.004
 mattered-0.004
 names-0.004
 shared-0.004
lets-0.004
Token a
Token ID257
Feature activation-1.07e+00

Pos logit contributions
 crow+0.014
 beast+0.014
 hospital+0.014
 ig+0.013
 desert+0.013
Neg logit contributions
packs-0.016
lets-0.016
aces-0.016
suits-0.015
 names-0.015
Token girl
Token ID2576
Feature activation+2.16e+00

Pos logit contributions
suits+0.050
lets+0.047
 names+0.047
packs+0.046
 shirts+0.045
Neg logit contributions
 beast-0.049
 hospital-0.046
 unfortunate-0.045
 depths-0.043
 crow-0.043
Token who
Token ID508
Feature activation+4.39e+00

Pos logit contributions
 beast+0.115
 unfortunate+0.105
 crow+0.102
 hospital+0.102
 eagle+0.099
Neg logit contributions
packs-0.105
lets-0.098
aces-0.092
suits-0.090
 pads-0.087
Token loved
Token ID6151
Feature activation-7.35e-01

Pos logit contributions
 beast+0.272
 crow+0.268
 hospital+0.264
 ig+0.253
 unfortunate+0.248
Neg logit contributions
packs-0.163
aces-0.154
lets-0.148
suits-0.142
 names-0.138
Token sun
Token ID4252
Feature activation-2.37e+00

Pos logit contributions
packs+0.032
aces+0.030
lets+0.030
 names+0.030
 shirts+0.029
Neg logit contributions
 beast-0.041
 crow-0.039
 hospital-0.038
 unfortunate-0.037
 ig-0.036
Tokensets
Token ID28709
Feature activation-9.95e+00

Pos logit contributions
packs+0.143
suits+0.141
ver+0.137
aces+0.135
 interests+0.134
Neg logit contributions
 crow-0.087
 ig-0.084
 that-0.084
 hospital-0.080
 desert-0.079
Token.
Token ID13
Feature activation-3.11e-01

Pos logit contributions
packs+0.632
suits+0.593
 interests+0.588
lets+0.582
 journals+0.577
Neg logit contributions
 that-0.366
 hospital-0.334
 crow-0.328
 ig-0.320
 beast-0.317
Token Every
Token ID3887
Feature activation-2.75e-01

Pos logit contributions
packs+0.019
suits+0.018
aces+0.018
lets+0.017
 mattered+0.017
Neg logit contributions
 hospital-0.011
 crow-0.011
 desert-0.010
 beast-0.010
 ig-0.010
Token day
Token ID1110
Feature activation+1.51e+00

Pos logit contributions
packs+0.014
aces+0.013
lets+0.013
suits+0.013
 names+0.013
Neg logit contributions
 beast-0.013
 crow-0.013
 hospital-0.013
 ig-0.012
 unfortunate-0.012
Token she
Token ID673
Feature activation+1.58e+00

Pos logit contributions
 that+0.050
 crow+0.050
 hospital+0.046
 beast+0.045
 desert+0.045
Neg logit contributions
aces-0.100
packs-0.100
suits-0.095
lets-0.093
 hips-0.091
Token would
Token ID561
Feature activation-2.21e+00

Pos logit contributions
 crow+0.098
 hospital+0.095
 beast+0.093
 ig+0.092
 desert+0.090
Neg logit contributions
packs-0.059
aces-0.056
lets-0.050
 hips-0.050
 journals-0.048
Token bear
Token ID6842
Feature activation+2.26e+00

Pos logit contributions
lets+0.046
suits+0.046
 names+0.043
 shirts+0.043
packs+0.043
Neg logit contributions
 unfortunate-0.053
 beast-0.051
 hospital-0.049
 explosion-0.048
 frightening-0.047
Token.
Token ID13
Feature activation-7.80e-01

Pos logit contributions
 beast+0.120
 unfortunate+0.116
 hospital+0.111
 crow+0.110
 cliff+0.107
Neg logit contributions
packs-0.102
lets-0.100
aces-0.093
 names-0.090
suits-0.087
Token He
Token ID679
Feature activation+1.77e+00

Pos logit contributions
packs+0.038
aces+0.036
lets+0.036
suits+0.035
 names+0.034
Neg logit contributions
 beast-0.039
 crow-0.038
 hospital-0.038
 ig-0.036
 desert-0.035
Token loved
Token ID6151
Feature activation-6.76e-01

Pos logit contributions
 beast+0.122
 hospital+0.118
 crow+0.117
 unfortunate+0.112
 ig+0.112
Neg logit contributions
aces-0.050
packs-0.050
lets-0.047
suits-0.046
 names-0.045
Token sun
Token ID4252
Feature activation-2.27e+00

Pos logit contributions
packs+0.027
 names+0.026
aces+0.025
lets+0.025
 shirts+0.024
Neg logit contributions
 beast-0.040
 hospital-0.037
 crow-0.037
 unfortunate-0.037
 hawk-0.034
Tokensets
Token ID28709
Feature activation-9.87e+00

Pos logit contributions
packs+0.114
suits+0.108
aces+0.106
 journals+0.103
 interests+0.102
Neg logit contributions
 ig-0.108
 crow-0.107
 beast-0.104
 that-0.103
 eagle-0.101
Token and
Token ID290
Feature activation-1.19e+00

Pos logit contributions
packs+0.589
suits+0.544
lets+0.538
aces+0.537
 interests+0.532
Neg logit contributions
 that-0.394
 beast-0.374
 hospital-0.370
 crow-0.370
 ig-0.368
Token he
Token ID339
Feature activation+1.73e+00

Pos logit contributions
packs+0.055
aces+0.053
suits+0.051
lets+0.048
ver+0.046
Neg logit contributions
 crow-0.065
 beast-0.063
 hospital-0.061
 ig-0.060
 desert-0.060
Token wanted
Token ID2227
Feature activation-1.06e-02

Pos logit contributions
 crow+0.106
 beast+0.105
 hospital+0.103
 ig+0.099
 desert+0.098
Neg logit contributions
packs-0.065
aces-0.061
lets-0.058
suits-0.056
 names-0.055
Token to
Token ID284
Feature activation+4.46e-01

Pos logit contributions
packs+0.001
aces+0.001
suits+0.001
lets+0.001
osaurs+0.001
Neg logit contributions
 crow-0.000
 beast-0.000
 hospital-0.000
 that-0.000
 desert-0.000
Token take
Token ID1011
Feature activation+2.51e+00

Pos logit contributions
 beast+0.024
 crow+0.023
 hospital+0.023
 ig+0.022
 unfortunate+0.022
Neg logit contributions
packs-0.020
aces-0.019
lets-0.019
 names-0.018
suits-0.018
Token The
Token ID383
Feature activation+5.38e-01

Pos logit contributions
packs+0.002
aces+0.002
lets+0.002
 names+0.002
suits+0.002
Neg logit contributions
 beast-0.003
 crow-0.003
 hospital-0.003
 ig-0.003
 eagle-0.003
Token people
Token ID661
Feature activation-4.61e-01

Pos logit contributions
 beast+0.022
 crow+0.022
 hospital+0.020
 ig+0.020
 unfortunate+0.019
Neg logit contributions
packs-0.029
 names-0.029
aces-0.028
suits-0.028
 shirts-0.027
Token,
Token ID11
Feature activation-6.37e-01

Pos logit contributions
packs+0.019
aces+0.018
lets+0.017
suits+0.017
 names+0.016
Neg logit contributions
 beast-0.027
 crow-0.026
 hospital-0.026
 ig-0.025
 desert-0.024
Token the
Token ID262
Feature activation+9.26e-01

Pos logit contributions
packs+0.033
aces+0.032
lets+0.031
suits+0.030
 names+0.030
Neg logit contributions
 beast-0.030
 crow-0.030
 hospital-0.029
 ig-0.028
 desert-0.027
Token sun
Token ID4252
Feature activation-2.31e+00

Pos logit contributions
 beast+0.037
 crow+0.035
 hospital+0.034
 ig+0.033
 unfortunate+0.032
Neg logit contributions
packs-0.052
aces-0.051
 names-0.051
suits-0.050
lets-0.049
Tokensets
Token ID28709
Feature activation-9.78e+00

Pos logit contributions
packs+0.110
aces+0.108
lets+0.101
suits+0.101
 journals+0.100
Neg logit contributions
 beast-0.116
 crow-0.116
 hospital-0.113
 ig-0.109
 desert-0.107
Token and
Token ID290
Feature activation-1.08e+00

Pos logit contributions
packs+0.328
aces+0.304
lets+0.290
suits+0.282
 names+0.275
Neg logit contributions
 beast-0.639
 crow-0.630
 hospital-0.624
 ig-0.598
 desert-0.587
Token the
Token ID262
Feature activation+9.75e-01

Pos logit contributions
packs+0.064
aces+0.063
lets+0.061
suits+0.060
 names+0.058
Neg logit contributions
 beast-0.043
 crow-0.042
 hospital-0.041
 ig-0.038
 desert-0.038
Token b
Token ID275
Feature activation+2.65e+00

Pos logit contributions
 beast+0.040
 crow+0.038
 hospital+0.037
 ig+0.035
 unfortunate+0.034
Neg logit contributions
packs-0.056
aces-0.055
lets-0.053
suits-0.052
 names-0.052
Tokenowing
Token ID7855
Feature activation+1.62e+00

Pos logit contributions
 beast+0.142
 crow+0.135
 hospital+0.134
 ig+0.125
 unfortunate+0.124
Neg logit contributions
packs-0.128
aces-0.126
lets-0.122
suits-0.115
 names-0.111
Token all
Token ID477
Feature activation-1.96e+00

Pos logit contributions
 beast+0.080
 crow+0.078
 hospital+0.076
 ig+0.071
 unfortunate+0.070
Neg logit contributions
packs-0.082
aces-0.077
lets-0.076
suits-0.074
 names-0.073
Token big
Token ID1263
Feature activation+3.32e+00

Pos logit contributions
 beast+0.075
 crow+0.072
 hospital+0.070
 unfortunate+0.066
 ig+0.066
Neg logit contributions
packs-0.071
aces-0.067
 names-0.066
 shirts-0.064
suits-0.064
Token mountains
Token ID12269
Feature activation-1.02e+00

Pos logit contributions
 beast+0.159
 crow+0.154
 hospital+0.152
 ig+0.147
 desert+0.143
Neg logit contributions
packs-0.171
aces-0.166
lets-0.164
suits-0.157
 names-0.154
Token and
Token ID290
Feature activation-1.16e+00

Pos logit contributions
packs+0.044
aces+0.042
lets+0.040
suits+0.040
 names+0.038
Neg logit contributions
 beast-0.057
 crow-0.056
 hospital-0.055
 unfortunate-0.052
 ig-0.052
Token beautiful
Token ID4950
Feature activation-4.62e-01

Pos logit contributions
packs+0.059
aces+0.056
lets+0.055
suits+0.055
 names+0.054
Neg logit contributions
 beast-0.054
 crow-0.052
 hospital-0.051
 ig-0.048
 unfortunate-0.047
Token sun
Token ID4252
Feature activation-2.29e+00

Pos logit contributions
packs+0.023
aces+0.022
lets+0.021
suits+0.021
 names+0.020
Neg logit contributions
 beast-0.023
 crow-0.022
 hospital-0.022
 ig-0.021
 unfortunate-0.021
Tokensets
Token ID28709
Feature activation-9.77e+00

Pos logit contributions
packs+0.114
aces+0.110
lets+0.106
suits+0.106
 journals+0.104
Neg logit contributions
 beast-0.110
 crow-0.108
 hospital-0.105
 ig-0.103
 desert-0.100
Token.
Token ID13
Feature activation-2.70e-01

Pos logit contributions
packs+0.545
aces+0.506
lets+0.500
 journals+0.499
 interests+0.496
Neg logit contributions
 beast-0.420
 hospital-0.407
 crow-0.407
 that-0.397
 ig-0.383
Token He
Token ID679
Feature activation+1.97e+00

Pos logit contributions
packs+0.013
aces+0.012
lets+0.012
suits+0.011
 names+0.011
Neg logit contributions
 beast-0.014
 crow-0.014
 hospital-0.014
 ig-0.013
 unfortunate-0.013
Token felt
Token ID2936
Feature activation+3.06e+00

Pos logit contributions
 beast+0.111
 crow+0.108
 hospital+0.106
 ig+0.101
 unfortunate+0.100
Neg logit contributions
packs-0.082
aces-0.078
lets-0.078
suits-0.075
 names-0.073
Token happy
Token ID3772
Feature activation-2.21e+00

Pos logit contributions
 beast+0.172
 crow+0.167
 ig+0.151
 hospital+0.151
 eagle+0.147
Neg logit contributions
aces-0.126
 names-0.121
packs-0.121
 mattered-0.118
 chooses-0.113
Token and
Token ID290
Feature activation-1.17e+00

Pos logit contributions
packs+0.092
aces+0.089
lets+0.086
 names+0.081
 pads+0.081
Neg logit contributions
 beast-0.124
 crow-0.121
 hospital-0.115
 unfortunate-0.113
 ig-0.111
Token big
Token ID1263
Feature activation+3.16e+00

Pos logit contributions
 beast+0.074
 crow+0.072
 hospital+0.071
 ig+0.066
 unfortunate+0.065
Neg logit contributions
packs-0.080
aces-0.075
lets-0.073
 names-0.073
suits-0.073
Token mountains
Token ID12269
Feature activation-1.03e+00

Pos logit contributions
 beast+0.138
 crow+0.132
 ig+0.131
 desert+0.131
 hospital+0.129
Neg logit contributions
aces-0.174
packs-0.171
lets-0.167
ver-0.162
suits-0.157
Token and
Token ID290
Feature activation-1.12e+00

Pos logit contributions
packs+0.046
aces+0.044
lets+0.042
suits+0.042
 names+0.040
Neg logit contributions
 beast-0.057
 crow-0.055
 hospital-0.055
 ig-0.051
 unfortunate-0.051
Token beautiful
Token ID4950
Feature activation-4.78e-01

Pos logit contributions
packs+0.062
aces+0.060
lets+0.059
suits+0.057
 names+0.056
Neg logit contributions
 beast-0.049
 crow-0.047
 hospital-0.046
 ig-0.044
 desert-0.043
Token sun
Token ID4252
Feature activation-2.35e+00

Pos logit contributions
packs+0.025
aces+0.024
lets+0.023
suits+0.023
 names+0.022
Neg logit contributions
 beast-0.022
 crow-0.022
 hospital-0.021
 ig-0.021
 desert-0.020
Tokensets
Token ID28709
Feature activation-9.72e+00

Pos logit contributions
packs+0.125
aces+0.123
 journals+0.120
 interests+0.118
suits+0.116
Neg logit contributions
 beast-0.099
 crow-0.097
 that-0.097
 ig-0.096
 hospital-0.094
Token.
Token ID13
Feature activation-5.53e-01

Pos logit contributions
packs+0.566
 journals+0.530
 interests+0.527
aces+0.527
lets+0.516
Neg logit contributions
 beast-0.387
 that-0.384
 crow-0.373
 hospital-0.372
 ig-0.353
Token But
Token ID887
Feature activation+5.60e-01

Pos logit contributions
packs+0.031
aces+0.031
lets+0.029
 journals+0.029
suits+0.027
Neg logit contributions
 hospital-0.024
 beast-0.023
 ig-0.022
 crow-0.022
 that-0.022
Token one
Token ID530
Feature activation-3.40e+00

Pos logit contributions
 beast+0.020
 hospital+0.019
 crow+0.019
 ig+0.018
 that+0.017
Neg logit contributions
packs-0.036
aces-0.036
lets-0.034
suits-0.034
 names-0.033
Token day
Token ID1110
Feature activation+1.66e+00

Pos logit contributions
packs+0.178
aces+0.168
lets+0.166
suits+0.162
 names+0.159
Neg logit contributions
 beast-0.160
 crow-0.155
 hospital-0.153
 ig-0.143
 unfortunate-0.140
Token,
Token ID11
Feature activation-7.23e-01

Pos logit contributions
 that+0.061
 beast+0.059
 hospital+0.057
 crow+0.057
 ig+0.056
Neg logit contributions
aces-0.108
packs-0.103
lets-0.099
 wrists-0.099
suits-0.097
Token.
Token ID13
Feature activation-4.41e-02

Pos logit contributions
packs+0.105
lets+0.101
aces+0.098
 names+0.095
 pads+0.090
Neg logit contributions
 beast-0.105
 crow-0.100
 hospital-0.099
 unfortunate-0.096
 ig-0.095
Token She
Token ID1375
Feature activation+2.33e+00

Pos logit contributions
packs+0.002
aces+0.002
lets+0.002
suits+0.002
 names+0.002
Neg logit contributions
 beast-0.002
 crow-0.002
 hospital-0.002
 ig-0.002
 desert-0.002
Token watched
Token ID7342
Feature activation+2.83e-01

Pos logit contributions
 beast+0.150
 hospital+0.144
 crow+0.144
 unfortunate+0.136
 ig+0.136
Neg logit contributions
packs-0.080
lets-0.080
aces-0.077
suits-0.071
 names-0.071
Token the
Token ID262
Feature activation+9.55e-01

Pos logit contributions
 beast+0.014
 crow+0.013
 hospital+0.013
 ig+0.012
 unfortunate+0.012
Neg logit contributions
packs-0.014
aces-0.014
lets-0.013
 names-0.013
suits-0.013
Token sun
Token ID4252
Feature activation-2.23e+00

Pos logit contributions
 beast+0.044
 crow+0.043
 hospital+0.042
 unfortunate+0.040
 ig+0.040
Neg logit contributions
packs-0.048
 names-0.046
aces-0.046
lets-0.045
 journals-0.044
Tokensets
Token ID28709
Feature activation-9.65e+00

Pos logit contributions
packs+0.109
aces+0.106
suits+0.101
lets+0.101
 names+0.097
Neg logit contributions
 beast-0.109
 crow-0.107
 hospital-0.105
 ig-0.101
 desert-0.098
Token and
Token ID290
Feature activation-9.66e-01

Pos logit contributions
packs+0.483
aces+0.460
suits+0.443
lets+0.438
 names+0.425
Neg logit contributions
 beast-0.466
 crow-0.456
 hospital-0.450
 ig-0.429
 eagle-0.417
Token fireworks
Token ID26056
Feature activation-2.46e+00

Pos logit contributions
packs+0.043
lets+0.041
aces+0.040
 names+0.039
suits+0.038
Neg logit contributions
 beast-0.053
 crow-0.051
 hospital-0.050
 ig-0.048
 unfortunate-0.047
Token for
Token ID329
Feature activation+3.53e+00

Pos logit contributions
packs+0.116
aces+0.112
lets+0.108
suits+0.105
 names+0.103
Neg logit contributions
 beast-0.126
 crow-0.123
 hospital-0.121
 ig-0.116
 desert-0.112
Token hours
Token ID2250
Feature activation-1.17e+00

Pos logit contributions
 beast+0.188
 crow+0.182
 hospital+0.177
 ig+0.173
 desert+0.166
Neg logit contributions
packs-0.159
aces-0.152
 names-0.146
lets-0.145
 journals-0.142
Token.
Token ID13
Feature activation+1.12e-01

Pos logit contributions
packs+0.058
aces+0.055
lets+0.054
suits+0.053
 names+0.052
Neg logit contributions
 beast-0.057
 crow-0.056
 hospital-0.055
 ig-0.052
 desert-0.051
Token adm
Token ID6178
Feature activation-2.36e-01

Pos logit contributions
packs+0.039
aces+0.036
 names+0.035
lets+0.035
 interests+0.033
Neg logit contributions
 beast-0.037
 crow-0.036
 hospital-0.035
 ig-0.033
 unfortunate-0.032
Tokeniring
Token ID3428
Feature activation+2.90e+00

Pos logit contributions
aces+0.016
packs+0.016
lets+0.015
 names+0.015
 journals+0.014
Neg logit contributions
 beast-0.007
 hospital-0.007
 crow-0.007
 ig-0.006
 desert-0.006
Token the
Token ID262
Feature activation+1.06e+00

Pos logit contributions
 beast+0.142
 crow+0.140
 hospital+0.135
 ig+0.127
 desert+0.127
Neg logit contributions
packs-0.143
aces-0.136
 names-0.130
lets-0.129
 mattered-0.123
Token beautiful
Token ID4950
Feature activation-3.51e-01

Pos logit contributions
 beast+0.042
 crow+0.041
 hospital+0.040
 ig+0.038
 unfortunate+0.036
Neg logit contributions
packs-0.062
aces-0.059
lets-0.058
 names-0.058
suits-0.057
Token sun
Token ID4252
Feature activation-2.10e+00

Pos logit contributions
packs+0.021
aces+0.020
lets+0.020
suits+0.019
 names+0.019
Neg logit contributions
 beast-0.014
 crow-0.014
 hospital-0.013
 ig-0.012
 unfortunate-0.012
Tokensets
Token ID28709
Feature activation-9.62e+00

Pos logit contributions
packs+0.110
aces+0.106
suits+0.105
lets+0.102
 names+0.101
Neg logit contributions
 beast-0.095
 crow-0.094
 hospital-0.089
 that-0.086
 ig-0.085
Token from
Token ID422
Feature activation+3.84e+00

Pos logit contributions
packs+0.510
aces+0.482
suits+0.470
lets+0.470
 names+0.460
Neg logit contributions
 beast-0.443
 crow-0.429
 hospital-0.420
 ig-0.393
 that-0.391
Token afar
Token ID44246
Feature activation+3.13e+00

Pos logit contributions
 beast+0.196
 crow+0.191
 hospital+0.185
ule+0.179
 ig+0.178
Neg logit contributions
packs-0.175
 names-0.146
aces-0.145
 wrists-0.142
 shirts-0.139
Token.
Token ID13
Feature activation+4.01e-01

Pos logit contributions
 beast+0.160
 crow+0.156
 hospital+0.153
 ig+0.146
 unfortunate+0.142
Neg logit contributions
packs-0.150
aces-0.144
lets-0.140
suits-0.135
 names-0.134
Token<|endoftext|>
Token ID50256
Feature activation+1.81e+00

Pos logit contributions
 beast+0.023
 crow+0.022
 hospital+0.022
 ig+0.021
 unfortunate+0.020
Neg logit contributions
packs-0.017
aces-0.016
lets-0.016
suits-0.015
 names-0.015
TokenOnce
Token ID7454
Feature activation-6.51e-01

Pos logit contributions
 beast+0.062
 crow+0.058
 hospital+0.056
 ig+0.053
 desert+0.052
Neg logit contributions
packs-0.116
lets-0.115
aces-0.114
 journals-0.108
suits-0.108
Token could
Token ID714
Feature activation-8.57e-01

Pos logit contributions
 beast+0.134
 crow+0.132
 hospital+0.130
 desert+0.125
 ig+0.124
Neg logit contributions
packs-0.077
aces-0.075
lets-0.074
 names-0.067
 mattered-0.066
Token look
Token ID804
Feature activation+1.63e+00

Pos logit contributions
packs+0.034
aces+0.034
lets+0.032
 names+0.031
 mattered+0.030
Neg logit contributions
 beast-0.050
 crow-0.048
 desert-0.046
 hospital-0.045
 ig-0.045
Token at
Token ID379
Feature activation-2.54e+00

Pos logit contributions
 beast+0.087
 crow+0.085
 hospital+0.083
 ig+0.079
 desert+0.077
Neg logit contributions
packs-0.076
aces-0.072
lets-0.070
suits-0.068
 names-0.067
Token the
Token ID262
Feature activation+1.18e+00

Pos logit contributions
packs+0.141
aces+0.136
lets+0.133
suits+0.131
 names+0.128
Neg logit contributions
 beast-0.110
 crow-0.107
 hospital-0.105
 ig-0.099
 unfortunate-0.096
Token sun
Token ID4252
Feature activation-2.09e+00

Pos logit contributions
 beast+0.047
 crow+0.045
 hospital+0.044
 ig+0.041
 desert+0.040
Neg logit contributions
packs-0.071
aces-0.069
lets-0.067
suits-0.066
 names-0.064
Tokensets
Token ID28709
Feature activation-9.61e+00

Pos logit contributions
packs+0.099
aces+0.095
lets+0.092
suits+0.090
 names+0.088
Neg logit contributions
 beast-0.107
 crow-0.105
 hospital-0.103
 ig-0.098
 desert-0.095
Token pieces
Token ID5207
Feature activation-2.91e+00

Pos logit contributions
packs+0.461
aces+0.440
lets+0.427
suits+0.419
 names+0.411
Neg logit contributions
 beast-0.490
 crow-0.477
 hospital-0.470
 ig-0.446
 desert-0.433
Token every
Token ID790
Feature activation-8.49e-02

Pos logit contributions
packs+0.149
aces+0.143
lets+0.139
suits+0.136
 names+0.134
Neg logit contributions
 beast-0.139
 crow-0.135
 hospital-0.133
 ig-0.126
 unfortunate-0.122
Token day
Token ID1110
Feature activation+1.88e+00

Pos logit contributions
packs+0.004
aces+0.004
lets+0.004
suits+0.004
 names+0.004
Neg logit contributions
 beast-0.004
 crow-0.004
 hospital-0.004
 ig-0.004
 unfortunate-0.004
Token and
Token ID290
Feature activation-6.96e-01

Pos logit contributions
 beast+0.092
 crow+0.090
 hospital+0.088
 ig+0.083
 desert+0.081
Neg logit contributions
packs-0.093
aces-0.089
lets-0.087
suits-0.085
 names-0.084
Token remember
Token ID3505
Feature activation-2.05e+00

Pos logit contributions
packs+0.033
lets+0.031
aces+0.031
 names+0.030
 mattered+0.029
Neg logit contributions
 beast-0.036
 crow-0.035
 hospital-0.033
 ig-0.033
 desert-0.031
Token friends
Token ID2460
Feature activation-4.24e+00

Pos logit contributions
packs+0.101
aces+0.097
lets+0.093
 names+0.090
suits+0.090
Neg logit contributions
 beast-0.110
 crow-0.107
 hospital-0.105
 ig-0.100
 unfortunate-0.098
Token who
Token ID508
Feature activation+4.95e+00

Pos logit contributions
packs+0.227
aces+0.221
lets+0.217
suits+0.214
 names+0.206
Neg logit contributions
 beast-0.187
 crow-0.183
 hospital-0.183
 that-0.172
 ig-0.169
Token play
Token ID711
Feature activation-4.42e-01

Pos logit contributions
 beast+0.309
 crow+0.303
 hospital+0.299
 ig+0.286
 desert+0.280
Neg logit contributions
packs-0.180
aces-0.170
lets-0.164
suits-0.158
 names-0.154
Token drums
Token ID24730
Feature activation-5.44e+00

Pos logit contributions
packs+0.023
lets+0.022
aces+0.022
suits+0.021
 names+0.020
Neg logit contributions
 beast-0.021
 hospital-0.021
 crow-0.020
 that-0.019
 ig-0.019
Token and
Token ID290
Feature activation-3.66e-01

Pos logit contributions
packs+0.293
lets+0.275
suits+0.274
aces+0.273
 journals+0.264
Neg logit contributions
 hospital-0.237
 beast-0.235
 crow-0.230
 that-0.225
 ig-0.211
Token keyboards
Token ID34512
Feature activation-9.41e+00

Pos logit contributions
packs+0.016
aces+0.015
suits+0.015
lets+0.015
ver+0.014
Neg logit contributions
 beast-0.020
 crow-0.020
 hospital-0.020
 eagle-0.018
 that-0.018
Token.
Token ID13
Feature activation+8.30e-01

Pos logit contributions
packs+0.544
suits+0.514
lets+0.506
aces+0.505
 journals+0.492
Neg logit contributions
 hospital-0.386
 crow-0.372
 that-0.372
 beast-0.364
 ig-0.336
Token They
Token ID1119
Feature activation-2.48e+00

Pos logit contributions
 beast+0.040
 crow+0.039
 hospital+0.039
 ig+0.037
 desert+0.036
Neg logit contributions
packs-0.042
aces-0.040
lets-0.039
suits-0.038
 names-0.037
Token wear
Token ID5806
Feature activation-2.30e+00

Pos logit contributions
packs+0.116
aces+0.110
lets+0.106
suits+0.105
 journals+0.098
Neg logit contributions
 hospital-0.129
 beast-0.129
 crow-0.128
 ig-0.118
 eagle-0.118
Token colorful
Token ID20239
Feature activation-3.03e+00

Pos logit contributions
packs+0.119
lets+0.114
ver+0.110
aces+0.109
 mattered+0.106
Neg logit contributions
 that-0.110
 hospital-0.098
 beast-0.091
 crow-0.091
 cob-0.088
Token clothes
Token ID8242
Feature activation-3.53e+00

Pos logit contributions
ver+0.118
packs+0.110
 mattered+0.104
aces+0.102
lets+0.099
Neg logit contributions
 that-0.172
 crow-0.169
 beast-0.166
 hospital-0.163
 ig-0.161
Token marching
Token ID26409
Feature activation-1.26e+00

Pos logit contributions
packs+0.023
aces+0.023
lets+0.022
suits+0.021
 names+0.021
Neg logit contributions
 beast-0.021
 hospital-0.020
 crow-0.020
 ig-0.019
 that-0.019
Token and
Token ID290
Feature activation-1.03e+00

Pos logit contributions
packs+0.062
aces+0.061
lets+0.060
 journals+0.058
 names+0.054
Neg logit contributions
 beast-0.057
 hospital-0.056
 that-0.055
 crow-0.051
 unfortunate-0.051
Token shouting
Token ID19642
Feature activation+3.62e-01

Pos logit contributions
packs+0.049
aces+0.047
lets+0.046
suits+0.044
 names+0.043
Neg logit contributions
 beast-0.053
 crow-0.051
 hospital-0.051
 ig-0.048
 desert-0.047
Token,
Token ID11
Feature activation-6.57e-01

Pos logit contributions
 beast+0.015
 hospital+0.014
 that+0.014
 crow+0.014
 unfortunate+0.013
Neg logit contributions
packs-0.021
aces-0.020
lets-0.019
suits-0.018
 journals-0.018
Token waving
Token ID25849
Feature activation-9.40e-01

Pos logit contributions
packs+0.031
aces+0.029
lets+0.028
suits+0.028
 names+0.027
Neg logit contributions
 beast-0.035
 crow-0.034
 hospital-0.033
 ig-0.032
 desert-0.031
Token banners
Token ID31931
Feature activation-9.19e+00

Pos logit contributions
packs+0.050
aces+0.049
lets+0.047
suits+0.045
 journals+0.044
Neg logit contributions
 beast-0.041
 crow-0.040
 that-0.039
 hospital-0.038
 unfortunate-0.035
Token.
Token ID13
Feature activation-1.22e-01

Pos logit contributions
packs+0.478
aces+0.466
lets+0.447
suits+0.444
 journals+0.442
Neg logit contributions
 beast-0.408
 crow-0.392
 hospital-0.386
 that-0.384
 ig-0.370
Token He
Token ID679
Feature activation+2.18e+00

Pos logit contributions
packs+0.006
aces+0.006
lets+0.005
suits+0.005
 names+0.005
Neg logit contributions
 beast-0.006
 hospital-0.006
 crow-0.006
 ig-0.006
 desert-0.006
Token thought
Token ID1807
Feature activation+1.30e-01

Pos logit contributions
 beast+0.118
 crow+0.115
 hospital+0.113
 ig+0.108
 desert+0.105
Neg logit contributions
packs-0.097
aces-0.093
lets-0.091
suits-0.088
 names-0.087
Token it
Token ID340
Feature activation-7.33e-01

Pos logit contributions
 beast+0.006
 crow+0.005
 hospital+0.005
 ig+0.005
 desert+0.005
Neg logit contributions
packs-0.007
aces-0.007
lets-0.007
suits-0.007
 names-0.007
Token was
Token ID373
Feature activation+5.91e-01

Pos logit contributions
packs+0.048
aces+0.045
lets+0.042
suits+0.042
 journals+0.041
Neg logit contributions
 beast-0.029
 that-0.026
 hospital-0.026
 frightening-0.026
 unfortunate-0.026
Token.
Token ID13
Feature activation-2.23e-01

Pos logit contributions
packs+0.110
aces+0.106
lets+0.103
suits+0.101
 names+0.099
Neg logit contributions
 beast-0.115
 crow-0.112
 hospital-0.110
 ig-0.105
 desert-0.102
Token His
Token ID2399
Feature activation-2.71e+00

Pos logit contributions
packs+0.011
aces+0.010
lets+0.010
suits+0.010
 names+0.010
Neg logit contributions
 beast-0.011
 hospital-0.011
 crow-0.010
 ig-0.010
 desert-0.010
Token wife
Token ID3656
Feature activation-8.98e-01

Pos logit contributions
packs+0.136
aces+0.130
lets+0.127
suits+0.124
 names+0.123
Neg logit contributions
 beast-0.133
 crow-0.129
 hospital-0.127
 ig-0.121
 unfortunate-0.117
Token hung
Token ID9174
Feature activation+2.35e+00

Pos logit contributions
packs+0.044
aces+0.041
suits+0.039
 journals+0.038
lets+0.038
Neg logit contributions
 crow-0.044
 beast-0.043
 hospital-0.043
 ig-0.041
 that-0.041
Token happy
Token ID3772
Feature activation-2.11e+00

Pos logit contributions
 beast+0.112
 crow+0.110
 hospital+0.107
 ig+0.104
 that+0.099
Neg logit contributions
packs-0.119
lets-0.113
aces-0.113
suits-0.110
 pads-0.105
Token banners
Token ID31931
Feature activation-9.11e+00

Pos logit contributions
packs+0.098
aces+0.093
lets+0.090
suits+0.087
 names+0.087
Neg logit contributions
 beast-0.112
 crow-0.110
 hospital-0.107
 ig-0.102
 unfortunate-0.100
Token and
Token ID290
Feature activation-9.46e-01

Pos logit contributions
packs+0.462
aces+0.446
lets+0.434
suits+0.432
 names+0.422
Neg logit contributions
 beast-0.416
 crow-0.402
 hospital-0.401
 ig-0.388
 that-0.381
Token spark
Token ID9009
Feature activation+4.23e-01

Pos logit contributions
packs+0.043
aces+0.042
suits+0.040
lets+0.040
 names+0.038
Neg logit contributions
 beast-0.049
 hospital-0.048
 crow-0.048
 ig-0.047
 desert-0.045
Tokenly
Token ID306
Feature activation+2.34e+00

Pos logit contributions
 beast+0.018
 crow+0.017
 hospital+0.017
 that+0.017
 ig+0.016
Neg logit contributions
packs-0.023
aces-0.021
suits-0.021
lets-0.020
 names-0.020
Token stream
Token ID4269
Feature activation+3.62e+00

Pos logit contributions
 beast+0.125
 crow+0.123
 hospital+0.121
 ig+0.121
 desert+0.116
Neg logit contributions
packs-0.096
aces-0.091
lets-0.088
suits-0.088
ver-0.087
Tokeners
Token ID364
Feature activation-3.42e+00

Pos logit contributions
 beast+0.221
 crow+0.217
 hospital+0.215
 ig+0.210
 that+0.203
Neg logit contributions
packs-0.132
lets-0.119
aces-0.116
 journals-0.111
 names-0.110
Token Everyone
Token ID11075
Feature activation-4.47e-01

Pos logit contributions
 beast+0.086
 hospital+0.084
 crow+0.082
 ig+0.078
 that+0.077
Neg logit contributions
packs-0.073
aces-0.071
lets-0.069
suits-0.066
 journals-0.064
Token had
Token ID550
Feature activation-2.46e+00

Pos logit contributions
packs+0.019
aces+0.019
lets+0.018
suits+0.017
 names+0.017
Neg logit contributions
 beast-0.025
 crow-0.024
 hospital-0.024
 ig-0.023
 unfortunate-0.022
Token fun
Token ID1257
Feature activation-2.53e+00

Pos logit contributions
packs+0.136
aces+0.133
lets+0.129
suits+0.125
 journals+0.122
Neg logit contributions
 beast-0.104
 crow-0.102
 hospital-0.102
 that-0.094
 ig-0.093
Token painting
Token ID12036
Feature activation-1.43e-01

Pos logit contributions
packs+0.122
aces+0.119
lets+0.116
suits+0.112
 names+0.109
Neg logit contributions
 beast-0.126
 hospital-0.121
 crow-0.120
 ig-0.117
 that-0.115
Token the
Token ID262
Feature activation+1.09e+00

Pos logit contributions
packs+0.006
aces+0.006
lets+0.006
suits+0.006
 names+0.006
Neg logit contributions
 beast-0.008
 crow-0.007
 hospital-0.007
 ig-0.007
 that-0.007
Token banners
Token ID31931
Feature activation-9.11e+00

Pos logit contributions
 beast+0.045
 crow+0.044
 hospital+0.043
 ig+0.040
 unfortunate+0.039
Neg logit contributions
packs-0.061
aces-0.059
lets-0.058
suits-0.058
 names-0.057
Token and
Token ID290
Feature activation-1.07e+00

Pos logit contributions
packs+0.456
aces+0.440
lets+0.425
suits+0.420
 names+0.410
Neg logit contributions
 beast-0.437
 crow-0.425
 hospital-0.420
 ig-0.397
 desert-0.387
Token marching
Token ID26409
Feature activation-1.12e+00

Pos logit contributions
packs+0.053
aces+0.052
lets+0.049
suits+0.049
 names+0.046
Neg logit contributions
 beast-0.052
 crow-0.051
 hospital-0.050
 ig-0.048
 desert-0.047
Token around
Token ID1088
Feature activation-2.47e+00

Pos logit contributions
packs+0.052
aces+0.050
lets+0.049
suits+0.046
 journals+0.045
Neg logit contributions
 beast-0.057
 hospital-0.055
 crow-0.053
 that-0.052
 ig-0.051
Token.
Token ID13
Feature activation+4.41e-02

Pos logit contributions
packs+0.133
aces+0.129
lets+0.126
suits+0.123
 names+0.120
Neg logit contributions
 beast-0.111
 crow-0.108
 hospital-0.106
 ig-0.100
 desert-0.097
Token
Token ID220
Feature activation-1.47e+00

Pos logit contributions
 beast+0.002
 hospital+0.002
 crow+0.002
 ig+0.002
 desert+0.002
Neg logit contributions
packs-0.002
aces-0.002
lets-0.002
suits-0.002
 journals-0.002
Token living
Token ID2877
Feature activation+3.35e+00

Pos logit contributions
 beast+0.062
 crow+0.060
 hawk+0.056
 unfortunate+0.055
 eagle+0.054
Neg logit contributions
packs-0.056
 shirts-0.056
aces-0.056
 names-0.055
 journals-0.053
Token room
Token ID2119
Feature activation-8.68e-02

Pos logit contributions
 beast+0.172
 crow+0.169
 hospital+0.159
 eagle+0.155
 hawk+0.154
Neg logit contributions
packs-0.165
aces-0.156
suits-0.147
 shirts-0.147
lets-0.145
Token with
Token ID351
Feature activation-5.49e-01

Pos logit contributions
packs+0.004
aces+0.004
lets+0.004
suits+0.004
 names+0.004
Neg logit contributions
 beast-0.004
 hospital-0.004
 crow-0.004
 ig-0.004
 that-0.004
Token balloons
Token ID35485
Feature activation-6.82e+00

Pos logit contributions
packs+0.026
aces+0.025
lets+0.025
suits+0.024
 pads+0.022
Neg logit contributions
 beast-0.027
 hospital-0.027
 crow-0.027
 ig-0.027
 desert-0.025
Token and
Token ID290
Feature activation-6.16e-01

Pos logit contributions
packs+0.346
aces+0.346
lets+0.329
suits+0.322
 names+0.321
Neg logit contributions
 beast-0.306
 crow-0.301
 hospital-0.296
 ig-0.291
 that-0.285
Token banners
Token ID31931
Feature activation-9.05e+00

Pos logit contributions
packs+0.022
aces+0.022
lets+0.021
suits+0.021
ver+0.021
Neg logit contributions
 crow-0.037
 hospital-0.036
 beast-0.036
 ig-0.036
 desert-0.034
Token.
Token ID13
Feature activation+4.71e-01

Pos logit contributions
packs+0.499
aces+0.493
 journals+0.475
 interests+0.474
lets+0.472
Neg logit contributions
 that-0.355
 beast-0.333
 crow-0.327
 ig-0.325
 hospital-0.322
Token They
Token ID1119
Feature activation-2.44e+00

Pos logit contributions
 hospital+0.031
 beast+0.030
 crow+0.030
 ig+0.029
 desert+0.029
Neg logit contributions
packs-0.015
aces-0.015
lets-0.015
suits-0.013
 names-0.012
Token had
Token ID550
Feature activation-2.31e+00

Pos logit contributions
packs+0.097
aces+0.095
lets+0.090
suits+0.087
 names+0.084
Neg logit contributions
 hospital-0.139
 crow-0.139
 beast-0.138
 ig-0.134
 desert-0.128
Token a
Token ID257
Feature activation-3.99e-01

Pos logit contributions
packs+0.120
aces+0.115
lets+0.113
suits+0.110
 mattered+0.104
Neg logit contributions
 crow-0.106
 hospital-0.105
 beast-0.105
 ig-0.100
 desert-0.097
Token big
Token ID1263
Feature activation+3.60e+00

Pos logit contributions
packs+0.020
suits+0.018
 names+0.018
lets+0.018
 journals+0.018
Neg logit contributions
 beast-0.020
 crow-0.019
 hospital-0.018
 ig-0.017
 eagle-0.017
Token all
Token ID477
Feature activation-2.03e+00

Pos logit contributions
packs+0.147
aces+0.141
lets+0.136
 names+0.135
suits+0.132
Neg logit contributions
 beast-0.129
 crow-0.127
 hospital-0.123
 ig-0.117
 desert-0.114
Token the
Token ID262
Feature activation+4.74e-01

Pos logit contributions
packs+0.116
aces+0.112
lets+0.110
suits+0.107
 names+0.106
Neg logit contributions
 beast-0.086
 crow-0.083
 hospital-0.081
 ig-0.076
 desert-0.074
Token shiny
Token ID22441
Feature activation+1.43e-02

Pos logit contributions
 beast+0.024
 crow+0.022
 hospital+0.022
 eagle+0.022
 unexpected+0.021
Neg logit contributions
packs-0.024
aces-0.023
lets-0.023
suits-0.022
ver-0.021
Token,
Token ID11
Feature activation-8.50e-01

Pos logit contributions
 beast+0.001
 crow+0.001
 hospital+0.001
 that+0.001
 eagle+0.001
Neg logit contributions
packs-0.001
aces-0.001
lets-0.001
suits-0.001
ver-0.001
Token colourful
Token ID44799
Feature activation-3.01e+00

Pos logit contributions
packs+0.048
aces+0.045
lets+0.045
suits+0.043
 names+0.042
Neg logit contributions
 beast-0.037
 crow-0.037
 hospital-0.036
 ig-0.033
 desert-0.033
Token guitars
Token ID32497
Feature activation-8.84e+00

Pos logit contributions
packs+0.124
aces+0.119
suits+0.112
lets+0.110
ver+0.109
Neg logit contributions
 beast-0.174
 crow-0.170
 hospital-0.162
 that-0.161
 eagle-0.161
Token and
Token ID290
Feature activation-1.09e+00

Pos logit contributions
packs+0.509
suits+0.474
aces+0.473
lets+0.460
 journals+0.457
Neg logit contributions
 beast-0.359
 crow-0.352
 that-0.351
 hospital-0.350
 eagle-0.322
Token explained
Token ID4893
Feature activation-2.31e+00

Pos logit contributions
packs+0.058
aces+0.054
suits+0.051
lets+0.051
 journals+0.048
Neg logit contributions
 crow-0.051
 beast-0.051
 hospital-0.048
 eagle-0.046
 that-0.046
Token to
Token ID284
Feature activation+4.24e-01

Pos logit contributions
packs+0.129
aces+0.125
lets+0.123
 names+0.120
suits+0.118
Neg logit contributions
 beast-0.098
 crow-0.095
 hospital-0.093
 ig-0.088
 desert-0.085
Token his
Token ID465
Feature activation-1.09e+00

Pos logit contributions
 crow+0.023
 beast+0.023
 desert+0.022
 hospital+0.022
 ig+0.022
Neg logit contributions
 names-0.017
aces-0.017
packs-0.017
lets-0.016
 shirts-0.016
Token mother
Token ID2802
Feature activation+3.88e-01

Pos logit contributions
packs+0.060
aces+0.059
suits+0.056
lets+0.055
 journals+0.052
Neg logit contributions
 beast-0.050
 crow-0.048
 hospital-0.046
 ig-0.043
 eagle-0.043
Token with
Token ID351
Feature activation-5.81e-01

Pos logit contributions
packs+0.032
lets+0.029
aces+0.029
suits+0.028
 names+0.028
Neg logit contributions
 beast-0.034
 crow-0.033
 hospital-0.032
 unfortunate-0.030
 cliff-0.030
Token her
Token ID607
Feature activation-6.21e-01

Pos logit contributions
packs+0.026
aces+0.024
 names+0.023
 shirts+0.023
 journals+0.022
Neg logit contributions
 beast-0.032
 crow-0.031
 hospital-0.029
 desert-0.029
 eagle-0.028
Token hands
Token ID2832
Feature activation-5.05e+00

Pos logit contributions
packs+0.029
aces+0.028
lets+0.027
suits+0.026
 names+0.026
Neg logit contributions
 beast-0.032
 crow-0.031
 hospital-0.031
 ig-0.029
 desert-0.029
Token on
Token ID319
Feature activation-2.73e+00

Pos logit contributions
packs+0.277
aces+0.272
lets+0.270
 journals+0.265
suits+0.263
Neg logit contributions
 beast-0.200
 that-0.198
 hospital-0.197
 crow-0.196
 ig-0.189
Token her
Token ID607
Feature activation-6.11e-01

Pos logit contributions
packs+0.119
aces+0.111
 shirts+0.105
 names+0.105
 hips+0.104
Neg logit contributions
 beast-0.148
 crow-0.144
 hospital-0.138
 unfortunate-0.137
 eagle-0.134
Token hips
Token ID21497
Feature activation-8.78e+00

Pos logit contributions
packs+0.026
aces+0.025
lets+0.024
suits+0.023
 names+0.023
Neg logit contributions
 beast-0.035
 crow-0.034
 hospital-0.033
 ig-0.032
 unfortunate-0.031
Token.
Token ID13
Feature activation+1.47e-01

Pos logit contributions
packs+0.529
aces+0.497
lets+0.494
suits+0.490
 journals+0.489
Neg logit contributions
 that-0.328
 hospital-0.312
 beast-0.309
 crow-0.298
 ig-0.294
Token She
Token ID1375
Feature activation+2.34e+00

Pos logit contributions
 beast+0.009
 crow+0.009
 hospital+0.009
 ig+0.008
 unfortunate+0.008
Neg logit contributions
packs-0.006
aces-0.005
lets-0.005
suits-0.005
 names-0.005
Token said
Token ID531
Feature activation-2.15e-02

Pos logit contributions
 beast+0.144
 hospital+0.139
 crow+0.136
 desert+0.135
 cliff+0.134
Neg logit contributions
lets-0.081
packs-0.078
suits-0.073
aces-0.072
 names-0.071
Token,
Token ID11
Feature activation-5.92e-01

Pos logit contributions
packs+0.001
aces+0.001
lets+0.001
suits+0.001
 names+0.001
Neg logit contributions
 beast-0.001
 crow-0.001
 hospital-0.001
 ig-0.001
 unfortunate-0.001
Token �
Token ID6184
Feature activation+1.38e+00

Pos logit contributions
packs+0.025
aces+0.024
lets+0.023
suits+0.022
 names+0.022
Neg logit contributions
 beast-0.034
 crow-0.033
 hospital-0.032
 ig-0.031
 unfortunate-0.030
Token put
Token ID1234
Feature activation+2.93e+00

Pos logit contributions
packs+0.035
lets+0.035
aces+0.032
suits+0.031
 shared+0.031
Neg logit contributions
 beast-0.063
 crow-0.061
 hospital-0.060
 cliff-0.059
 desert-0.057
Token her
Token ID607
Feature activation-6.46e-01

Pos logit contributions
 beast+0.150
 crow+0.147
 hospital+0.145
 ig+0.137
 unfortunate+0.134
Neg logit contributions
packs-0.139
aces-0.134
lets-0.130
suits-0.127
 names-0.124
Token hands
Token ID2832
Feature activation-5.12e+00

Pos logit contributions
packs+0.028
aces+0.027
lets+0.027
suits+0.026
 names+0.025
Neg logit contributions
 beast-0.035
 hospital-0.035
 crow-0.034
 ig-0.033
 unfortunate-0.032
Token on
Token ID319
Feature activation-2.85e+00

Pos logit contributions
 names+0.272
aces+0.271
lets+0.270
packs+0.260
 interests+0.258
Neg logit contributions
 that-0.213
 hospital-0.200
 ig-0.195
 beast-0.187
 crow-0.181
Token her
Token ID607
Feature activation-6.37e-01

Pos logit contributions
packs+0.140
aces+0.131
 names+0.127
suits+0.126
lets+0.126
Neg logit contributions
 beast-0.145
 crow-0.141
 hospital-0.131
 eagle-0.129
 ig-0.128
Token hips
Token ID21497
Feature activation-8.77e+00

Pos logit contributions
packs+0.030
lets+0.028
aces+0.028
suits+0.027
 names+0.026
Neg logit contributions
 beast-0.034
 hospital-0.033
 crow-0.032
 ig-0.031
 unfortunate-0.030
Token and
Token ID290
Feature activation-9.84e-01

Pos logit contributions
packs+0.514
aces+0.488
lets+0.479
 journals+0.473
suits+0.467
Neg logit contributions
 that-0.341
 hospital-0.329
 beast-0.316
 ig-0.302
 crow-0.301
Token said
Token ID531
Feature activation+4.98e-02

Pos logit contributions
packs+0.043
aces+0.041
lets+0.040
suits+0.039
 names+0.038
Neg logit contributions
 beast-0.055
 crow-0.053
 hospital-0.052
 ig-0.050
 desert-0.049
Token,
Token ID11
Feature activation-5.21e-01

Pos logit contributions
 beast+0.002
 crow+0.002
 hospital+0.002
 that+0.002
 unfortunate+0.002
Neg logit contributions
packs-0.003
aces-0.003
lets-0.003
suits-0.002
 pads-0.002
Token "
Token ID366
Feature activation-2.02e+00

Pos logit contributions
packs+0.022
aces+0.021
lets+0.020
suits+0.020
 names+0.019
Neg logit contributions
 beast-0.029
 crow-0.029
 hospital-0.028
 ig-0.027
 unfortunate-0.026
TokenE
Token ID36
Feature activation+6.18e+00

Pos logit contributions
packs+0.108
aces+0.104
lets+0.103
suits+0.100
 names+0.099
Neg logit contributions
 beast-0.092
 crow-0.088
 hospital-0.088
 ig-0.084
 desert-0.080
Token.
Token ID13
Feature activation-2.18e-01

Pos logit contributions
packs+0.158
aces+0.151
lets+0.146
suits+0.142
 names+0.141
Neg logit contributions
 beast-0.175
 crow-0.170
 hospital-0.164
 ig-0.158
 unfortunate-0.155
Token Tim
Token ID5045
Feature activation-7.67e-01

Pos logit contributions
packs+0.011
aces+0.011
lets+0.010
suits+0.010
 journals+0.010
Neg logit contributions
 beast-0.010
 hospital-0.010
 crow-0.010
 ig-0.010
 desert-0.009
Tokenmy
Token ID1820
Feature activation-4.37e-01

Pos logit contributions
packs+0.027
aces+0.026
lets+0.024
suits+0.024
 names+0.023
Neg logit contributions
 beast-0.048
 hospital-0.047
 crow-0.047
 ig-0.046
 desert-0.044
Token saw
Token ID2497
Feature activation+3.20e+00

Pos logit contributions
packs+0.024
aces+0.023
lets+0.021
 names+0.021
 journals+0.021
Neg logit contributions
 hospital-0.020
 beast-0.019
 crow-0.019
 ig-0.018
 that-0.018
Token many
Token ID867
Feature activation-2.60e-01

Pos logit contributions
 beast+0.187
 crow+0.181
 cliff+0.173
asis+0.171
andel+0.171
Neg logit contributions
 names-0.115
aces-0.114
packs-0.114
 shirts-0.113
lets-0.103
Token guitars
Token ID32497
Feature activation-8.77e+00

Pos logit contributions
packs+0.013
aces+0.013
lets+0.013
suits+0.012
ver+0.011
Neg logit contributions
 beast-0.012
 hospital-0.012
 crow-0.012
 that-0.011
 unexpected-0.011
Token and
Token ID290
Feature activation-1.19e+00

Pos logit contributions
packs+0.460
aces+0.446
lets+0.430
suits+0.428
 journals+0.414
Neg logit contributions
 beast-0.394
 hospital-0.391
 crow-0.384
 that-0.367
 ig-0.362
Token he
Token ID339
Feature activation+1.73e+00

Pos logit contributions
packs+0.057
aces+0.054
suits+0.051
lets+0.050
 names+0.047
Neg logit contributions
 crow-0.060
 beast-0.060
 hospital-0.059
 that-0.056
 ig-0.056
Token didn
Token ID1422
Feature activation+2.42e-01

Pos logit contributions
 beast+0.093
 crow+0.093
 hospital+0.091
 ig+0.086
 unfortunate+0.084
Neg logit contributions
packs-0.079
aces-0.075
lets-0.068
suits-0.068
 journals-0.066
Token't
Token ID470
Feature activation-1.68e+00

Pos logit contributions
 beast+0.013
 crow+0.013
 desert+0.013
 ig+0.012
 hospital+0.012
Neg logit contributions
packs-0.010
aces-0.010
 names-0.009
lets-0.009
suits-0.009
Token know
Token ID760
Feature activation+1.87e+00

Pos logit contributions
packs+0.077
aces+0.074
lets+0.071
suits+0.070
 names+0.068
Neg logit contributions
 beast-0.089
 crow-0.088
 hospital-0.086
 ig-0.082
 unfortunate-0.080
Token with
Token ID351
Feature activation-7.08e-01

Pos logit contributions
packs+0.270
aces+0.263
lets+0.254
suits+0.249
 names+0.240
Neg logit contributions
 beast-0.249
 crow-0.244
 hospital-0.240
 ig-0.227
 desert-0.220
Token his
Token ID465
Feature activation-8.91e-01

Pos logit contributions
packs+0.036
aces+0.035
lets+0.034
suits+0.034
 names+0.032
Neg logit contributions
 beast-0.034
 crow-0.033
 hospital-0.032
 ig-0.031
 unfortunate-0.030
Token hands
Token ID2832
Feature activation-5.15e+00

Pos logit contributions
packs+0.045
aces+0.043
lets+0.042
suits+0.041
 names+0.040
Neg logit contributions
 beast-0.044
 crow-0.043
 hospital-0.042
 ig-0.039
 unfortunate-0.038
Token on
Token ID319
Feature activation-2.86e+00

Pos logit contributions
packs+0.289
aces+0.277
 journals+0.276
suits+0.275
lets+0.271
Neg logit contributions
 that-0.208
 crow-0.203
 beast-0.201
 hospital-0.193
 ig-0.191
Token his
Token ID465
Feature activation-8.88e-01

Pos logit contributions
packs+0.152
aces+0.146
lets+0.143
suits+0.141
 names+0.136
Neg logit contributions
 beast-0.133
 crow-0.130
 hospital-0.127
 ig-0.120
 desert-0.116
Token hips
Token ID21497
Feature activation-8.74e+00

Pos logit contributions
packs+0.040
aces+0.038
lets+0.037
suits+0.036
 names+0.035
Neg logit contributions
 beast-0.048
 crow-0.047
 hospital-0.046
 ig-0.044
 eagle-0.043
Token,
Token ID11
Feature activation-6.79e-01

Pos logit contributions
packs+0.538
suits+0.501
aces+0.499
lets+0.495
 journals+0.493
Neg logit contributions
 that-0.327
 beast-0.303
 hospital-0.297
 crow-0.295
 ig-0.293
Token waiting
Token ID4953
Feature activation+4.18e+00

Pos logit contributions
packs+0.034
aces+0.033
lets+0.032
suits+0.031
 names+0.031
Neg logit contributions
 beast-0.033
 crow-0.032
 hospital-0.031
 ig-0.030
 unfortunate-0.029
Token for
Token ID329
Feature activation+3.57e+00

Pos logit contributions
 beast+0.223
 crow+0.214
 hospital+0.209
 eagle+0.196
 unfortunate+0.195
Neg logit contributions
packs-0.202
aces-0.187
suits-0.178
lets-0.178
 names-0.177
Token the
Token ID262
Feature activation+7.47e-01

Pos logit contributions
 beast+0.193
 crow+0.186
 hospital+0.183
 ig+0.175
 cliff+0.172
Neg logit contributions
packs-0.157
aces-0.149
 names-0.144
lets-0.140
 shirts-0.138
Token k
Token ID479
Feature activation+2.71e+00

Pos logit contributions
 beast+0.033
 crow+0.031
 hospital+0.030
 ig+0.030
 desert+0.029
Neg logit contributions
 names-0.037
aces-0.036
packs-0.036
suits-0.035
lets-0.035
Token!
Token ID0
Feature activation+1.35e+00

Pos logit contributions
packs+0.078
 names+0.067
suits+0.063
 shirts+0.063
 journals+0.060
Neg logit contributions
 crow-0.101
 beast-0.100
 eagle-0.096
 hospital-0.095
 unfortunate-0.094
Token They
Token ID1119
Feature activation-2.84e+00

Pos logit contributions
 hospital+0.059
 beast+0.058
 crow+0.056
 that+0.052
 ig+0.052
Neg logit contributions
packs-0.074
lets-0.072
aces-0.070
 interests-0.066
suits-0.066
Token had
Token ID550
Feature activation-2.75e+00

Pos logit contributions
packs+0.123
aces+0.121
lets+0.110
suits+0.108
 interests+0.107
Neg logit contributions
 beast-0.155
 crow-0.154
 hospital-0.153
 ig-0.147
 that-0.143
Token drums
Token ID24730
Feature activation-5.88e+00

Pos logit contributions
packs+0.143
aces+0.138
lets+0.134
suits+0.132
 names+0.127
Neg logit contributions
 beast-0.128
 crow-0.126
 hospital-0.124
 ig-0.117
 unfortunate-0.113
Token and
Token ID290
Feature activation-1.07e+00

Pos logit contributions
packs+0.303
aces+0.292
lets+0.281
 journals+0.278
suits+0.277
Neg logit contributions
 beast-0.268
 hospital-0.261
 crow-0.260
 that-0.247
 ig-0.246
Token guitars
Token ID32497
Feature activation-8.73e+00

Pos logit contributions
packs+0.040
aces+0.037
lets+0.035
suits+0.034
 journals+0.032
Neg logit contributions
 beast-0.065
 crow-0.065
 hospital-0.063
 ig-0.061
 that-0.060
Token and
Token ID290
Feature activation-1.08e+00

Pos logit contributions
packs+0.465
aces+0.443
 journals+0.432
suits+0.425
lets+0.424
Neg logit contributions
 beast-0.379
 hospital-0.374
 crow-0.372
 that-0.367
 ig-0.345
Token a
Token ID257
Feature activation-1.03e+00

Pos logit contributions
packs+0.040
aces+0.037
lets+0.035
suits+0.035
ver+0.032
Neg logit contributions
 beast-0.066
 crow-0.066
 hospital-0.065
 ig-0.061
 that-0.061
Token singer
Token ID14015
Feature activation+5.53e-01

Pos logit contributions
packs+0.052
suits+0.050
lets+0.050
 names+0.050
aces+0.049
Neg logit contributions
 beast-0.047
 crow-0.045
 hospital-0.044
 ig-0.043
 unfortunate-0.042
Token who
Token ID508
Feature activation+4.44e+00

Pos logit contributions
 beast+0.026
 crow+0.026
 hospital+0.025
 ig+0.024
 that+0.023
Neg logit contributions
packs-0.028
aces-0.027
suits-0.026
lets-0.026
 journals-0.025
Token sang
Token ID25889
Feature activation-4.62e+00

Pos logit contributions
 beast+0.243
 crow+0.240
 hospital+0.236
 that+0.224
 ig+0.221
Neg logit contributions
packs-0.189
aces-0.184
lets-0.171
 journals-0.168
suits-0.166
Token put
Token ID1234
Feature activation+2.90e+00

Pos logit contributions
packs+0.033
aces+0.032
lets+0.032
 names+0.031
suits+0.030
Neg logit contributions
 beast-0.034
 crow-0.032
 hospital-0.032
 unfortunate-0.031
 ig-0.030
Token his
Token ID465
Feature activation-8.88e-01

Pos logit contributions
 beast+0.145
 crow+0.141
 hospital+0.139
 ig+0.132
 unfortunate+0.128
Neg logit contributions
packs-0.143
aces-0.137
lets-0.133
suits-0.130
 names-0.127
Token hands
Token ID2832
Feature activation-5.15e+00

Pos logit contributions
packs+0.037
aces+0.035
lets+0.034
suits+0.033
 names+0.032
Neg logit contributions
 beast-0.052
 crow-0.050
 hospital-0.049
 ig-0.047
 desert-0.046
Token on
Token ID319
Feature activation-2.78e+00

Pos logit contributions
packs+0.266
 interests+0.265
 journals+0.259
aces+0.258
 names+0.258
Neg logit contributions
 that-0.224
 ig-0.199
 beast-0.197
 crow-0.190
 hospital-0.182
Token his
Token ID465
Feature activation-8.70e-01

Pos logit contributions
packs+0.122
aces+0.121
 names+0.120
lets+0.114
 shirts+0.114
Neg logit contributions
 beast-0.139
 crow-0.139
 hospital-0.136
 unfortunate-0.135
 desert-0.131
Token hips
Token ID21497
Feature activation-8.73e+00

Pos logit contributions
packs+0.035
aces+0.034
lets+0.032
suits+0.031
 names+0.031
Neg logit contributions
 beast-0.051
 crow-0.050
 hospital-0.049
 ig-0.047
 unfortunate-0.046
Token,
Token ID11
Feature activation-6.29e-01

Pos logit contributions
packs+0.523
aces+0.471
suits+0.467
lets+0.466
 journals+0.465
Neg logit contributions
 that-0.343
 beast-0.330
 hospital-0.318
 crow-0.311
 ig-0.310
Token and
Token ID290
Feature activation-9.87e-01

Pos logit contributions
packs+0.033
aces+0.031
suits+0.030
lets+0.030
 names+0.029
Neg logit contributions
 beast-0.030
 crow-0.030
 hospital-0.029
 ig-0.027
 eagle-0.026
Token roared
Token ID45615
Feature activation-1.47e+00

Pos logit contributions
packs+0.047
aces+0.046
lets+0.045
 names+0.043
suits+0.043
Neg logit contributions
 beast-0.050
 crow-0.048
 hospital-0.048
 ig-0.046
 unfortunate-0.045
Token like
Token ID588
Feature activation+5.78e-01

Pos logit contributions
packs+0.074
aces+0.071
lets+0.069
suits+0.067
 names+0.067
Neg logit contributions
 beast-0.071
 crow-0.070
 hospital-0.068
 ig-0.065
 desert-0.063
Token a
Token ID257
Feature activation-6.39e-01

Pos logit contributions
 beast+0.031
 hospital+0.030
 crow+0.030
 unfortunate+0.028
 ig+0.028
Neg logit contributions
aces-0.024
 names-0.024
packs-0.024
lets-0.022
 journals-0.022
Token.
Token ID13
Feature activation+4.65e-01

Pos logit contributions
packs+0.025
aces+0.024
lets+0.024
suits+0.023
 names+0.022
Neg logit contributions
 beast-0.027
 crow-0.026
 hospital-0.026
 ig-0.024
 unfortunate-0.024
Token It
Token ID632
Feature activation-6.04e-01

Pos logit contributions
 beast+0.026
 crow+0.025
 hospital+0.025
 ig+0.024
 desert+0.023
Neg logit contributions
packs-0.020
aces-0.019
lets-0.019
suits-0.018
 names-0.018
Token is
Token ID318
Feature activation-1.52e-01

Pos logit contributions
packs+0.023
aces+0.022
lets+0.021
suits+0.020
 names+0.020
Neg logit contributions
 beast-0.037
 crow-0.036
 hospital-0.036
 ig-0.034
 unfortunate-0.033
Token an
Token ID281
Feature activation+4.02e-01

Pos logit contributions
packs+0.007
 names+0.007
aces+0.007
lets+0.007
 pads+0.006
Neg logit contributions
 beast-0.008
 hospital-0.007
 crow-0.007
 cliff-0.007
 unfortunate-0.007
Token animal
Token ID5044
Feature activation+3.81e+00

Pos logit contributions
 ven+0.014
 swarm+0.014
pour+0.014
Far+0.014
 cliff+0.014
Neg logit contributions
aces-0.022
 names-0.021
 shirts-0.020
packs-0.020
lets-0.019
Token that
Token ID326
Feature activation+5.91e+00

Pos logit contributions
 beast+0.203
 crow+0.197
 unfortunate+0.194
 hospital+0.193
andel+0.186
Neg logit contributions
packs-0.165
aces-0.155
lets-0.152
 names-0.146
suits-0.145
Token lives
Token ID3160
Feature activation+2.91e+00

Pos logit contributions
 beast+0.329
 crow+0.322
 hospital+0.317
 ig+0.304
 unfortunate+0.297
Neg logit contributions
packs-0.253
aces-0.241
lets-0.233
suits-0.226
 names-0.224
Token in
Token ID287
Feature activation-1.63e+00

Pos logit contributions
 beast+0.134
 crow+0.131
 hospital+0.128
 ig+0.122
 desert+0.118
Neg logit contributions
packs-0.150
aces-0.147
lets-0.143
suits-0.138
 names-0.138
Token the
Token ID262
Feature activation+7.07e-01

Pos logit contributions
packs+0.079
 names+0.067
 journals+0.064
 shirts+0.062
aces+0.062
Neg logit contributions
 beast-0.084
 crow-0.083
 eagle-0.080
 hospital-0.080
 unfortunate-0.076
Token snow
Token ID6729
Feature activation+3.42e+00

Pos logit contributions
 crow+0.040
 beast+0.040
 unfortunate+0.039
 eagle+0.039
 frightening+0.038
Neg logit contributions
packs-0.027
 names-0.026
aces-0.025
 journals-0.025
 pads-0.024
Token.
Token ID13
Feature activation+4.98e-01

Pos logit contributions
 beast+0.178
 crow+0.174
 hospital+0.170
 ig+0.162
 unfortunate+0.159
Neg logit contributions
packs-0.161
aces-0.154
lets-0.149
suits-0.145
 names-0.142
Token then
Token ID788
Feature activation+2.14e+00

Pos logit contributions
packs+0.010
aces+0.010
lets+0.010
suits+0.010
 names+0.009
Neg logit contributions
 hospital-0.005
 beast-0.005
 crow-0.005
 ig-0.004
 that-0.004
Token Mom
Token ID11254
Feature activation-1.41e+00

Pos logit contributions
 beast+0.091
 crow+0.090
 hospital+0.089
 that+0.085
 ig+0.082
Neg logit contributions
aces-0.123
packs-0.118
lets-0.114
suits-0.109
 pads-0.107
Token said
Token ID531
Feature activation-1.91e-01

Pos logit contributions
packs+0.066
aces+0.063
lets+0.061
suits+0.059
 names+0.058
Neg logit contributions
 beast-0.075
 crow-0.073
 hospital-0.072
 ig-0.068
 desert-0.066
Token,
Token ID11
Feature activation-7.31e-01

Pos logit contributions
packs+0.010
aces+0.009
lets+0.009
suits+0.009
 names+0.009
Neg logit contributions
 crow-0.009
 beast-0.009
 hospital-0.009
 ig-0.008
 unfortunate-0.008
Token "
Token ID366
Feature activation-2.28e+00

Pos logit contributions
packs+0.034
aces+0.032
lets+0.031
suits+0.030
 names+0.029
Neg logit contributions
 beast-0.039
 crow-0.038
 hospital-0.038
 ig-0.036
 desert-0.035
TokenOh
Token ID5812
Feature activation+6.08e+00

Pos logit contributions
packs+0.127
aces+0.123
lets+0.119
suits+0.116
 names+0.114
Neg logit contributions
 beast-0.100
 crow-0.098
 hospital-0.096
 ig-0.089
 unfortunate-0.087
Token no
Token ID645
Feature activation+1.67e+00

Pos logit contributions
 beast+0.299
 hospital+0.293
 crow+0.292
 ig+0.277
 unfortunate+0.269
Neg logit contributions
packs-0.306
aces-0.300
lets-0.289
suits-0.282
 journals-0.273
Token,
Token ID11
Feature activation-7.06e-01

Pos logit contributions
 beast+0.086
 crow+0.084
 hospital+0.081
 unfortunate+0.076
 eagle+0.076
Neg logit contributions
packs-0.081
aces-0.075
lets-0.074
suits-0.073
 names-0.073
Token the
Token ID262
Feature activation+6.52e-01

Pos logit contributions
packs+0.042
aces+0.040
lets+0.039
suits+0.038
 names+0.037
Neg logit contributions
 beast-0.028
 crow-0.028
 hospital-0.028
 ig-0.026
 unfortunate-0.025
Token car
Token ID1097
Feature activation+1.60e+00

Pos logit contributions
 beast+0.029
 crow+0.028
 hospital+0.027
 ig+0.026
 unfortunate+0.026
Neg logit contributions
packs-0.035
aces-0.034
 names-0.033
lets-0.033
suits-0.032
Token is
Token ID318
Feature activation-1.51e-01

Pos logit contributions
 beast+0.097
 crow+0.095
 hospital+0.092
 ig+0.090
 desert+0.089
Neg logit contributions
packs-0.061
aces-0.058
lets-0.055
suits-0.054
 names-0.053
Token and
Token ID290
Feature activation-7.24e-01

Pos logit contributions
 beast+0.096
 crow+0.092
 hospital+0.085
 unfortunate+0.083
 eagle+0.081
Neg logit contributions
packs-0.099
aces-0.086
lets-0.086
suits-0.085
 names-0.084
Token comfortable
Token ID6792
Feature activation-3.24e-01

Pos logit contributions
packs+0.036
lets+0.033
aces+0.032
 mattered+0.032
 pads+0.032
Neg logit contributions
 beast-0.036
 crow-0.035
 hospital-0.032
 eagle-0.031
 hawk-0.031
Token home
Token ID1363
Feature activation+3.30e+00

Pos logit contributions
packs+0.017
aces+0.016
lets+0.015
suits+0.015
 names+0.015
Neg logit contributions
 beast-0.016
 crow-0.015
 hospital-0.014
 unfortunate-0.014
 eagle-0.014
Token.
Token ID13
Feature activation+2.58e-01

Pos logit contributions
 beast+0.178
 crow+0.174
 hospital+0.168
 ig+0.159
 desert+0.157
Neg logit contributions
packs-0.151
aces-0.144
lets-0.139
suits-0.135
 names-0.133
Token But
Token ID887
Feature activation+1.13e+00

Pos logit contributions
 beast+0.014
 crow+0.014
 hospital+0.014
 ig+0.013
 unfortunate+0.013
Neg logit contributions
packs-0.011
aces-0.011
lets-0.010
 names-0.010
suits-0.010
Token that
Token ID326
Feature activation+6.05e+00

Pos logit contributions
 crow+0.054
 beast+0.053
 hospital+0.050
andel+0.050
 desert+0.048
Neg logit contributions
packs-0.053
 mattered-0.049
ver-0.048
aces-0.048
lets-0.047
Token tiger
Token ID26241
Feature activation+1.52e+00

Pos logit contributions
 beast+0.309
 crow+0.300
 hospital+0.295
 ig+0.281
 unfortunate+0.273
Neg logit contributions
packs-0.294
aces-0.284
lets-0.276
suits-0.270
 names-0.262
Token always
Token ID1464
Feature activation+1.50e+00

Pos logit contributions
 beast+0.084
 crow+0.082
 hospital+0.080
 ig+0.078
 desert+0.077
Neg logit contributions
packs-0.062
aces-0.061
lets-0.059
suits-0.056
 names-0.055
Token smiled
Token ID13541
Feature activation-6.24e+00

Pos logit contributions
 beast+0.084
 crow+0.082
 hospital+0.081
 ig+0.077
 unfortunate+0.075
Neg logit contributions
packs-0.065
aces-0.061
lets-0.060
suits-0.058
 names-0.057
Token when
Token ID618
Feature activation+1.86e+00

Pos logit contributions
packs+0.276
aces+0.263
lets+0.257
 names+0.244
suits+0.241
Neg logit contributions
 beast-0.345
 crow-0.340
 hospital-0.331
 ig-0.313
 desert-0.308
Token it
Token ID340
Feature activation-4.97e-01

Pos logit contributions
 beast+0.081
 crow+0.081
 hospital+0.078
 ig+0.075
 unfortunate+0.073
Neg logit contributions
packs-0.096
aces-0.090
 names-0.088
 mattered-0.087
lets-0.086
Token It
Token ID632
Feature activation-8.13e-01

Pos logit contributions
 beast+0.007
 crow+0.007
 hospital+0.007
 ig+0.007
 unfortunate+0.007
Neg logit contributions
packs-0.005
aces-0.005
lets-0.005
suits-0.005
 names-0.005
Token was
Token ID373
Feature activation+7.26e-01

Pos logit contributions
packs+0.029
aces+0.028
lets+0.028
suits+0.027
 mattered+0.026
Neg logit contributions
 beast-0.049
 crow-0.048
 hospital-0.046
 ig-0.045
andel-0.044
Token going
Token ID1016
Feature activation+1.49e+00

Pos logit contributions
 beast+0.037
 crow+0.035
 hospital+0.034
 eagle+0.033
 cliff+0.032
Neg logit contributions
packs-0.035
aces-0.034
 names-0.033
 pads-0.031
lets-0.031
Token so
Token ID523
Feature activation+3.04e+00

Pos logit contributions
 beast+0.078
 crow+0.078
 hospital+0.074
 ig+0.072
 eagle+0.070
Neg logit contributions
packs-0.070
aces-0.066
suits-0.065
lets-0.065
 names-0.063
Token fast
Token ID3049
Feature activation+7.35e-01

Pos logit contributions
 beast+0.177
 crow+0.169
 eagle+0.161
 hospital+0.159
 ig+0.159
Neg logit contributions
packs-0.125
suits-0.124
 mattered-0.118
ver-0.115
 names-0.114
Token that
Token ID326
Feature activation+5.77e+00

Pos logit contributions
 crow+0.034
 beast+0.033
 ig+0.032
 hospital+0.029
 ven+0.029
Neg logit contributions
packs-0.038
aces-0.038
 pads-0.036
lets-0.035
suits-0.035
Token she
Token ID673
Feature activation+1.85e+00

Pos logit contributions
 beast+0.271
 crow+0.263
 hospital+0.254
 ig+0.244
 eagle+0.235
Neg logit contributions
packs-0.301
aces-0.287
lets-0.280
 names-0.272
suits-0.270
Token couldn
Token ID3521
Feature activation+3.53e+00

Pos logit contributions
 beast+0.111
 crow+0.108
 hospital+0.106
 ig+0.104
 desert+0.101
Neg logit contributions
packs-0.069
aces-0.068
lets-0.067
suits-0.065
 pads-0.061
Token't
Token ID470
Feature activation-1.52e+00

Pos logit contributions
 beast+0.200
 crow+0.199
 ig+0.191
 desert+0.188
 hospital+0.186
Neg logit contributions
aces-0.141
packs-0.140
lets-0.125
 names-0.124
 mattered-0.122
Token keep
Token ID1394
Feature activation+3.06e+00

Pos logit contributions
aces+0.061
 hips+0.057
packs+0.057
 yes+0.057
 names+0.055
Neg logit contributions
 beast-0.081
 crow-0.081
 desert-0.079
 ig-0.079
 hospital-0.075
Token up
Token ID510
Feature activation-1.96e+00

Pos logit contributions
 beast+0.142
 crow+0.138
 hospital+0.135
 ig+0.128
 unfortunate+0.124
Neg logit contributions
packs-0.162
aces-0.156
lets-0.152
suits-0.149
 names-0.146
Token,
Token ID11
Feature activation-6.02e-01

Pos logit contributions
 beast+0.032
 hospital+0.032
 crow+0.032
 ig+0.029
 unfortunate+0.028
Neg logit contributions
packs-0.048
aces-0.047
lets-0.045
suits-0.045
 names-0.044
Token they
Token ID484
Feature activation-3.40e+00

Pos logit contributions
aces+0.031
packs+0.030
suits+0.028
lets+0.028
 names+0.027
Neg logit contributions
 hospital-0.029
 crow-0.028
 beast-0.027
 that-0.027
 ig-0.027
Token did
Token ID750
Feature activation-2.55e+00

Pos logit contributions
packs+0.154
aces+0.147
lets+0.143
suits+0.139
 names+0.137
Neg logit contributions
 beast-0.183
 crow-0.179
 hospital-0.176
 ig-0.168
 unfortunate-0.164
Token not
Token ID407
Feature activation-7.38e-01

Pos logit contributions
packs+0.142
aces+0.140
lets+0.136
suits+0.132
 journals+0.129
Neg logit contributions
 beast-0.111
 hospital-0.109
 crow-0.108
 ig-0.099
 unfortunate-0.098
Token know
Token ID760
Feature activation+1.99e+00

Pos logit contributions
packs+0.034
 names+0.031
aces+0.031
lets+0.031
suits+0.031
Neg logit contributions
 beast-0.039
 crow-0.037
 hospital-0.036
 ig-0.036
 unfortunate-0.035
Token that
Token ID326
Feature activation+5.84e+00

Pos logit contributions
 beast+0.074
 crow+0.070
 hospital+0.068
 ig+0.062
 eagle+0.061
Neg logit contributions
packs-0.125
aces-0.119
lets-0.116
 names-0.114
suits-0.113
Token there
Token ID612
Feature activation-5.14e+00

Pos logit contributions
 beast+0.268
 crow+0.261
 hospital+0.257
 ig+0.242
 desert+0.235
Neg logit contributions
packs-0.311
aces-0.302
lets-0.294
suits-0.287
 names-0.281
Token were
Token ID547
Feature activation-1.63e+00

Pos logit contributions
packs+0.251
aces+0.250
lets+0.244
suits+0.238
 journals+0.229
Neg logit contributions
 beast-0.253
 hospital-0.251
 crow-0.248
 ig-0.236
 unfortunate-0.228
Token big
Token ID1263
Feature activation+3.40e+00

Pos logit contributions
packs+0.079
aces+0.075
 names+0.073
lets+0.072
suits+0.071
Neg logit contributions
 beast-0.082
 crow-0.079
 hospital-0.077
 ig-0.073
 eagle-0.072
Token,
Token ID11
Feature activation-5.85e-01

Pos logit contributions
 beast+0.162
 crow+0.159
 hospital+0.155
 unfortunate+0.146
 ig+0.146
Neg logit contributions
packs-0.170
aces-0.162
suits-0.156
lets-0.156
 names-0.155
Token mean
Token ID1612
Feature activation+4.71e-01

Pos logit contributions
packs+0.034
aces+0.032
lets+0.031
suits+0.031
 names+0.031
Neg logit contributions
 beast-0.024
 crow-0.023
 hospital-0.023
 ig-0.021
 unfortunate-0.021
Token the
Token ID262
Feature activation+9.08e-01

Pos logit contributions
 beast+0.031
 crow+0.028
 eagle+0.028
ule+0.028
 unfortunate+0.028
Neg logit contributions
packs-0.021
 names-0.020
lets-0.017
 chooses-0.017
 shared-0.017
Token chim
Token ID18205
Feature activation+4.72e+00

Pos logit contributions
 beast+0.039
rim+0.036
 ogre+0.035
aster+0.035
ister+0.035
Neg logit contributions
 dolls-0.042
 wrists-0.042
 names-0.042
 toys-0.041
 baskets-0.041
Tokenney
Token ID1681
Feature activation+4.10e+00

Pos logit contributions
 beast+0.260
 crow+0.247
 unfortunate+0.233
 hospital+0.231
 evil+0.230
Neg logit contributions
packs-0.233
lets-0.208
aces-0.207
suits-0.202
 pads-0.182
Token and
Token ID290
Feature activation-7.42e-01

Pos logit contributions
 beast+0.226
 crow+0.220
 hospital+0.214
 ig+0.203
 unfortunate+0.202
Neg logit contributions
packs-0.183
aces-0.174
lets-0.169
suits-0.164
 names-0.158
Token rescued
Token ID19868
Feature activation+1.90e+00

Pos logit contributions
packs+0.027
lets+0.027
suits+0.025
aces+0.025
 shared+0.024
Neg logit contributions
 beast-0.045
 crow-0.043
 hospital-0.042
 ig-0.041
 unfortunate-0.041
Token the
Token ID262
Feature activation+1.03e+00

Pos logit contributions
 beast+0.090
 crow+0.082
 hospital+0.079
 ig+0.076
 desert+0.075
Neg logit contributions
 names-0.098
packs-0.096
lets-0.093
aces-0.090
 mattered-0.086
Token k
Token ID479
Feature activation+2.78e+00

Pos logit contributions
 beast+0.047
 crow+0.043
 hospital+0.043
aster+0.042
 desert+0.041
Neg logit contributions
 names-0.048
lets-0.047
packs-0.046
aces-0.046
 dolls-0.044
Tokenitty
Token ID9760
Feature activation+3.00e+00

Pos logit contributions
 beast+0.145
 hospital+0.140
 crow+0.136
 unfortunate+0.131
 ig+0.130
Neg logit contributions
packs-0.134
lets-0.129
aces-0.124
suits-0.119
 pads-0.111
Token.
Token ID13
Feature activation+2.22e-01

Pos logit contributions
 beast+0.175
 crow+0.171
 hospital+0.165
 ig+0.159
 unfortunate+0.158
Neg logit contributions
packs-0.123
lets-0.116
aces-0.115
suits-0.107
 names-0.107
Token The
Token ID383
Feature activation+6.18e-01

Pos logit contributions
 beast+0.013
 crow+0.012
 hospital+0.012
 ig+0.011
 unfortunate+0.011
Neg logit contributions
packs-0.009
aces-0.009
lets-0.009
 names-0.008
suits-0.008
Token k
Token ID479
Feature activation+2.73e+00

Pos logit contributions
 beast+0.028
 crow+0.025
 desert+0.025
 hospital+0.025
 ogre+0.025
Neg logit contributions
 names-0.030
lets-0.029
aces-0.029
packs-0.029
 dolls-0.027
Token can
Token ID460
Feature activation-1.52e+00

Pos logit contributions
packs+0.007
suits+0.006
aces+0.006
lets+0.006
 shirts+0.006
Neg logit contributions
 beast-0.010
 hospital-0.009
 crow-0.009
 that-0.009
 eagle-0.009
Token do
Token ID466
Feature activation+2.90e+00

Pos logit contributions
aces+0.067
lets+0.065
packs+0.065
 names+0.064
 chooses+0.059
Neg logit contributions
 beast-0.080
 crow-0.077
 desert-0.075
 ig-0.074
 hospital-0.074
Token it
Token ID340
Feature activation-5.71e-01

Pos logit contributions
 beast+0.141
 crow+0.140
 hospital+0.133
 ig+0.131
 desert+0.130
Neg logit contributions
aces-0.137
packs-0.136
lets-0.134
 names-0.131
 mattered-0.125
Token if
Token ID611
Feature activation+7.47e-01

Pos logit contributions
packs+0.031
suits+0.028
aces+0.028
lets+0.028
 journals+0.027
Neg logit contributions
 beast-0.026
 hospital-0.025
 crow-0.025
 ig-0.024
 unfortunate-0.023
Token you
Token ID345
Feature activation+5.28e-01

Pos logit contributions
 beast+0.029
 crow+0.028
 hospital+0.028
 ig+0.026
 desert+0.025
Neg logit contributions
packs-0.044
aces-0.043
lets-0.042
suits-0.041
 names-0.041
Token keep
Token ID1394
Feature activation+2.99e+00

Pos logit contributions
 beast+0.027
 crow+0.026
 hospital+0.026
 ig+0.024
 unfortunate+0.024
Neg logit contributions
packs-0.026
aces-0.024
lets-0.023
suits-0.023
 names-0.022
Token trying
Token ID2111
Feature activation+1.97e-01

Pos logit contributions
 beast+0.133
 crow+0.130
 hospital+0.127
 ig+0.119
 unfortunate+0.116
Neg logit contributions
packs-0.164
aces-0.158
lets-0.154
suits-0.151
 names-0.148
Token."
Token ID526
Feature activation-2.18e+00

Pos logit contributions
 beast+0.009
 hospital+0.008
 crow+0.008
 that+0.008
 ig+0.008
Neg logit contributions
packs-0.010
aces-0.010
lets-0.010
suits-0.010
 journals-0.009
Token
Token ID198
Feature activation+3.67e-01

Pos logit contributions
packs+0.098
aces+0.093
lets+0.091
suits+0.088
 names+0.087
Neg logit contributions
 beast-0.118
 crow-0.116
 hospital-0.112
 ig-0.109
 unfortunate-0.106
Token
Token ID198
Feature activation+3.50e-01

Pos logit contributions
 beast+0.024
 crow+0.023
 ig+0.023
 hospital+0.023
 desert+0.023
Neg logit contributions
packs-0.011
suits-0.010
aces-0.010
 names-0.010
ver-0.010
TokenThe
Token ID464
Feature activation+6.99e-01

Pos logit contributions
 ig+0.020
 beast+0.020
 crow+0.020
 hospital+0.019
 desert+0.019
Neg logit contributions
lets-0.013
packs-0.013
 interests-0.012
aces-0.012
suits-0.012
Token happy
Token ID3772
Feature activation-2.21e+00

Pos logit contributions
 beast+0.020
 crow+0.019
 hospital+0.018
 ig+0.017
 eagle+0.017
Neg logit contributions
packs-0.021
aces-0.021
 names-0.020
lets-0.020
suits-0.019
Token and
Token ID290
Feature activation-1.17e+00

Pos logit contributions
packs+0.100
lets+0.096
aces+0.095
 names+0.091
 mattered+0.089
Neg logit contributions
 beast-0.116
 crow-0.112
 hospital-0.111
 ig-0.107
 unfortunate-0.105
Token together
Token ID1978
Feature activation-8.01e+00

Pos logit contributions
packs+0.054
aces+0.051
lets+0.050
suits+0.049
 names+0.048
Neg logit contributions
 beast-0.063
 crow-0.061
 hospital-0.060
 ig-0.057
 unfortunate-0.055
Token they
Token ID484
Feature activation-3.48e+00

Pos logit contributions
packs+0.354
aces+0.339
lets+0.324
suits+0.321
 names+0.309
Neg logit contributions
 beast-0.442
 crow-0.436
 hospital-0.427
 ig-0.408
 desert-0.400
Token collected
Token ID7723
Feature activation-5.47e-01

Pos logit contributions
packs+0.152
aces+0.147
lets+0.134
suits+0.134
 names+0.132
Neg logit contributions
 beast-0.187
 crow-0.183
 hospital-0.178
 ig-0.178
 desert-0.174
Token the
Token ID262
Feature activation+7.31e-01

Pos logit contributions
packs+0.026
aces+0.025
lets+0.025
suits+0.024
 names+0.023
Neg logit contributions
 beast-0.027
 crow-0.027
 hospital-0.026
 ig-0.025
 desert-0.025
Token blue
Token ID4171
Feature activation-5.60e+00

Pos logit contributions
 beast+0.036
 crow+0.035
 hospital+0.034
 ig+0.033
 desert+0.032
Neg logit contributions
packs-0.037
aces-0.035
lets-0.034
suits-0.033
 names-0.032
Token shells
Token ID19679
Feature activation-3.19e+00

Pos logit contributions
packs+0.319
aces+0.309
lets+0.300
suits+0.293
 journals+0.282
Neg logit contributions
 beast-0.233
 crow-0.229
 hospital-0.216
 ig-0.213
 desert-0.212
Token which
Token ID543
Feature activation+3.04e+00

Pos logit contributions
packs+0.156
aces+0.152
 names+0.148
lets+0.147
suits+0.146
Neg logit contributions
 beast-0.145
 crow-0.144
 that-0.140
 hospital-0.139
 ig-0.138
Token the
Token ID262
Feature activation+6.97e-01

Pos logit contributions
 crow+0.177
 beast+0.176
 hospital+0.172
 desert+0.165
 ig+0.165
Neg logit contributions
packs-0.123
aces-0.119
lets-0.115
suits-0.110
 names-0.106
Token her
Token ID607
Feature activation-8.06e-01

Pos logit contributions
 beast+0.030
 crow+0.029
 hospital+0.029
 ig+0.027
 unfortunate+0.026
Neg logit contributions
packs-0.039
aces-0.038
lets-0.037
 names-0.036
suits-0.036
Token Bounce
Token ID41474
Feature activation-2.09e+00

Pos logit contributions
aces+0.026
lets+0.024
packs+0.024
 names+0.024
 shirts+0.023
Neg logit contributions
 beast-0.038
 ig-0.037
 crow-0.037
 desert-0.035
 hospital-0.034
Token cried
Token ID16896
Feature activation+1.23e+00

Pos logit contributions
aces+0.084
lets+0.083
packs+0.081
suits+0.076
 pads+0.074
Neg logit contributions
 beast-0.121
 crow-0.116
 ig-0.116
 hospital-0.115
 desert-0.113
Token.
Token ID13
Feature activation+3.59e-01

Pos logit contributions
 beast+0.070
 crow+0.069
 hospital+0.066
 desert+0.066
 ig+0.065
Neg logit contributions
aces-0.050
packs-0.049
lets-0.047
suits-0.045
 names-0.045
Token Yoga
Token ID32856
Feature activation-2.08e+00

Pos logit contributions
 beast+0.020
 crow+0.020
 hospital+0.019
 ig+0.018
 desert+0.018
Neg logit contributions
packs-0.015
aces-0.015
lets-0.014
suits-0.014
 names-0.014
Token felt
Token ID2936
Feature activation+3.20e+00

Pos logit contributions
packs+0.101
aces+0.095
lets+0.093
suits+0.090
 names+0.089
Neg logit contributions
 beast-0.106
 crow-0.103
 hospital-0.102
 ig-0.095
 unfortunate-0.094
Token sad
Token ID6507
Feature activation+2.56e+00

Pos logit contributions
 beast+0.159
 crow+0.152
 ig+0.151
 desert+0.147
 cliff+0.142
Neg logit contributions
 mattered-0.153
aces-0.141
 swapped-0.139
 shared-0.138
 matched-0.137
Token because
Token ID780
Feature activation+2.94e+00

Pos logit contributions
 crow+0.163
 beast+0.158
 desert+0.150
 ig+0.149
 cliff+0.147
Neg logit contributions
aces-0.090
packs-0.082
lets-0.080
 names-0.077
 mattered-0.075
Token Bounce
Token ID41474
Feature activation-2.12e+00

Pos logit contributions
 crow+0.145
 beast+0.144
 hospital+0.134
 ig+0.132
 desert+0.130
Neg logit contributions
packs-0.137
aces-0.134
 mattered-0.131
lets-0.125
 shirts-0.122
Token got
Token ID1392
Feature activation+2.41e+00

Pos logit contributions
packs+0.104
aces+0.094
lets+0.092
suits+0.091
 journals+0.090
Neg logit contributions
 beast-0.108
 hospital-0.105
 crow-0.105
 unfortunate-0.099
 that-0.097
Token hurt
Token ID5938
Feature activation+1.82e+00

Pos logit contributions
 beast+0.093
 crow+0.089
 hospital+0.086
 ig+0.082
 desert+0.080
Neg logit contributions
packs-0.146
aces-0.142
lets-0.138
 names-0.135
suits-0.134
Token.
Token ID13
Feature activation+3.09e-01

Pos logit contributions
 beast+0.095
 crow+0.094
 hospital+0.091
 ig+0.087
 desert+0.086
Neg logit contributions
packs-0.086
aces-0.083
lets-0.080
suits-0.077
 names-0.076
Token clock
Token ID8801
Feature activation+4.16e-01

Pos logit contributions
 beast+0.058
 crow+0.057
 hospital+0.055
 ig+0.052
 desert+0.051
Neg logit contributions
packs-0.060
aces-0.057
lets-0.056
 names-0.055
suits-0.055
Token!"
Token ID2474
Feature activation-1.12e-01

Pos logit contributions
 beast+0.021
 crow+0.021
 hospital+0.020
 ig+0.019
 desert+0.019
Neg logit contributions
packs-0.020
aces-0.019
lets-0.019
suits-0.018
 names-0.017
Token Lily
Token ID20037
Feature activation-5.78e-01

Pos logit contributions
packs+0.004
aces+0.003
lets+0.003
 names+0.003
 shirts+0.003
Neg logit contributions
 beast-0.008
 crow-0.007
 ig-0.007
 eagle-0.007
 cliff-0.007
Token said
Token ID531
Feature activation+5.54e-01

Pos logit contributions
packs+0.026
aces+0.025
lets+0.025
suits+0.024
 names+0.023
Neg logit contributions
 beast-0.031
 crow-0.030
 hospital-0.030
 ig-0.028
 unfortunate-0.028
Token,
Token ID11
Feature activation-3.40e-01

Pos logit contributions
 hospital+0.023
 beast+0.022
 crow+0.022
 that+0.022
 ig+0.020
Neg logit contributions
packs-0.031
aces-0.030
lets-0.029
suits-0.029
 names-0.029
Token as
Token ID355
Feature activation+1.15e+00

Pos logit contributions
packs+0.015
aces+0.014
lets+0.014
suits+0.013
 names+0.013
Neg logit contributions
 beast-0.018
 crow-0.018
 hospital-0.017
 ig-0.017
 unfortunate-0.017
Token the
Token ID262
Feature activation+1.20e+00

Pos logit contributions
 beast+0.051
 crow+0.049
 hospital+0.048
 ig+0.046
 desert+0.044
Neg logit contributions
packs-0.063
aces-0.060
lets-0.059
suits-0.057
 names-0.057
Token clock
Token ID8801
Feature activation+3.89e-01

Pos logit contributions
 beast+0.054
 crow+0.052
 hospital+0.052
 ig+0.048
 unfortunate+0.047
Neg logit contributions
packs-0.066
aces-0.063
lets-0.062
suits-0.061
 names-0.059
Token started
Token ID2067
Feature activation-1.34e+00

Pos logit contributions
 beast+0.021
 crow+0.020
 hospital+0.020
 ig+0.019
 desert+0.019
Neg logit contributions
packs-0.017
aces-0.017
lets-0.016
suits-0.016
 names-0.015
Token to
Token ID284
Feature activation+6.51e-01

Pos logit contributions
packs+0.066
aces+0.062
lets+0.061
 names+0.058
suits+0.058
Neg logit contributions
 beast-0.068
 crow-0.066
 hospital-0.064
 ig-0.060
 unfortunate-0.060
Token tick
Token ID4378
Feature activation+6.09e-01

Pos logit contributions
 beast+0.033
 crow+0.033
 hospital+0.031
 desert+0.030
 unfortunate+0.030
Neg logit contributions
packs-0.031
aces-0.029
lets-0.028
 names-0.028
suits-0.027
Token Ann
Token ID5506
Feature activation-4.87e-01

Pos logit contributions
packs+0.043
aces+0.041
lets+0.040
suits+0.039
 names+0.038
Neg logit contributions
 beast-0.054
 crow-0.053
 hospital-0.052
 ig-0.050
 desert-0.049
Token was
Token ID373
Feature activation+9.99e-02

Pos logit contributions
packs+0.020
aces+0.019
lets+0.019
suits+0.018
 names+0.018
Neg logit contributions
 beast-0.028
 crow-0.027
 hospital-0.027
 ig-0.026
 desert-0.025
Token also
Token ID635
Feature activation-2.24e+00

Pos logit contributions
 beast+0.005
 crow+0.005
 hospital+0.004
 ig+0.004
 cliff+0.004
Neg logit contributions
aces-0.005
packs-0.005
 names-0.005
lets-0.005
 mattered-0.004
Token busy
Token ID8179
Feature activation+1.05e+00

Pos logit contributions
packs+0.118
aces+0.115
lets+0.111
suits+0.108
 names+0.108
Neg logit contributions
 beast-0.104
 crow-0.099
 hospital-0.098
 ig-0.094
 desert-0.090
Token with
Token ID351
Feature activation-6.79e-01

Pos logit contributions
 beast+0.054
 crow+0.053
 hospital+0.051
 ig+0.049
 unfortunate+0.048
Neg logit contributions
packs-0.051
aces-0.048
lets-0.047
suits-0.045
 names-0.045
Token her
Token ID607
Feature activation-5.34e-01

Pos logit contributions
packs+0.031
aces+0.030
lets+0.028
 names+0.028
suits+0.027
Neg logit contributions
 beast-0.036
 crow-0.036
 hospital-0.035
 ig-0.033
 desert-0.032
Token operation
Token ID4905
Feature activation+3.84e+00

Pos logit contributions
packs+0.024
aces+0.023
lets+0.022
suits+0.021
 names+0.021
Neg logit contributions
 beast-0.029
 crow-0.028
 hospital-0.028
 ig-0.027
 unfortunate-0.026
Token,
Token ID11
Feature activation-5.70e-01

Pos logit contributions
 beast+0.196
 crow+0.191
 hospital+0.187
 ig+0.179
 desert+0.174
Neg logit contributions
packs-0.183
aces-0.176
lets-0.170
suits-0.166
 names-0.163
Token carrying
Token ID6872
Feature activation+2.94e+00

Pos logit contributions
packs+0.027
aces+0.025
lets+0.025
 names+0.024
 shirts+0.023
Neg logit contributions
 beast-0.030
 crow-0.029
 hospital-0.028
 ig-0.027
 eagle-0.027
Token food
Token ID2057
Feature activation-5.66e-01

Pos logit contributions
 beast+0.155
 crow+0.152
 hospital+0.149
 ig+0.142
 desert+0.138
Neg logit contributions
packs-0.136
aces-0.130
lets-0.125
suits-0.122
 names-0.121
Token back
Token ID736
Feature activation+1.63e+00

Pos logit contributions
packs+0.027
aces+0.026
lets+0.025
suits+0.025
 names+0.024
Neg logit contributions
 beast-0.029
 crow-0.028
 hospital-0.028
 ig-0.026
 unfortunate-0.026
Token and
Token ID290
Feature activation-1.16e+00

Pos logit contributions
 beast+0.043
 hospital+0.042
 crow+0.042
 ig+0.040
 desert+0.039
Neg logit contributions
packs-0.048
aces-0.046
lets-0.045
suits-0.043
 names-0.043
Token all
Token ID477
Feature activation-2.08e+00

Pos logit contributions
packs+0.055
aces+0.054
suits+0.051
lets+0.050
 names+0.047
Neg logit contributions
 crow-0.059
 hospital-0.058
 beast-0.058
 ig-0.056
 desert-0.056
Token the
Token ID262
Feature activation+4.35e-01

Pos logit contributions
packs+0.123
aces+0.122
lets+0.120
suits+0.119
 names+0.113
Neg logit contributions
 hospital-0.078
 beast-0.078
 crow-0.077
 ig-0.075
 desert-0.073
Token magic
Token ID5536
Feature activation-5.36e-01

Pos logit contributions
 desert+0.018
 crow+0.017
 hospital+0.017
 beast+0.017
 ig+0.017
Neg logit contributions
packs-0.026
lets-0.025
aces-0.025
ver-0.024
suits-0.023
Token that
Token ID326
Feature activation+5.50e+00

Pos logit contributions
packs+0.025
aces+0.024
lets+0.023
suits+0.022
 names+0.022
Neg logit contributions
 beast-0.028
 crow-0.028
 hospital-0.027
 ig-0.026
 unfortunate-0.025
Token came
Token ID1625
Feature activation-1.03e-01

Pos logit contributions
 hospital+0.216
 crow+0.214
 beast+0.213
 ig+0.200
 desert+0.200
Neg logit contributions
packs-0.324
aces-0.316
lets-0.313
suits-0.312
 shirts-0.293
Token with
Token ID351
Feature activation-8.64e-01

Pos logit contributions
packs+0.005
aces+0.005
lets+0.005
 names+0.005
suits+0.004
Neg logit contributions
 beast-0.005
 crow-0.005
 hospital-0.005
 ig-0.005
 unfortunate-0.005
Token it
Token ID340
Feature activation-9.74e-01

Pos logit contributions
packs+0.040
aces+0.039
lets+0.038
suits+0.037
 names+0.035
Neg logit contributions
 beast-0.044
 crow-0.044
 hospital-0.043
 ig-0.041
 desert-0.040
Token.
Token ID13
Feature activation-2.13e-01

Pos logit contributions
packs+0.050
aces+0.049
lets+0.048
suits+0.048
 names+0.045
Neg logit contributions
 hospital-0.044
 beast-0.043
 crow-0.043
 ig-0.041
 that-0.040
Token One
Token ID1881
Feature activation-2.64e+00

Pos logit contributions
packs+0.009
aces+0.009
lets+0.009
suits+0.009
 journals+0.008
Neg logit contributions
 hospital-0.012
 crow-0.011
 beast-0.011
 ig-0.011
 desert-0.011
Token day
Token ID1110
Feature activation+1.48e+00

Pos logit contributions
packs+0.138
aces+0.131
lets+0.128
suits+0.126
 names+0.124
Neg logit contributions
 beast-0.124
 crow-0.120
 hospital-0.117
 ig-0.110
 unfortunate-0.108
Token play
Token ID711
Feature activation-4.77e-01

Pos logit contributions
 hospital+0.023
 crow+0.023
 beast+0.022
 ig+0.022
 that+0.021
Neg logit contributions
packs-0.024
aces-0.024
suits-0.023
lets-0.023
 pads-0.022
Token with
Token ID351
Feature activation-7.00e-01

Pos logit contributions
packs+0.025
lets+0.024
aces+0.023
suits+0.022
 pads+0.022
Neg logit contributions
 hospital-0.023
 beast-0.022
 that-0.021
 crow-0.021
 ig-0.021
Token their
Token ID511
Feature activation-3.88e+00

Pos logit contributions
aces+0.024
packs+0.024
lets+0.023
suits+0.023
 pads+0.021
Neg logit contributions
 crow-0.044
 beast-0.043
 hospital-0.043
 ig-0.042
 cob-0.040
Token toy
Token ID13373
Feature activation+2.86e-01

Pos logit contributions
aces+0.133
packs+0.131
lets+0.126
suits+0.115
ver+0.111
Neg logit contributions
 beast-0.249
 hospital-0.247
 crow-0.245
 ig-0.242
 unfortunate-0.229
Token boats
Token ID16659
Feature activation-3.89e+00

Pos logit contributions
 beast+0.018
 crow+0.018
 hospital+0.017
 unfortunate+0.017
 ig+0.016
Neg logit contributions
packs-0.010
aces-0.010
lets-0.009
suits-0.009
 names-0.009
Token in
Token ID287
Feature activation-1.65e+00

Pos logit contributions
packs+0.207
aces+0.203
lets+0.200
suits+0.196
 names+0.193
Neg logit contributions
 hospital-0.164
 beast-0.160
 crow-0.160
 ig-0.154
 that-0.153
Token the
Token ID262
Feature activation+7.34e-01

Pos logit contributions
packs+0.092
aces+0.091
lets+0.089
suits+0.086
 names+0.084
Neg logit contributions
 beast-0.070
 crow-0.068
 hospital-0.067
 ig-0.064
 desert-0.061
Token pool
Token ID5933
Feature activation-9.59e-01

Pos logit contributions
 beast+0.035
 crow+0.034
 eagle+0.031
 unfortunate+0.030
 hospital+0.030
Neg logit contributions
packs-0.039
 names-0.036
suits-0.036
 shirts-0.036
 journals-0.035
Token.
Token ID13
Feature activation-4.02e-01

Pos logit contributions
packs+0.053
aces+0.053
lets+0.053
 mattered+0.049
 journals+0.049
Neg logit contributions
 hospital-0.039
 beast-0.037
 crow-0.037
 ig-0.037
 that-0.036
Token They
Token ID1119
Feature activation-2.94e+00

Pos logit contributions
aces+0.018
lets+0.018
 mattered+0.017
 journals+0.016
es+0.016
Neg logit contributions
 hospital-0.021
 ig-0.020
 crow-0.020
 that-0.020
 desert-0.020
Token had
Token ID550
Feature activation-2.58e+00

Pos logit contributions
packs+0.138
aces+0.137
lets+0.125
 journals+0.125
 hips+0.119
Neg logit contributions
 ig-0.146
 crow-0.144
 hospital-0.138
 that-0.138
 ostr-0.136
Token smiled
Token ID13541
Feature activation-6.63e+00

Pos logit contributions
 beast+0.104
 crow+0.103
 hospital+0.101
 ig+0.097
 unfortunate+0.094
Neg logit contributions
packs-0.084
aces-0.080
lets-0.076
suits-0.075
 names-0.073
Token and
Token ID290
Feature activation-1.33e+00

Pos logit contributions
packs+0.299
aces+0.285
lets+0.280
 names+0.266
suits+0.264
Neg logit contributions
 beast-0.364
 crow-0.354
 hospital-0.349
 ig-0.329
 unfortunate-0.325
Token showed
Token ID3751
Feature activation-4.49e+00

Pos logit contributions
packs+0.066
aces+0.064
suits+0.059
lets+0.059
 names+0.057
Neg logit contributions
 crow-0.065
 beast-0.064
 hospital-0.063
 ig-0.060
 that-0.059
Token it
Token ID340
Feature activation-8.89e-01

Pos logit contributions
packs+0.222
aces+0.218
lets+0.217
 names+0.212
 shirts+0.198
Neg logit contributions
 beast-0.221
 crow-0.214
 hospital-0.210
 ig-0.202
 desert-0.198
Token to
Token ID284
Feature activation+3.69e-01

Pos logit contributions
packs+0.048
aces+0.046
suits+0.045
lets+0.044
 journals+0.042
Neg logit contributions
 beast-0.039
 hospital-0.038
 crow-0.038
 ig-0.036
 that-0.035
Token Pam
Token ID24497
Feature activation-7.97e-02

Pos logit contributions
 beast+0.023
 crow+0.022
 hospital+0.021
 ig+0.021
 desert+0.021
Neg logit contributions
packs-0.014
 names-0.013
aces-0.013
lets-0.013
 shirts-0.012
Token.
Token ID13
Feature activation-4.50e-01

Pos logit contributions
packs+0.004
aces+0.004
lets+0.004
suits+0.003
 names+0.003
Neg logit contributions
 beast-0.004
 crow-0.004
 hospital-0.004
 ig-0.004
 unfortunate-0.004
Token But
Token ID887
Feature activation+4.95e-01

Pos logit contributions
packs+0.021
aces+0.020
lets+0.019
suits+0.019
 journals+0.018
Neg logit contributions
 beast-0.023
 crow-0.023
 hospital-0.023
 ig-0.021
 desert-0.021
Token Pam
Token ID24497
Feature activation-8.65e-02

Pos logit contributions
 beast+0.022
 crow+0.021
 hospital+0.021
 ig+0.020
 unfortunate+0.019
Neg logit contributions
packs-0.027
aces-0.026
lets-0.026
suits-0.025
 names-0.025
Token frowned
Token ID32198
Feature activation-3.01e+00

Pos logit contributions
packs+0.004
aces+0.004
lets+0.004
suits+0.004
 names+0.004
Neg logit contributions
 beast-0.005
 crow-0.005
 hospital-0.004
 ig-0.004
 unfortunate-0.004
Token when
Token ID618
Feature activation+1.40e+00

Pos logit contributions
packs+0.128
aces+0.124
lets+0.120
 names+0.114
suits+0.113
Neg logit contributions
 beast-0.169
 crow-0.165
 hospital-0.161
 ig-0.156
 desert-0.152
Token day
Token ID1110
Feature activation+1.56e+00

Pos logit contributions
packs+0.122
aces+0.112
 names+0.111
suits+0.111
lets+0.110
Neg logit contributions
 beast-0.136
 crow-0.132
 hospital-0.130
 ig-0.120
 eagle-0.120
Token,
Token ID11
Feature activation-9.08e-01

Pos logit contributions
 that+0.053
 beast+0.042
 hospital+0.037
 Doctor+0.032
 crow+0.030
Neg logit contributions
aces-0.115
lets-0.115
 pads-0.111
packs-0.110
 wrists-0.107
Token James
Token ID3700
Feature activation-4.12e+00

Pos logit contributions
aces+0.055
lets+0.055
packs+0.055
 pads+0.052
suits+0.050
Neg logit contributions
 that-0.039
 beast-0.037
 hospital-0.036
 crow-0.032
 Doctor-0.031
Token wanted
Token ID2227
Feature activation-2.40e-02

Pos logit contributions
packs+0.178
aces+0.165
lets+0.163
suits+0.159
 names+0.155
Neg logit contributions
 beast-0.233
 crow-0.227
 hospital-0.223
 ig-0.212
 unfortunate-0.210
Token to
Token ID284
Feature activation+5.04e-01

Pos logit contributions
packs+0.001
aces+0.001
lets+0.001
 names+0.001
suits+0.001
Neg logit contributions
 beast-0.001
 crow-0.001
 hospital-0.001
 ig-0.001
 unfortunate-0.001
Token record
Token ID1700
Feature activation-2.14e+00

Pos logit contributions
 beast+0.028
 crow+0.027
 hospital+0.026
 unfortunate+0.025
 ig+0.025
Neg logit contributions
packs-0.022
aces-0.020
 names-0.020
lets-0.020
 shirts-0.019
Token something
Token ID1223
Feature activation+1.24e+00

Pos logit contributions
packs+0.113
aces+0.108
lets+0.107
suits+0.105
 names+0.101
Neg logit contributions
 beast-0.099
 crow-0.095
 hospital-0.095
 ig-0.090
 unfortunate-0.086
Token on
Token ID319
Feature activation-2.94e+00

Pos logit contributions
 beast+0.058
 crow+0.057
 hospital+0.054
 ig+0.051
 desert+0.051
Neg logit contributions
packs-0.064
aces-0.061
lets-0.060
 names-0.060
suits-0.058
Token his
Token ID465
Feature activation-1.08e+00

Pos logit contributions
packs+0.118
 shirts+0.107
 journals+0.107
 names+0.105
 magnets+0.099
Neg logit contributions
 crow-0.166
 beast-0.158
 ven-0.155
 unfortunate-0.152
 eagle-0.152
Token camera
Token ID4676
Feature activation-5.32e+00

Pos logit contributions
packs+0.049
aces+0.046
 names+0.045
 shirts+0.044
lets+0.044
Neg logit contributions
 crow-0.058
 beast-0.058
 hospital-0.054
 unfortunate-0.052
 ig-0.052
Token.
Token ID13
Feature activation-5.92e-01

Pos logit contributions
packs+0.265
aces+0.253
lets+0.247
suits+0.242
 names+0.236
Neg logit contributions
 beast-0.262
 crow-0.254
 hospital-0.252
 ig-0.238
 desert-0.230
Token said
Token ID531
Feature activation+2.69e-01

Pos logit contributions
 beast+0.138
 desert+0.133
 cliff+0.132
 crow+0.131
andel+0.128
Neg logit contributions
aces-0.075
lets-0.075
packs-0.074
 names-0.071
 shirts-0.069
Token he
Token ID339
Feature activation+2.16e+00

Pos logit contributions
 beast+0.011
 crow+0.011
 hospital+0.011
 ig+0.010
 unfortunate+0.009
Neg logit contributions
packs-0.016
aces-0.015
lets-0.015
suits-0.015
 names-0.014
Token would
Token ID561
Feature activation-1.91e+00

Pos logit contributions
 beast+0.117
 crow+0.115
 hospital+0.114
 ig+0.106
 unfortunate+0.105
Neg logit contributions
packs-0.100
aces-0.093
lets-0.089
suits-0.088
 journals-0.085
Token tell
Token ID1560
Feature activation-4.00e-01

Pos logit contributions
 names+0.072
packs+0.071
lets+0.068
aces+0.068
 interests+0.065
Neg logit contributions
 beast-0.117
 crow-0.108
 desert-0.107
andel-0.104
 ig-0.103
Token Tom
Token ID4186
Feature activation-3.44e-01

Pos logit contributions
packs+0.016
 names+0.015
aces+0.015
 shirts+0.014
lets+0.014
Neg logit contributions
 beast-0.023
 crow-0.022
 cliff-0.022
 desert-0.021
 eagle-0.021
Token's
Token ID338
Feature activation-6.36e+00

Pos logit contributions
packs+0.019
aces+0.018
lets+0.018
suits+0.018
 names+0.017
Neg logit contributions
 beast-0.015
 hospital-0.015
 crow-0.014
 that-0.013
 ig-0.013
Token mom
Token ID1995
Feature activation-7.05e-01

Pos logit contributions
packs+0.278
aces+0.265
 names+0.259
lets+0.257
suits+0.242
Neg logit contributions
 beast-0.351
 crow-0.341
 hospital-0.329
 desert-0.317
 ig-0.315
Token.
Token ID13
Feature activation+3.50e-01

Pos logit contributions
packs+0.037
aces+0.036
lets+0.035
suits+0.035
 journals+0.033
Neg logit contributions
 hospital-0.032
 beast-0.032
 crow-0.031
 that-0.029
 ig-0.029
Token Tom
Token ID4186
Feature activation-3.29e-01

Pos logit contributions
 beast+0.020
 crow+0.020
 hospital+0.019
 ig+0.019
 eagle+0.018
Neg logit contributions
packs-0.015
aces-0.014
suits-0.013
 names-0.013
lets-0.013
Token felt
Token ID2936
Feature activation+3.19e+00

Pos logit contributions
packs+0.014
aces+0.014
lets+0.013
suits+0.013
 names+0.012
Neg logit contributions
 beast-0.018
 crow-0.018
 hospital-0.018
 ig-0.017
 desert-0.017
Token ashamed
Token ID22461
Feature activation+2.17e+00

Pos logit contributions
 beast+0.156
 cliff+0.147
 ig+0.146
 crow+0.145
 desert+0.143
Neg logit contributions
aces-0.152
 names-0.143
 mattered-0.138
packs-0.138
 pads-0.131
Token They
Token ID1119
Feature activation-2.56e+00

Pos logit contributions
 beast+0.011
 crow+0.010
 hospital+0.010
 ig+0.010
 desert+0.009
Neg logit contributions
packs-0.009
aces-0.008
lets-0.008
suits-0.008
 names-0.008
Token shared
Token ID4888
Feature activation-5.17e+00

Pos logit contributions
packs+0.106
aces+0.103
suits+0.096
lets+0.092
 journals+0.091
Neg logit contributions
 crow-0.146
 beast-0.146
 hospital-0.141
 eagle-0.136
 ig-0.135
Token their
Token ID511
Feature activation-3.94e+00

Pos logit contributions
aces+0.251
packs+0.242
suits+0.238
lets+0.226
ver+0.220
Neg logit contributions
 crow-0.250
 that-0.248
 beast-0.247
 hospital-0.242
 ig-0.231
Token stories
Token ID3923
Feature activation-4.49e+00

Pos logit contributions
aces+0.198
ver+0.189
lets+0.188
packs+0.185
suits+0.171
Neg logit contributions
 beast-0.189
 crow-0.186
 ig-0.182
 desert-0.180
 hospital-0.179
Token and
Token ID290
Feature activation-8.82e-01

Pos logit contributions
packs+0.239
aces+0.225
lets+0.223
suits+0.220
 names+0.218
Neg logit contributions
 beast-0.194
 that-0.190
 crow-0.188
 hospital-0.188
 ig-0.173
Token laughed
Token ID13818
Feature activation-6.69e+00

Pos logit contributions
packs+0.036
aces+0.036
suits+0.034
lets+0.032
 hips+0.031
Neg logit contributions
 beast-0.051
 crow-0.050
 hospital-0.049
 ig-0.047
 desert-0.046
Token together
Token ID1978
Feature activation-7.91e+00

Pos logit contributions
packs+0.358
aces+0.336
suits+0.329
lets+0.321
 journals+0.306
Neg logit contributions
 beast-0.301
 that-0.293
 hospital-0.292
 crow-0.289
 ig-0.276
Token.
Token ID13
Feature activation+3.09e-01

Pos logit contributions
packs+0.418
aces+0.398
lets+0.388
suits+0.386
 names+0.369
Neg logit contributions
 beast-0.362
 crow-0.351
 hospital-0.349
 ig-0.327
 that-0.324
Token When
Token ID1649
Feature activation+1.68e+00

Pos logit contributions
 beast+0.016
 hospital+0.015
 crow+0.015
 ig+0.014
 desert+0.014
Neg logit contributions
packs-0.015
aces-0.014
lets-0.014
suits-0.014
 shirts-0.013
Token they
Token ID484
Feature activation-3.32e+00

Pos logit contributions
 beast+0.082
 crow+0.081
 hospital+0.077
 ig+0.075
 eagle+0.073
Neg logit contributions
packs-0.081
 names-0.074
aces-0.073
lets-0.073
 journals-0.071
Token arrived
Token ID5284
Feature activation+1.85e-01

Pos logit contributions
packs+0.125
aces+0.117
lets+0.114
suits+0.110
 names+0.108
Neg logit contributions
 beast-0.204
 crow-0.200
 hospital-0.196
 ig-0.189
 unfortunate-0.184
Token.
Token ID13
Feature activation+5.98e-01

Pos logit contributions
 beast+0.102
 crow+0.099
 hospital+0.097
 that+0.090
 ig+0.089
Neg logit contributions
packs-0.125
aces-0.120
suits-0.115
lets-0.113
 journals-0.111
Token Come
Token ID7911
Feature activation-1.05e+00

Pos logit contributions
 beast+0.026
 crow+0.025
 hospital+0.025
 that+0.023
 vet+0.023
Neg logit contributions
packs-0.033
suits-0.030
aces-0.030
 shirts-0.029
 journals-0.029
Token on
Token ID319
Feature activation-2.74e+00

Pos logit contributions
aces+0.049
packs+0.049
 journals+0.045
lets+0.044
 mattered+0.044
Neg logit contributions
 beast-0.053
 crow-0.052
 hospital-0.051
 that-0.049
 ig-0.049
Token.
Token ID13
Feature activation+6.41e-01

Pos logit contributions
packs+0.115
aces+0.108
lets+0.104
suits+0.103
 names+0.102
Neg logit contributions
 beast-0.156
 crow-0.154
 hospital-0.148
 ig-0.144
 desert-0.141
Token Let
Token ID3914
Feature activation-3.59e+00

Pos logit contributions
 beast+0.026
 hospital+0.025
 crow+0.025
 vet+0.023
 that+0.023
Neg logit contributions
packs-0.037
aces-0.034
suits-0.034
 journals-0.033
 shirts-0.032
Token's
Token ID338
Feature activation-6.31e+00

Pos logit contributions
packs+0.115
aces+0.108
lets+0.102
suits+0.098
 names+0.093
Neg logit contributions
 beast-0.242
 crow-0.239
 hospital-0.237
 ig-0.226
 unfortunate-0.223
Token have
Token ID423
Feature activation-1.85e+00

Pos logit contributions
packs+0.290
aces+0.277
lets+0.269
suits+0.262
 names+0.256
Neg logit contributions
 beast-0.336
 crow-0.328
 hospital-0.323
 ig-0.308
 unfortunate-0.299
Token fun
Token ID1257
Feature activation-2.62e+00

Pos logit contributions
packs+0.088
aces+0.088
lets+0.083
suits+0.082
ver+0.079
Neg logit contributions
 crow-0.092
 hospital-0.092
 beast-0.090
 that-0.089
 ig-0.088
Token."
Token ID526
Feature activation-1.94e+00

Pos logit contributions
packs+0.132
aces+0.130
lets+0.129
 journals+0.126
 mattered+0.124
Neg logit contributions
 that-0.113
 hospital-0.112
 ig-0.110
 beast-0.110
 crow-0.107
Token
Token ID198
Feature activation+4.64e-01

Pos logit contributions
packs+0.088
aces+0.083
lets+0.081
suits+0.079
 names+0.078
Neg logit contributions
 beast-0.109
 crow-0.105
 hospital-0.102
 unfortunate-0.098
 ig-0.097
Token
Token ID198
Feature activation+4.59e-01

Pos logit contributions
 beast+0.035
 crow+0.032
 hospital+0.031
 eagle+0.031
 unfortunate+0.030
Neg logit contributions
packs-0.015
suits-0.013
 names-0.012
lets-0.011
 shirts-0.011
Token<|endoftext|>
Token ID50256
Feature activation+1.82e+00

Pos logit contributions
 beast+0.017
 crow+0.016
 hospital+0.016
 ig+0.015
 unfortunate+0.015
Neg logit contributions
packs-0.014
aces-0.014
lets-0.013
suits-0.013
 names-0.013
TokenOnce
Token ID7454
Feature activation-5.26e-01

Pos logit contributions
 beast+0.063
 crow+0.058
 hospital+0.058
 desert+0.055
 ig+0.054
Neg logit contributions
lets-0.115
packs-0.113
aces-0.113
 journals-0.108
 shirts-0.107
Token upon
Token ID2402
Feature activation-1.37e+00

Pos logit contributions
packs+0.021
aces+0.020
lets+0.019
suits+0.019
 names+0.018
Neg logit contributions
 beast-0.031
 crow-0.030
 hospital-0.030
 ig-0.029
 unfortunate-0.028
Token a
Token ID257
Feature activation-1.15e+00

Pos logit contributions
packs+0.070
aces+0.068
lets+0.067
suits+0.065
 names+0.063
Neg logit contributions
 beast-0.065
 crow-0.064
 hospital-0.063
 ig-0.059
 unfortunate-0.057
Token time
Token ID640
Feature activation-2.18e+00

Pos logit contributions
packs+0.057
aces+0.055
lets+0.054
suits+0.051
 names+0.050
Neg logit contributions
 beast-0.057
 crow-0.056
 hospital-0.055
 ig-0.052
 unfortunate-0.051
Token there
Token ID612
Feature activation-5.49e+00

Pos logit contributions
packs+0.121
aces+0.120
lets+0.120
 journals+0.115
suits+0.113
Neg logit contributions
 beast-0.091
 crow-0.087
 ig-0.085
 hospital-0.084
 that-0.083
Token was
Token ID373
Feature activation+3.59e-01

Pos logit contributions
packs+0.256
aces+0.243
lets+0.235
suits+0.230
 names+0.228
Neg logit contributions
 beast-0.287
 crow-0.281
 hospital-0.275
 ig-0.259
 unfortunate-0.255
Token a
Token ID257
Feature activation-1.14e+00

Pos logit contributions
 beast+0.016
 crow+0.016
 hospital+0.016
 ig+0.015
 desert+0.015
Neg logit contributions
packs-0.019
lets-0.018
aces-0.018
suits-0.018
 names-0.017
Token lazy
Token ID16931
Feature activation+4.40e-01

Pos logit contributions
suits+0.054
 names+0.049
lets+0.049
packs+0.048
 shirts+0.047
Neg logit contributions
 beast-0.052
 hospital-0.049
 unfortunate-0.048
 depths-0.046
 crow-0.046
Token cat
Token ID3797
Feature activation+2.29e+00

Pos logit contributions
 unfortunate+0.020
 beast+0.020
 depths+0.018
 hospital+0.018
 crow+0.018
Neg logit contributions
suits-0.019
aces-0.019
 names-0.018
lets-0.018
 shirts-0.018
Token who
Token ID508
Feature activation+4.39e+00

Pos logit contributions
 beast+0.121
 unfortunate+0.116
 crow+0.111
andel+0.109
 hospital+0.107
Neg logit contributions
packs-0.104
lets-0.099
aces-0.098
 pads-0.090
suits-0.090
Tokenmy
Token ID1820
Feature activation-6.50e-02

Pos logit contributions
packs+0.002
aces+0.002
lets+0.002
suits+0.002
 names+0.002
Neg logit contributions
 beast-0.005
 crow-0.005
 hospital-0.005
 ig-0.004
 unfortunate-0.004
Token replied
Token ID8712
Feature activation-5.11e-01

Pos logit contributions
packs+0.003
aces+0.002
lets+0.002
suits+0.002
 names+0.002
Neg logit contributions
 beast-0.004
 crow-0.004
 hospital-0.004
 ig-0.003
 desert-0.003
Token,
Token ID11
Feature activation-5.74e-01

Pos logit contributions
packs+0.027
aces+0.025
lets+0.025
suits+0.024
 names+0.024
Neg logit contributions
 beast-0.024
 crow-0.023
 hospital-0.023
 ig-0.022
 that-0.022
Token "
Token ID366
Feature activation-2.37e+00

Pos logit contributions
packs+0.024
aces+0.023
lets+0.022
suits+0.021
 names+0.021
Neg logit contributions
 beast-0.033
 crow-0.032
 hospital-0.032
 ig-0.031
 unfortunate-0.030
TokenThat
Token ID2504
Feature activation+5.98e-01

Pos logit contributions
packs+0.120
aces+0.115
lets+0.114
suits+0.110
 names+0.109
Neg logit contributions
 beast-0.114
 crow-0.110
 hospital-0.109
 ig-0.103
 desert-0.099
Token's
Token ID338
Feature activation-6.40e+00

Pos logit contributions
 beast+0.039
 crow+0.038
 hospital+0.038
 ig+0.036
 unfortunate+0.035
Neg logit contributions
packs-0.021
aces-0.019
lets-0.019
suits-0.018
 names-0.017
Token okay
Token ID8788
Feature activation-3.29e+00

Pos logit contributions
 names+0.165
packs+0.164
aces+0.158
 mattered+0.154
 interests+0.145
Neg logit contributions
 beast-0.448
 crow-0.446
 eagle-0.419
 hospital-0.413
 desert-0.412
Token,
Token ID11
Feature activation-5.31e-01

Pos logit contributions
packs+0.156
aces+0.148
lets+0.145
suits+0.141
 names+0.139
Neg logit contributions
 beast-0.171
 crow-0.167
 hospital-0.164
 ig-0.156
 desert-0.152
Token Mom
Token ID11254
Feature activation-1.39e+00

Pos logit contributions
packs+0.023
aces+0.022
lets+0.022
 names+0.021
suits+0.020
Neg logit contributions
 beast-0.029
 crow-0.028
 hospital-0.027
 desert-0.026
 ig-0.026
Tokenmy
Token ID1820
Feature activation-2.30e-01

Pos logit contributions
packs+0.052
aces+0.051
suits+0.047
 journals+0.046
lets+0.045
Neg logit contributions
 beast-0.083
 crow-0.083
 hospital-0.082
 ig-0.081
 that-0.077
Token.
Token ID13
Feature activation+3.51e-02

Pos logit contributions
packs+0.012
aces+0.011
lets+0.011
suits+0.011
 journals+0.011
Neg logit contributions
 beast-0.011
 crow-0.011
 hospital-0.010
 ig-0.010
 unfortunate-0.010