TOP ACTIVATIONS
MAX = 0.358

our dreams can have sad endings . <|endoftext|> Once upon a
will never lead to bad endings . <|endoftext|> Once upon a
scary things can have happy endings . <|endoftext|> Once upon a
lead to problems and bad endings . <|endoftext|> There was a
, but sometimes , the endings are not . <|endoftext|> Once
Not all dreams have happy endings ." The
and make up your own endings too . What do you
searches can lead to happy endings - but sometimes it just
She did not like bad endings . <|endoftext|> It was Lily
and gave them all happy endings . Then
fighting can lead to bad endings . <|endoftext|> One day ,
impatient can lead to bad endings . Don â
, there are still bad endings . <|endoftext|> Once there was
Not all dreams have happy endings ." The
her wand and a magical portal opened . The
said the mirror was a portal to another world . He
! He stepped through the portal , and landed in the
. The light was a portal . It took her to
. She skipped into the portal and began her story .
top had become a magic portal that took her straight to

MIN ACTIVATIONS
MIN = -0.306

temper and often threw temper tant rums . One day ,
surprise . He threw a tant rum . He kicked and
stubborn . She throws a tant rum . She kicks and
. He kept throwing a tant rum . His mom was
his feet and threw a tant rum . "
kept crying and throwing a tant rum . Finally
listen . She threw a tant rum in the store .
sad . She threw a tant rum and cried loudly .
her feet and threw a tant rum . She grabbed the
and started to throw a tant rum . Suddenly , she
to cry and threw a tant rum . She didn 't
and so she threw a tant rum when her parents said
She cried and threw a tant rum . Her mom tried
them and never threw a tant rum again . <|endoftext|> Once
Mom said , holding a spoon ful of soup to his
That is a very special spoon . Maybe it will make
. They found a big spoon and a bowl . They
a band age and a spoon . They said she was
out a bottle and a spoon from his bag . He
big impact . The small spoon may have seemed insignificant ,

INTERVAL 0.192 - 0.358
CONTAINS 0.501%

villagers ran away to find safety . But one
Then he noticed a spark ly one floating in the sea
's mom my told her that they were going to have
something important . She knew that there was a war going
would never forget . <|endoftext|> A human was walking through a

INTERVAL 0.026 - 0.192
CONTAINS 35.008%

but this sunset was not beautiful
to laugh with me ?" Mary smiled and she and her
never forget his journey to a distant land . <|endoftext|> Once
When they got to grandma 's house , Lily
it would always remember the great wings that had taken it

INTERVAL -0.140 - 0.026
CONTAINS 62.130%

It had her name , her picture , and some words
Why are you fighting over the beads ? You have enough
straight away to ask her mum a question . " What
I have a new pen . It is very good .
kin and tries to wipe her shirt , but it is

INTERVAL -0.306 - -0.140
CONTAINS 2.361%

pieces and they made a y ummy apple pie . But
One day , her mom my asked her to add some
. She takes her new doll and runs away . She
, and it looked so y ummy . Suddenly
dragged the stool to the cup board and climbed on it
Token our
Token ID674
Feature activation+2.87e-03

Pos logit contributions
fire+0.019
 portal+0.018
eware+0.018
lan+0.018
 traveler+0.017
Neg logit contributions
c-0.021
ags-0.020
my-0.020
aces-0.019
 hair-0.018
Token dreams
Token ID10625
Feature activation+8.19e-02

Pos logit contributions
 possible+0.005
fire+0.005
 reality+0.004
eware+0.004
 portal+0.004
Neg logit contributions
ags-0.005
c-0.005
aces-0.005
icer-0.005
my-0.005
Token can
Token ID460
Feature activation-6.04e-02

Pos logit contributions
 possible+0.172
fire+0.165
 reality+0.163
 portal+0.160
eware+0.159
Neg logit contributions
ags-0.120
c-0.115
aces-0.105
icer-0.098
my-0.097
Token have
Token ID423
Feature activation-1.58e-02

Pos logit contributions
ags+0.116
c+0.116
my+0.105
aces+0.104
icer+0.099
Neg logit contributions
 possible-0.094
fire-0.092
lan-0.090
 reality-0.090
eware-0.090
Token sad
Token ID6507
Feature activation-1.05e-01

Pos logit contributions
c+0.029
ags+0.029
aces+0.028
my+0.026
rell+0.025
Neg logit contributions
fire-0.026
 possible-0.025
 traveler-0.025
lan-0.025
 portal-0.024
Token endings
Token ID38168
Feature activation+3.58e-01

Pos logit contributions
ags+0.165
aces+0.158
c+0.158
my+0.150
 hair+0.144
Neg logit contributions
fire-0.197
 possible-0.191
lan-0.188
 portal-0.185
 traveler-0.184
Token.
Token ID13
Feature activation+5.99e-03

Pos logit contributions
 possible+0.610
eware+0.609
fire+0.605
lan+0.604
 traveler+0.601
Neg logit contributions
c-0.608
ags-0.599
aces-0.592
my-0.553
rell-0.474
Token<|endoftext|>
Token ID50256
Feature activation-4.41e-02

Pos logit contributions
 possible+0.009
 reality+0.009
 portal+0.009
fire+0.009
lan+0.008
Neg logit contributions
ags-0.012
aces-0.011
c-0.011
icer-0.010
aters-0.010
TokenOnce
Token ID7454
Feature activation+1.02e-03

Pos logit contributions
ags+0.087
c+0.084
aces+0.081
icer+0.074
aters+0.073
Neg logit contributions
 possible-0.068
fire-0.066
 portal-0.062
 reality-0.062
lan-0.061
Token upon
Token ID2402
Feature activation-1.06e-01

Pos logit contributions
 possible+0.002
 reality+0.002
 unexpected+0.002
 portal+0.002
eware+0.002
Neg logit contributions
ags-0.002
c-0.002
aces-0.002
icer-0.002
aters-0.002
Token a
Token ID257
Feature activation+1.19e-01

Pos logit contributions
ags+0.248
c+0.240
aces+0.227
icer+0.220
my+0.214
Neg logit contributions
 possible-0.130
fire-0.120
 reality-0.119
 unexpected-0.113
 portal-0.112
Token will
Token ID481
Feature activation-4.98e-02

Pos logit contributions
 possible+0.291
fire+0.280
 reality+0.278
lan+0.278
eware+0.276
Neg logit contributions
ags-0.184
c-0.173
aces-0.162
my-0.151
icer-0.141
Token never
Token ID1239
Feature activation+1.32e-01

Pos logit contributions
ags+0.092
c+0.088
my+0.084
aces+0.079
rell+0.076
Neg logit contributions
lan-0.079
eware-0.079
 possible-0.075
fire-0.075
 reality-0.075
Token lead
Token ID1085
Feature activation+9.03e-02

Pos logit contributions
lan+0.234
fire+0.232
 portal+0.229
 reality+0.220
 traveler+0.219
Neg logit contributions
my-0.219
c-0.215
ags-0.208
aces-0.191
rell-0.184
Token to
Token ID284
Feature activation+5.28e-02

Pos logit contributions
fire+0.151
 reality+0.149
 portal+0.146
lan+0.144
 traveler+0.144
Neg logit contributions
ags-0.169
c-0.162
my-0.154
aces-0.146
rell-0.136
Token bad
Token ID2089
Feature activation+1.10e-01

Pos logit contributions
eware+0.088
fire+0.085
lan+0.085
 portal+0.084
 traveler+0.082
Neg logit contributions
c-0.093
ags-0.092
my-0.085
aces-0.078
 hair-0.076
Token endings
Token ID38168
Feature activation+3.58e-01

Pos logit contributions
fire+0.181
lan+0.181
eware+0.178
 portal+0.169
 possible+0.166
Neg logit contributions
ags-0.201
aces-0.191
 hair-0.179
 hands-0.168
c-0.167
Token.
Token ID13
Feature activation+2.93e-03

Pos logit contributions
eware+0.632
 portal+0.613
fire+0.608
 traveler+0.603
 possible+0.601
Neg logit contributions
aces-0.573
c-0.571
ags-0.564
my-0.522
icer-0.437
Token<|endoftext|>
Token ID50256
Feature activation-5.02e-02

Pos logit contributions
 possible+0.005
fire+0.005
 reality+0.005
 portal+0.005
eware+0.005
Neg logit contributions
ags-0.006
c-0.005
aces-0.005
icer-0.005
my-0.004
TokenOnce
Token ID7454
Feature activation-7.88e-04

Pos logit contributions
ags+0.097
c+0.092
aces+0.087
icer+0.081
my+0.080
Neg logit contributions
 possible-0.081
fire-0.078
 reality-0.075
 portal-0.075
eware-0.073
Token upon
Token ID2402
Feature activation-1.07e-01

Pos logit contributions
ags+0.002
c+0.001
aces+0.001
icer+0.001
aters+0.001
Neg logit contributions
 possible-0.001
 reality-0.001
 unexpected-0.001
 portal-0.001
eware-0.001
Token a
Token ID257
Feature activation+1.23e-01

Pos logit contributions
ags+0.237
c+0.231
aces+0.218
icer+0.209
my+0.207
Neg logit contributions
 possible-0.144
fire-0.134
 reality-0.132
 portal-0.127
eware-0.126
Token scary
Token ID14343
Feature activation+1.15e-01

Pos logit contributions
fire+0.022
 portal+0.021
lan+0.021
eware+0.021
 possible+0.021
Neg logit contributions
c-0.026
ags-0.025
my-0.024
aces-0.024
 hair-0.021
Token things
Token ID1243
Feature activation+7.40e-03

Pos logit contributions
 possible+0.222
fire+0.214
lan+0.211
 reality+0.209
 portal+0.208
Neg logit contributions
c-0.179
ags-0.178
aces-0.167
my-0.156
icer-0.142
Token can
Token ID460
Feature activation-5.75e-02

Pos logit contributions
 possible+0.015
fire+0.014
 reality+0.014
 portal+0.014
lan+0.014
Neg logit contributions
ags-0.012
c-0.011
aces-0.010
my-0.010
icer-0.009
Token have
Token ID423
Feature activation-1.49e-02

Pos logit contributions
ags+0.107
c+0.105
my+0.097
aces+0.096
rell+0.090
Neg logit contributions
 possible-0.091
lan-0.089
fire-0.089
 portal-0.089
 reality-0.089
Token happy
Token ID3772
Feature activation-1.23e-01

Pos logit contributions
ags+0.027
c+0.027
aces+0.026
my+0.025
rell+0.023
Neg logit contributions
fire-0.024
 portal-0.023
 traveler-0.023
 possible-0.023
lan-0.023
Token endings
Token ID38168
Feature activation+3.57e-01

Pos logit contributions
ags+0.196
aces+0.190
c+0.189
my+0.175
 hair+0.161
Neg logit contributions
fire-0.228
 possible-0.226
 traveler-0.220
lan-0.220
 portal-0.218
Token.
Token ID13
Feature activation+3.91e-03

Pos logit contributions
 possible+0.606
lan+0.606
eware+0.605
fire+0.600
 traveler+0.596
Neg logit contributions
ags-0.601
c-0.593
aces-0.588
my-0.545
rell-0.479
Token<|endoftext|>
Token ID50256
Feature activation-5.24e-02

Pos logit contributions
 possible+0.006
 reality+0.006
fire+0.006
lan+0.006
 portal+0.006
Neg logit contributions
ags-0.008
c-0.007
aces-0.007
icer-0.006
rell-0.006
TokenOnce
Token ID7454
Feature activation-2.50e-03

Pos logit contributions
ags+0.103
c+0.098
aces+0.094
icer+0.086
my+0.085
Neg logit contributions
 possible-0.083
fire-0.080
 reality-0.076
 portal-0.076
eware-0.074
Token upon
Token ID2402
Feature activation-1.07e-01

Pos logit contributions
ags+0.005
c+0.004
aces+0.004
aters+0.004
icer+0.004
Neg logit contributions
 possible-0.004
 reality-0.004
 unexpected-0.004
eware-0.004
 portal-0.004
Token a
Token ID257
Feature activation+1.21e-01

Pos logit contributions
ags+0.243
c+0.236
aces+0.222
icer+0.214
my+0.211
Neg logit contributions
 possible-0.142
fire-0.132
 reality-0.130
 portal-0.125
 unexpected-0.124
Token lead
Token ID1085
Feature activation+8.92e-02

Pos logit contributions
c+0.106
ags+0.106
my+0.100
aces+0.094
rell+0.089
Neg logit contributions
 possible-0.094
 reality-0.092
lan-0.092
 portal-0.091
fire-0.090
Token to
Token ID284
Feature activation+5.29e-02

Pos logit contributions
fire+0.147
 reality+0.147
 portal+0.146
 possible+0.144
 traveler+0.142
Neg logit contributions
ags-0.165
c-0.160
my-0.152
aces-0.140
 hands-0.128
Token problems
Token ID2761
Feature activation+8.62e-02

Pos logit contributions
fire+0.092
 portal+0.092
eware+0.092
lan+0.090
 traveler+0.088
Neg logit contributions
ags-0.093
c-0.082
my-0.082
 hair-0.075
 hands-0.071
Token and
Token ID290
Feature activation-8.84e-02

Pos logit contributions
 possible+0.141
lan+0.140
 reality+0.139
 portal+0.138
 traveler+0.137
Neg logit contributions
ags-0.154
c-0.149
aces-0.146
my-0.139
icer-0.124
Token bad
Token ID2089
Feature activation+1.08e-01

Pos logit contributions
ags+0.161
c+0.159
my+0.155
aces+0.144
 hands+0.139
Neg logit contributions
eware-0.130
 possible-0.129
 portal-0.128
fire-0.127
lan-0.127
Token endings
Token ID38168
Feature activation+3.55e-01

Pos logit contributions
fire+0.212
 portal+0.206
lan+0.200
eware+0.199
 possible+0.198
Neg logit contributions
ags-0.157
aces-0.152
 hair-0.138
 hands-0.137
c-0.134
Token.
Token ID13
Feature activation+8.79e-04

Pos logit contributions
 possible+0.626
eware+0.625
lan+0.624
 portal+0.615
 traveler+0.613
Neg logit contributions
ags-0.562
aces-0.556
c-0.547
my-0.510
icer-0.429
Token<|endoftext|>
Token ID50256
Feature activation-5.21e-02

Pos logit contributions
 possible+0.002
fire+0.001
 reality+0.001
 portal+0.001
lan+0.001
Neg logit contributions
ags-0.002
c-0.002
aces-0.001
icer-0.001
my-0.001
TokenThere
Token ID1858
Feature activation+1.56e-01

Pos logit contributions
ags+0.100
c+0.095
aces+0.090
icer+0.084
my+0.083
Neg logit contributions
 possible-0.085
fire-0.082
 reality-0.079
 portal-0.079
eware-0.077
Token was
Token ID373
Feature activation-1.72e-02

Pos logit contributions
 possible+0.277
 reality+0.262
fire+0.259
 unexpected+0.249
 portal+0.249
Neg logit contributions
ags-0.289
c-0.264
aces-0.252
icer-0.233
my-0.227
Token a
Token ID257
Feature activation+1.22e-01

Pos logit contributions
ags+0.039
c+0.038
aces+0.036
icer+0.035
my+0.033
Neg logit contributions
 possible-0.023
 reality-0.020
fire-0.020
 unexpected-0.020
 portal-0.019
Token,
Token ID11
Feature activation+5.20e-02

Pos logit contributions
eware+0.112
fire+0.110
 possible+0.110
lan+0.106
 portal+0.106
Neg logit contributions
c-0.096
ags-0.088
aces-0.087
my-0.083
aters-0.076
Token but
Token ID475
Feature activation-1.08e-01

Pos logit contributions
 possible+0.087
fire+0.084
 reality+0.082
eware+0.081
 portal+0.081
Neg logit contributions
ags-0.095
c-0.095
aces-0.087
my-0.084
icer-0.081
Token sometimes
Token ID3360
Feature activation-4.83e-02

Pos logit contributions
c+0.214
ags+0.212
my+0.193
aces+0.193
icer+0.182
Neg logit contributions
fire-0.155
 possible-0.152
eware-0.149
 reality-0.147
lan-0.147
Token,
Token ID11
Feature activation+5.28e-02

Pos logit contributions
c+0.088
my+0.079
aces+0.076
ags+0.073
 hands+0.073
Neg logit contributions
fire-0.078
eware-0.077
lan-0.076
 portal-0.072
 traveler-0.070
Token the
Token ID262
Feature activation-6.75e-02

Pos logit contributions
fire+0.074
 possible+0.072
eware+0.072
 portal+0.071
lan+0.069
Neg logit contributions
c-0.109
ags-0.105
aces-0.099
my-0.098
icer-0.093
Token endings
Token ID38168
Feature activation+3.55e-01

Pos logit contributions
c+0.131
ags+0.130
aces+0.119
my+0.116
icer+0.114
Neg logit contributions
 possible-0.109
fire-0.102
 reality-0.100
eware-0.099
lan-0.098
Token are
Token ID389
Feature activation-9.69e-02

Pos logit contributions
 possible+0.627
eware+0.613
lan+0.612
fire+0.596
 reality+0.589
Neg logit contributions
ags-0.606
c-0.584
aces-0.570
my-0.522
rell-0.471
Token not
Token ID407
Feature activation+5.83e-02

Pos logit contributions
ags+0.194
c+0.193
aces+0.180
my+0.176
icer+0.154
Neg logit contributions
fire-0.139
eware-0.136
 portal-0.131
lan-0.130
fall-0.128
Token.
Token ID13
Feature activation-4.88e-03

Pos logit contributions
fire+0.090
eware+0.087
lan+0.086
 portal+0.084
 traveler+0.080
Neg logit contributions
c-0.112
ags-0.109
aces-0.103
my-0.103
rell-0.090
Token<|endoftext|>
Token ID50256
Feature activation-5.27e-02

Pos logit contributions
ags+0.010
c+0.009
aces+0.009
icer+0.008
my+0.008
Neg logit contributions
 possible-0.008
 reality-0.007
fire-0.007
 portal-0.007
lan-0.007
TokenOnce
Token ID7454
Feature activation-3.87e-03

Pos logit contributions
ags+0.107
c+0.101
aces+0.097
icer+0.090
my+0.088
Neg logit contributions
 possible-0.079
fire-0.076
 reality-0.073
 portal-0.072
eware-0.071
Token Not
Token ID1892
Feature activation+5.95e-02

Pos logit contributions
ags+0.007
c+0.006
aces+0.006
icer+0.006
my+0.006
Neg logit contributions
 possible-0.005
fire-0.005
 reality-0.005
 portal-0.004
eware-0.004
Token all
Token ID477
Feature activation-6.26e-02

Pos logit contributions
fire+0.093
 portal+0.088
 possible+0.086
 traveler+0.086
lan+0.086
Neg logit contributions
c-0.111
ags-0.111
aces-0.106
my-0.103
icer-0.092
Token dreams
Token ID10625
Feature activation+8.02e-02

Pos logit contributions
ags+0.122
c+0.121
aces+0.112
my+0.107
icer+0.105
Neg logit contributions
 possible-0.099
fire-0.095
 reality-0.093
eware-0.090
 portal-0.090
Token have
Token ID423
Feature activation-1.66e-02

Pos logit contributions
 possible+0.126
fire+0.120
 reality+0.119
 portal+0.115
eware+0.114
Neg logit contributions
ags-0.158
c-0.154
aces-0.145
my-0.136
icer-0.134
Token happy
Token ID3772
Feature activation-1.26e-01

Pos logit contributions
c+0.032
ags+0.032
aces+0.031
my+0.028
rell+0.027
Neg logit contributions
fire-0.025
 possible-0.025
eware-0.024
 portal-0.024
 reality-0.024
Token endings
Token ID38168
Feature activation+3.55e-01

Pos logit contributions
ags+0.210
c+0.205
aces+0.203
my+0.181
aters+0.172
Neg logit contributions
 possible-0.227
fire-0.225
eware-0.213
 portal-0.211
 traveler-0.208
Token."
Token ID526
Feature activation-7.17e-02

Pos logit contributions
 possible+0.613
fire+0.597
eware+0.593
 reality+0.588
lan+0.587
Neg logit contributions
ags-0.614
c-0.598
aces-0.579
my-0.530
icer-0.496
Token
Token ID220
Feature activation-2.71e-01

Pos logit contributions
ags+0.128
aces+0.116
c+0.114
icer+0.113
wy+0.105
Neg logit contributions
 possible-0.122
 reality-0.119
fire-0.118
 portal-0.115
lan-0.115
Token
Token ID198
Feature activation-1.56e-02

Pos logit contributions
ags+0.514
c+0.491
aces+0.464
icer+0.450
my+0.429
Neg logit contributions
 possible-0.455
fire-0.424
 reality-0.414
 portal-0.403
eware-0.402
Token
Token ID198
Feature activation-1.57e-02

Pos logit contributions
ags+0.031
c+0.030
aces+0.029
icer+0.028
aters+0.026
Neg logit contributions
 possible-0.024
fire-0.023
 reality-0.022
lan-0.022
 portal-0.022
TokenThe
Token ID464
Feature activation-1.52e-02

Pos logit contributions
ags+0.032
c+0.031
aces+0.029
icer+0.028
my+0.027
Neg logit contributions
 possible-0.025
fire-0.023
 reality-0.023
 portal-0.022
 unexpected-0.022
Token and
Token ID290
Feature activation-9.76e-02

Pos logit contributions
 possible+0.037
fire+0.036
 reality+0.035
eware+0.035
 portal+0.035
Neg logit contributions
ags-0.027
c-0.026
aces-0.024
my-0.023
icer-0.022
Token make
Token ID787
Feature activation-6.04e-02

Pos logit contributions
ags+0.167
c+0.159
aces+0.151
my+0.144
 hair+0.133
Neg logit contributions
eware-0.168
fire-0.166
 possible-0.165
 reality-0.161
 portal-0.158
Token up
Token ID510
Feature activation-3.54e-02

Pos logit contributions
ags+0.116
c+0.113
aces+0.105
icer+0.100
my+0.100
Neg logit contributions
 possible-0.100
fire-0.095
 reality-0.093
 portal-0.091
eware-0.090
Token your
Token ID534
Feature activation-5.03e-02

Pos logit contributions
ags+0.065
c+0.064
aces+0.059
my+0.056
icer+0.053
Neg logit contributions
 possible-0.058
fire-0.057
eware-0.057
 portal-0.055
 reality-0.054
Token own
Token ID898
Feature activation-1.20e-01

Pos logit contributions
ags+0.088
c+0.084
icer+0.078
aces+0.077
rell+0.074
Neg logit contributions
 possible-0.088
 reality-0.085
 portal-0.084
fire-0.083
 unexpected-0.081
Token endings
Token ID38168
Feature activation+3.54e-01

Pos logit contributions
ags+0.221
c+0.211
icer+0.196
aces+0.194
my+0.182
Neg logit contributions
 possible-0.206
 reality-0.198
 portal-0.193
fire-0.192
 unexpected-0.187
Token too
Token ID1165
Feature activation-3.93e-02

Pos logit contributions
 possible+0.602
eware+0.601
fire+0.591
 reality+0.582
 portal+0.582
Neg logit contributions
ags-0.622
c-0.600
aces-0.582
my-0.539
icer-0.500
Token.
Token ID13
Feature activation-1.16e-02

Pos logit contributions
c+0.053
ags+0.051
aces+0.048
my+0.048
icer+0.043
Neg logit contributions
fire-0.078
 possible-0.078
 portal-0.077
eware-0.076
lan-0.076
Token What
Token ID1867
Feature activation+1.41e-01

Pos logit contributions
ags+0.023
c+0.022
aces+0.021
icer+0.020
my+0.019
Neg logit contributions
 possible-0.018
fire-0.017
eware-0.017
lan-0.017
 portal-0.016
Token do
Token ID466
Feature activation+2.88e-02

Pos logit contributions
 possible+0.224
 reality+0.202
 unexpected+0.198
 portal+0.189
fire+0.188
Neg logit contributions
ags-0.301
c-0.287
icer-0.271
aces-0.263
rell-0.259
Token you
Token ID345
Feature activation+9.28e-02

Pos logit contributions
 reality+0.037
fire+0.037
 possible+0.037
 portal+0.035
eware+0.035
Neg logit contributions
ags-0.065
c-0.061
aces-0.059
my-0.056
icer-0.054
Token searches
Token ID15455
Feature activation+1.61e-01

Pos logit contributions
c+0.076
my+0.071
ags+0.065
aces+0.064
 hands+0.062
Neg logit contributions
lan-0.076
eware-0.074
fire-0.074
 traveler-0.070
 portal-0.069
Token can
Token ID460
Feature activation-6.07e-02

Pos logit contributions
fire+0.270
 possible+0.269
eware+0.261
 traveler+0.260
 portal+0.260
Neg logit contributions
ags-0.289
c-0.281
aces-0.260
my-0.256
icer-0.237
Token lead
Token ID1085
Feature activation+8.92e-02

Pos logit contributions
c+0.104
ags+0.100
my+0.098
aces+0.089
rell+0.085
Neg logit contributions
lan-0.105
eware-0.102
 portal-0.100
 possible-0.099
 traveler-0.099
Token to
Token ID284
Feature activation+5.27e-02

Pos logit contributions
fire+0.139
 portal+0.138
 possible+0.137
 reality+0.135
 traveler+0.133
Neg logit contributions
c-0.169
ags-0.168
my-0.159
aces-0.149
icer-0.135
Token happy
Token ID3772
Feature activation-1.21e-01

Pos logit contributions
eware+0.090
fire+0.085
lan+0.085
 portal+0.081
 traveler+0.080
Neg logit contributions
ags-0.094
c-0.093
my-0.087
aces-0.080
 hair-0.077
Token endings
Token ID38168
Feature activation+3.54e-01

Pos logit contributions
ags+0.191
c+0.183
aces+0.179
my+0.176
 hair+0.171
Neg logit contributions
fire-0.212
eware-0.208
 possible-0.206
 traveler-0.203
lan-0.203
Token -
Token ID532
Feature activation+1.85e-01

Pos logit contributions
eware+0.608
lan+0.603
 possible+0.597
 portal+0.586
fire+0.585
Neg logit contributions
c-0.590
ags-0.584
aces-0.583
my-0.542
rell-0.462
Token but
Token ID475
Feature activation-1.09e-01

Pos logit contributions
fire+0.258
eware+0.256
 portal+0.255
 possible+0.250
lan+0.247
Neg logit contributions
c-0.367
ags-0.346
my-0.341
aces-0.334
icer-0.301
Token sometimes
Token ID3360
Feature activation-4.56e-02

Pos logit contributions
c+0.216
ags+0.210
my+0.204
aces+0.190
icer+0.182
Neg logit contributions
eware-0.153
fire-0.152
lan-0.148
 portal-0.145
 possible-0.143
Token it
Token ID340
Feature activation-5.73e-02

Pos logit contributions
c+0.080
my+0.077
's+0.071
 hands+0.069
aces+0.069
Neg logit contributions
eware-0.073
lan-0.072
fire-0.071
 portal-0.069
 traveler-0.066
Token just
Token ID655
Feature activation-8.72e-02

Pos logit contributions
ags+0.062
c+0.062
aces+0.054
my+0.050
icer+0.048
Neg logit contributions
 possible-0.139
fire-0.135
 reality-0.134
 portal-0.132
lan-0.131
Token She
Token ID1375
Feature activation+3.32e-02

Pos logit contributions
ags+0.032
c+0.031
aces+0.029
icer+0.027
my+0.027
Neg logit contributions
 possible-0.037
fire-0.035
 reality-0.035
 portal-0.034
eware-0.034
Token did
Token ID750
Feature activation-9.46e-02

Pos logit contributions
 possible+0.056
fire+0.055
 reality+0.054
 portal+0.053
 unexpected+0.053
Neg logit contributions
ags-0.062
c-0.058
aces-0.054
icer-0.052
my-0.051
Token not
Token ID407
Feature activation+5.20e-02

Pos logit contributions
ags+0.195
c+0.188
aces+0.177
my+0.170
icer+0.161
Neg logit contributions
fire-0.140
 possible-0.137
eware-0.132
lan-0.131
 reality-0.130
Token like
Token ID588
Feature activation-2.64e-02

Pos logit contributions
 possible+0.100
fire+0.096
 reality+0.095
 portal+0.095
eware+0.094
Neg logit contributions
ags-0.083
c-0.081
my-0.077
aces-0.074
 hair-0.066
Token bad
Token ID2089
Feature activation+1.03e-01

Pos logit contributions
ags+0.048
c+0.047
aces+0.045
my+0.043
icer+0.041
Neg logit contributions
fire-0.044
 possible-0.044
 portal-0.042
eware-0.041
 reality-0.041
Token endings
Token ID38168
Feature activation+3.54e-01

Pos logit contributions
 possible+0.225
fire+0.222
 portal+0.216
 reality+0.210
 unexpected+0.202
Neg logit contributions
ags-0.149
aces-0.139
c-0.133
 hair-0.122
my-0.120
Token.
Token ID13
Feature activation-2.13e-02

Pos logit contributions
 possible+0.604
fire+0.582
eware+0.576
 portal+0.573
 reality+0.572
Neg logit contributions
ags-0.624
c-0.619
aces-0.600
my-0.559
icer-0.520
Token<|endoftext|>
Token ID50256
Feature activation-5.44e-02

Pos logit contributions
ags+0.037
c+0.036
aces+0.033
icer+0.031
my+0.031
Neg logit contributions
 possible-0.039
fire-0.037
 reality-0.037
 portal-0.036
eware-0.036
TokenIt
Token ID1026
Feature activation+6.88e-02

Pos logit contributions
ags+0.112
c+0.108
aces+0.104
icer+0.096
my+0.095
Neg logit contributions
 possible-0.078
fire-0.076
 portal-0.071
lan-0.070
 reality-0.070
Token was
Token ID373
Feature activation-2.38e-02

Pos logit contributions
 possible+0.110
fire+0.107
 reality+0.103
 unexpected+0.101
 portal+0.100
Neg logit contributions
ags-0.140
aces-0.123
c-0.121
aters-0.111
icer-0.110
Token Lily
Token ID20037
Feature activation-2.45e-02

Pos logit contributions
ags+0.052
c+0.051
aces+0.048
icer+0.046
my+0.045
Neg logit contributions
 possible-0.033
fire-0.030
 reality-0.030
 portal-0.029
 unexpected-0.029
Token and
Token ID290
Feature activation-8.82e-02

Pos logit contributions
 possible+0.179
 reality+0.169
fire+0.169
eware+0.168
 portal+0.165
Neg logit contributions
ags-0.151
c-0.149
aces-0.138
my-0.130
icer-0.126
Token gave
Token ID2921
Feature activation-8.30e-02

Pos logit contributions
c+0.162
ags+0.150
my+0.148
aces+0.145
icer+0.127
Neg logit contributions
 possible-0.147
eware-0.142
fire-0.140
 reality-0.137
 portal-0.137
Token them
Token ID606
Feature activation-9.99e-02

Pos logit contributions
ags+0.152
c+0.148
aces+0.138
icer+0.131
my+0.128
Neg logit contributions
 possible-0.144
fire-0.136
 reality-0.135
 unexpected-0.131
lan-0.130
Token all
Token ID477
Feature activation-6.58e-02

Pos logit contributions
ags+0.203
c+0.198
aces+0.185
icer+0.180
my+0.174
Neg logit contributions
 possible-0.154
fire-0.143
 reality-0.142
lan-0.138
 unexpected-0.137
Token happy
Token ID3772
Feature activation-1.21e-01

Pos logit contributions
ags+0.131
c+0.127
aces+0.125
icer+0.123
aters+0.111
Neg logit contributions
 possible-0.104
 reality-0.091
 unexpected-0.090
fire-0.090
lan-0.090
Token endings
Token ID38168
Feature activation+3.53e-01

Pos logit contributions
ags+0.178
aces+0.176
c+0.173
my+0.158
 hair+0.149
Neg logit contributions
 possible-0.249
fire-0.233
 portal-0.231
eware-0.230
 traveler-0.230
Token.
Token ID13
Feature activation-1.94e-03

Pos logit contributions
eware+0.671
 portal+0.641
 possible+0.640
 reality+0.636
 traveler+0.635
Neg logit contributions
aces-0.539
c-0.533
ags-0.511
my-0.468
rell-0.399
Token
Token ID220
Feature activation-2.69e-01

Pos logit contributions
ags+0.004
c+0.003
aces+0.003
icer+0.003
aters+0.003
Neg logit contributions
 possible-0.003
fire-0.003
 reality-0.003
lan-0.003
 portal-0.003
Token
Token ID198
Feature activation-1.70e-02

Pos logit contributions
ags+0.552
c+0.546
aces+0.507
icer+0.497
 tant+0.467
Neg logit contributions
 possible-0.387
fire-0.359
eware-0.335
lan-0.332
 reality-0.329
Token
Token ID198
Feature activation-1.75e-02

Pos logit contributions
ags+0.037
aces+0.034
c+0.034
ishes+0.032
ish+0.031
Neg logit contributions
 possible-0.024
fire-0.023
 reality-0.022
lan-0.022
 unexpected-0.021
TokenThen
Token ID6423
Feature activation+1.06e-01

Pos logit contributions
ags+0.038
c+0.036
aces+0.035
icer+0.033
my+0.032
Neg logit contributions
 possible-0.026
fire-0.024
 reality-0.024
 portal-0.023
 unexpected-0.023
Token fighting
Token ID4330
Feature activation-8.39e-02

Pos logit contributions
ags+0.209
c+0.203
aces+0.191
my+0.184
icer+0.178
Neg logit contributions
 possible-0.143
fire-0.137
 reality-0.135
 portal-0.131
eware-0.129
Token can
Token ID460
Feature activation-6.84e-02

Pos logit contributions
c+0.146
ags+0.145
icer+0.141
aces+0.136
aters+0.124
Neg logit contributions
 possible-0.151
 unexpected-0.143
 reality-0.135
 portal-0.134
fire-0.132
Token lead
Token ID1085
Feature activation+8.18e-02

Pos logit contributions
ags+0.136
c+0.132
aces+0.122
my+0.119
icer+0.116
Neg logit contributions
 possible-0.107
fire-0.103
 reality-0.102
 portal-0.100
eware-0.098
Token to
Token ID284
Feature activation+4.53e-02

Pos logit contributions
 possible+0.119
fire+0.117
 reality+0.117
 portal+0.115
 traveler+0.115
Neg logit contributions
ags-0.171
c-0.167
my-0.156
aces-0.152
icer-0.138
Token bad
Token ID2089
Feature activation+1.07e-01

Pos logit contributions
fire+0.085
eware+0.082
 portal+0.082
 traveler+0.081
lan+0.079
Neg logit contributions
ags-0.078
c-0.069
my-0.066
 hair-0.063
aces-0.062
Token endings
Token ID38168
Feature activation+3.53e-01

Pos logit contributions
fire+0.234
 possible+0.228
 portal+0.226
 reality+0.219
 traveler+0.212
Neg logit contributions
ags-0.157
 hair-0.133
c-0.132
aces-0.132
 hands-0.125
Token.
Token ID13
Feature activation-8.96e-03

Pos logit contributions
 possible+0.610
fire+0.596
 reality+0.593
eware+0.591
 portal+0.590
Neg logit contributions
ags-0.621
c-0.594
aces-0.584
my-0.545
icer-0.482
Token<|endoftext|>
Token ID50256
Feature activation-4.72e-02

Pos logit contributions
ags+0.017
c+0.016
aces+0.015
icer+0.014
my+0.014
Neg logit contributions
 possible-0.015
fire-0.014
 reality-0.014
 portal-0.014
eware-0.014
TokenOne
Token ID3198
Feature activation+5.10e-02

Pos logit contributions
ags+0.098
c+0.093
aces+0.092
aters+0.084
icer+0.083
Neg logit contributions
 possible-0.067
fire-0.066
 portal-0.060
lan-0.059
 reality-0.059
Token day
Token ID1110
Feature activation+2.68e-02

Pos logit contributions
 possible+0.074
 unexpected+0.066
 reality+0.065
fire+0.065
 portal+0.062
Neg logit contributions
ags-0.114
c-0.106
aces-0.105
icer-0.100
aters-0.097
Token,
Token ID11
Feature activation+6.42e-02

Pos logit contributions
 possible+0.040
 reality+0.039
fire+0.038
 portal+0.037
 traveler+0.036
Neg logit contributions
ags-0.055
c-0.054
aces-0.051
my-0.049
icer-0.048
Token impatient
Token ID33440
Feature activation-3.62e-02

Pos logit contributions
c+0.063
ags+0.060
aces+0.056
my+0.056
 hands+0.051
Neg logit contributions
 reality-0.060
 possible-0.060
fire-0.057
 traveler-0.056
lan-0.055
Token can
Token ID460
Feature activation-6.42e-02

Pos logit contributions
ags+0.064
c+0.062
aces+0.059
my+0.058
icer+0.052
Neg logit contributions
 possible-0.063
fire-0.062
 reality-0.061
 traveler-0.060
lan-0.060
Token lead
Token ID1085
Feature activation+8.74e-02

Pos logit contributions
ags+0.114
c+0.114
my+0.107
aces+0.102
rell+0.095
Neg logit contributions
lan-0.106
 possible-0.105
eware-0.104
 portal-0.104
 reality-0.103
Token to
Token ID284
Feature activation+4.96e-02

Pos logit contributions
fire+0.135
 reality+0.134
 traveler+0.134
 possible+0.133
 portal+0.132
Neg logit contributions
ags-0.170
c-0.165
my-0.152
aces-0.147
 hands-0.133
Token bad
Token ID2089
Feature activation+1.10e-01

Pos logit contributions
eware+0.090
fire+0.086
lan+0.085
 traveler+0.081
 portal+0.081
Neg logit contributions
ags-0.087
c-0.080
my-0.074
aces-0.071
 hair-0.070
Token endings
Token ID38168
Feature activation+3.53e-01

Pos logit contributions
fire+0.237
 possible+0.226
 portal+0.220
eware+0.214
lan+0.214
Neg logit contributions
ags-0.164
aces-0.146
 hair-0.142
 hands-0.138
c-0.130
Token.
Token ID13
Feature activation+4.57e-04

Pos logit contributions
eware+0.632
 possible+0.628
 portal+0.625
lan+0.624
fire+0.615
Neg logit contributions
ags-0.566
aces-0.556
c-0.550
my-0.495
ates-0.421
Token Don
Token ID2094
Feature activation-2.33e-02

Pos logit contributions
 possible+0.001
fire+0.001
 reality+0.001
 portal+0.001
eware+0.001
Neg logit contributions
ags-0.001
c-0.001
aces-0.001
icer-0.001
my-0.001
Tokenâ
Token ID22940
Feature activation-7.95e-02

Pos logit contributions
ags+0.051
c+0.049
aces+0.046
icer+0.045
my+0.044
Neg logit contributions
 possible-0.033
fire-0.031
 reality-0.030
 portal-0.029
eware-0.029
Token
Token ID26391
Feature activation-2.09e-02

Pos logit contributions
ags+0.126
c+0.122
aces+0.114
icer+0.108
my+0.105
Neg logit contributions
 possible-0.157
fire-0.152
 portal-0.145
 reality-0.145
eware-0.144
Token
Token ID8151
Feature activation+7.53e-02

Pos logit contributions
ags+0.036
c+0.035
aces+0.032
icer+0.030
my+0.030
Neg logit contributions
 possible-0.039
fire-0.037
 reality-0.036
 portal-0.036
 unexpected-0.035
Token,
Token ID11
Feature activation+5.26e-02

Pos logit contributions
 possible+0.159
fire+0.156
 traveler+0.153
 reality+0.152
 portal+0.152
Neg logit contributions
c-0.185
ags-0.184
aces-0.173
my-0.172
icer-0.151
Token there
Token ID612
Feature activation+1.27e-02

Pos logit contributions
fire+0.073
 possible+0.072
eware+0.072
 portal+0.071
lan+0.070
Neg logit contributions
c-0.106
ags-0.104
aces-0.098
my-0.097
icer-0.090
Token are
Token ID389
Feature activation-9.80e-02

Pos logit contributions
 possible+0.028
 reality+0.026
fire+0.026
 unexpected+0.026
 portal+0.025
Neg logit contributions
ags-0.018
c-0.017
aces-0.015
icer-0.015
my-0.014
Token still
Token ID991
Feature activation-8.39e-02

Pos logit contributions
c+0.170
ags+0.168
aces+0.161
my+0.155
icer+0.139
Neg logit contributions
fire-0.165
 portal-0.164
 possible-0.159
 traveler-0.157
eware-0.156
Token bad
Token ID2089
Feature activation+1.07e-01

Pos logit contributions
c+0.160
ags+0.159
my+0.146
aces+0.145
icer+0.129
Neg logit contributions
fire-0.127
 possible-0.126
eware-0.125
 portal-0.124
lan-0.124
Token endings
Token ID38168
Feature activation+3.52e-01

Pos logit contributions
 possible+0.236
fire+0.235
 portal+0.229
lan+0.223
 reality+0.221
Neg logit contributions
ags-0.137
aces-0.128
my-0.117
c-0.117
 hair-0.115
Token.
Token ID13
Feature activation-8.74e-03

Pos logit contributions
 possible+0.611
eware+0.594
lan+0.585
fire+0.585
 reality+0.583
Neg logit contributions
c-0.592
ags-0.588
aces-0.577
my-0.540
icer-0.477
Token<|endoftext|>
Token ID50256
Feature activation-5.60e-02

Pos logit contributions
ags+0.017
c+0.016
aces+0.016
icer+0.014
rell+0.014
Neg logit contributions
 possible-0.014
fire-0.013
 reality-0.013
 portal-0.013
lan-0.013
TokenOnce
Token ID7454
Feature activation-6.74e-03

Pos logit contributions
ags+0.118
c+0.111
aces+0.108
icer+0.098
aters+0.097
Neg logit contributions
 possible-0.079
fire-0.077
 portal-0.072
 reality-0.071
eware-0.071
Token there
Token ID612
Feature activation+2.23e-02

Pos logit contributions
ags+0.013
c+0.012
aces+0.012
aters+0.011
icer+0.011
Neg logit contributions
 possible-0.012
 reality-0.010
 unexpected-0.010
 portal-0.010
eware-0.010
Token was
Token ID373
Feature activation-2.07e-02

Pos logit contributions
 possible+0.032
 unexpected+0.029
 reality+0.029
fire+0.028
eware+0.027
Neg logit contributions
ags-0.052
aces-0.046
c-0.044
aters-0.043
icer-0.041
Token Not
Token ID1892
Feature activation+5.47e-02

Pos logit contributions
 possible+0.001
fire+0.001
 reality+0.001
eware+0.001
 portal+0.001
Neg logit contributions
ags-0.001
c-0.001
aces-0.001
icer-0.001
my-0.001
Token all
Token ID477
Feature activation-6.64e-02

Pos logit contributions
fire+0.085
 portal+0.081
 possible+0.080
 reality+0.079
 traveler+0.079
Neg logit contributions
ags-0.103
c-0.102
aces-0.097
my-0.096
icer-0.086
Token dreams
Token ID10625
Feature activation+7.60e-02

Pos logit contributions
ags+0.131
c+0.129
aces+0.120
my+0.115
icer+0.112
Neg logit contributions
 possible-0.104
fire-0.100
 reality-0.098
 portal-0.095
eware-0.095
Token have
Token ID423
Feature activation-2.14e-02

Pos logit contributions
 possible+0.122
fire+0.117
 reality+0.116
 portal+0.112
lan+0.112
Neg logit contributions
ags-0.147
c-0.143
aces-0.135
my-0.127
icer-0.124
Token happy
Token ID3772
Feature activation-1.28e-01

Pos logit contributions
c+0.041
ags+0.041
aces+0.040
my+0.037
rell+0.035
Neg logit contributions
fire-0.032
 possible-0.032
 portal-0.031
eware-0.031
 reality-0.031
Token endings
Token ID38168
Feature activation+3.52e-01

Pos logit contributions
ags+0.213
aces+0.205
c+0.205
my+0.184
 hair+0.175
Neg logit contributions
 possible-0.235
fire-0.232
 portal-0.218
eware-0.217
 traveler-0.216
Token."
Token ID526
Feature activation-6.20e-02

Pos logit contributions
 possible+0.645
eware+0.639
fire+0.636
lan+0.635
 portal+0.629
Neg logit contributions
ags-0.555
c-0.535
aces-0.533
my-0.478
rell-0.423
Token
Token ID220
Feature activation-2.71e-01

Pos logit contributions
ags+0.095
c+0.088
aces+0.085
icer+0.081
my+0.076
Neg logit contributions
 possible-0.123
fire-0.121
 reality-0.118
 portal-0.116
lan-0.116
Token
Token ID198
Feature activation-1.47e-02

Pos logit contributions
ags+0.527
c+0.506
aces+0.481
icer+0.465
my+0.445
Neg logit contributions
 possible-0.434
fire-0.408
 reality-0.390
eware-0.386
 portal-0.381
Token
Token ID198
Feature activation-1.54e-02

Pos logit contributions
ags+0.027
c+0.026
aces+0.025
icer+0.023
my+0.023
Neg logit contributions
 possible-0.025
fire-0.024
 reality-0.023
 portal-0.023
lan-0.023
TokenThe
Token ID464
Feature activation-1.75e-02

Pos logit contributions
ags+0.028
c+0.028
aces+0.025
my+0.024
icer+0.024
Neg logit contributions
 possible-0.027
fire-0.025
 reality-0.025
eware-0.025
 portal-0.024
Token her
Token ID607
Feature activation-7.74e-02

Pos logit contributions
 possible+0.065
fire+0.063
 reality+0.061
eware+0.060
 portal+0.059
Neg logit contributions
ags-0.045
c-0.043
aces-0.039
my-0.038
 hands-0.034
Token wand
Token ID11569
Feature activation+4.83e-03

Pos logit contributions
c+0.147
ags+0.142
icer+0.129
aces+0.127
my+0.126
Neg logit contributions
 possible-0.131
 portal-0.124
 reality-0.123
fire-0.122
 traveler-0.120
Token and
Token ID290
Feature activation-8.28e-02

Pos logit contributions
 possible+0.009
fire+0.009
 reality+0.009
 portal+0.008
eware+0.008
Neg logit contributions
ags-0.008
c-0.008
aces-0.007
icer-0.007
my-0.007
Token a
Token ID257
Feature activation+1.10e-01

Pos logit contributions
c+0.119
my+0.110
aces+0.097
 hands+0.096
 arms+0.092
Neg logit contributions
 possible-0.171
eware-0.163
 reality-0.158
lan-0.157
fire-0.156
Token magical
Token ID10883
Feature activation+1.43e-01

Pos logit contributions
 possible+0.184
eware+0.171
lan+0.167
fire+0.164
 reality+0.162
Neg logit contributions
c-0.204
ags-0.192
my-0.180
aces-0.174
icer-0.168
Token portal
Token ID17898
Feature activation+3.46e-01

Pos logit contributions
 possible+0.230
eware+0.218
lan+0.212
fire+0.199
 reality+0.197
Neg logit contributions
c-0.277
ags-0.258
my-0.238
aces-0.237
 hair-0.224
Token opened
Token ID4721
Feature activation+4.88e-02

Pos logit contributions
 reality+0.567
eware+0.562
lan+0.556
 possible+0.546
fire+0.533
Neg logit contributions
c-0.623
aces-0.580
ags-0.561
my-0.536
rell-0.474
Token.
Token ID13
Feature activation+7.19e-03

Pos logit contributions
 possible+0.085
eware+0.084
fire+0.084
lan+0.080
 reality+0.079
Neg logit contributions
c-0.087
ags-0.085
aces-0.077
my-0.075
icer-0.072
Token
Token ID198
Feature activation-5.39e-03

Pos logit contributions
 possible+0.015
 reality+0.014
fire+0.014
 unexpected+0.014
eware+0.013
Neg logit contributions
ags-0.011
c-0.011
aces-0.010
my-0.009
icer-0.009
Token
Token ID198
Feature activation-6.41e-03

Pos logit contributions
ags+0.009
c+0.009
aces+0.008
my+0.007
icer+0.007
Neg logit contributions
 possible-0.010
fire-0.010
 reality-0.010
 portal-0.010
eware-0.009
TokenThe
Token ID464
Feature activation-8.67e-03

Pos logit contributions
c+0.011
ags+0.011
aces+0.010
my+0.010
icer+0.009
Neg logit contributions
 possible-0.012
 reality-0.011
eware-0.011
lan-0.010
fire-0.010
Token said
Token ID531
Feature activation-8.81e-02

Pos logit contributions
 possible+0.007
fire+0.006
 reality+0.006
eware+0.006
 portal+0.006
Neg logit contributions
c-0.006
ags-0.006
aces-0.005
my-0.005
icer-0.005
Token the
Token ID262
Feature activation-6.26e-02

Pos logit contributions
c+0.184
aces+0.180
ags+0.173
my+0.172
 hair+0.158
Neg logit contributions
fire-0.116
 portal-0.114
 possible-0.114
 reality-0.111
eware-0.105
Token mirror
Token ID10162
Feature activation-8.58e-02

Pos logit contributions
ags+0.102
c+0.099
aces+0.091
my+0.086
icer+0.085
Neg logit contributions
 possible-0.122
fire-0.117
 reality-0.115
eware-0.112
 portal-0.112
Token was
Token ID373
Feature activation-2.95e-02

Pos logit contributions
c+0.140
ags+0.138
aces+0.129
my+0.121
icer+0.117
Neg logit contributions
 possible-0.164
 reality-0.155
fire-0.155
 portal-0.151
eware-0.148
Token a
Token ID257
Feature activation+1.09e-01

Pos logit contributions
ags+0.056
c+0.054
aces+0.050
my+0.049
icer+0.047
Neg logit contributions
 possible-0.047
fire-0.047
eware-0.046
 reality-0.045
 portal-0.045
Token portal
Token ID17898
Feature activation+3.46e-01

Pos logit contributions
eware+0.188
 possible+0.184
fire+0.181
uring+0.170
lan+0.170
Neg logit contributions
c-0.180
ags-0.165
my-0.164
 hair-0.154
icer-0.151
Token to
Token ID284
Feature activation+5.12e-02

Pos logit contributions
 reality+0.596
eware+0.581
 possible+0.567
lan+0.565
 unexpected+0.562
Neg logit contributions
c-0.590
aces-0.529
ags-0.521
my-0.520
 hair-0.435
Token another
Token ID1194
Feature activation+1.35e-01

Pos logit contributions
 possible+0.083
eware+0.083
lan+0.079
fire+0.078
 reality+0.076
Neg logit contributions
ags-0.098
c-0.095
my-0.084
aces-0.084
icer-0.075
Token world
Token ID995
Feature activation+4.76e-02

Pos logit contributions
eware+0.251
 possible+0.239
lan+0.221
fire+0.213
fall+0.212
Neg logit contributions
c-0.246
my-0.219
ags-0.185
icer-0.185
 hair-0.175
Token.
Token ID13
Feature activation-1.70e-03

Pos logit contributions
 reality+0.087
 possible+0.087
eware+0.082
 miracle+0.082
lan+0.082
Neg logit contributions
c-0.078
ags-0.073
aces-0.072
my-0.069
rell-0.060
Token He
Token ID679
Feature activation+8.24e-03

Pos logit contributions
ags+0.003
c+0.002
aces+0.002
my+0.002
icer+0.002
Neg logit contributions
 possible-0.004
fire-0.003
 reality-0.003
 portal-0.003
eware-0.003
Token!
Token ID0
Feature activation+1.11e-01

Pos logit contributions
my+0.035
c+0.034
ags+0.032
aces+0.028
ish+0.026
Neg logit contributions
fire-0.063
eware-0.061
 possible-0.061
 portal-0.059
 reality-0.058
Token He
Token ID679
Feature activation+8.57e-03

Pos logit contributions
 possible+0.250
fire+0.239
 reality+0.235
 unexpected+0.232
eware+0.228
Neg logit contributions
ags-0.147
c-0.142
aces-0.135
my-0.120
icer-0.110
Token stepped
Token ID10764
Feature activation+9.64e-02

Pos logit contributions
 possible+0.018
 reality+0.016
lan+0.016
fire+0.016
eware+0.016
Neg logit contributions
c-0.013
my-0.012
aces-0.011
ags-0.011
rell-0.009
Token through
Token ID832
Feature activation+2.55e-02

Pos logit contributions
 possible+0.200
 reality+0.189
eware+0.187
fire+0.185
 unexpected+0.182
Neg logit contributions
ags-0.152
c-0.141
my-0.129
aces-0.127
icer-0.116
Token the
Token ID262
Feature activation-6.21e-02

Pos logit contributions
eware+0.057
 possible+0.056
fire+0.053
lan+0.052
fall+0.050
Neg logit contributions
c-0.033
ags-0.030
aces-0.029
my-0.027
 arms-0.027
Token portal
Token ID17898
Feature activation+3.45e-01

Pos logit contributions
c+0.121
ags+0.116
my+0.103
aces+0.101
 arms+0.096
Neg logit contributions
 possible-0.105
eware-0.097
lan-0.095
fire-0.093
 diver-0.090
Token,
Token ID11
Feature activation+6.06e-02

Pos logit contributions
 possible+0.708
eware+0.702
fire+0.689
lan+0.687
 reality+0.684
Neg logit contributions
c-0.497
ags-0.460
aces-0.441
my-0.429
rell-0.363
Token and
Token ID290
Feature activation-8.12e-02

Pos logit contributions
 possible+0.105
fire+0.100
eware+0.099
 reality+0.099
lan+0.096
Neg logit contributions
c-0.108
ags-0.108
aces-0.098
my-0.097
icer-0.090
Token landed
Token ID11406
Feature activation+9.60e-02

Pos logit contributions
c+0.140
my+0.129
aces+0.121
ags+0.118
 hands+0.109
Neg logit contributions
 possible-0.136
eware-0.132
 reality-0.127
lan-0.124
fire-0.122
Token in
Token ID287
Feature activation-3.73e-02

Pos logit contributions
 possible+0.202
eware+0.194
 reality+0.193
 unexpected+0.190
fire+0.188
Neg logit contributions
ags-0.142
c-0.136
aces-0.129
my-0.113
icer-0.108
Token the
Token ID262
Feature activation-6.22e-02

Pos logit contributions
ags+0.060
c+0.059
my+0.054
icer+0.049
aces+0.048
Neg logit contributions
eware-0.073
 possible-0.069
fire-0.067
lan-0.065
fall-0.064
Token.
Token ID13
Feature activation-6.96e-03

Pos logit contributions
 possible+0.067
eware+0.065
 reality+0.063
lan+0.062
fire+0.062
Neg logit contributions
c-0.056
ags-0.051
my-0.047
aces-0.047
 hair-0.043
Token The
Token ID383
Feature activation-7.47e-02

Pos logit contributions
ags+0.010
c+0.009
aces+0.009
my+0.008
icer+0.008
Neg logit contributions
 possible-0.015
fire-0.015
 reality-0.014
 traveler-0.014
 unexpected-0.014
Token light
Token ID1657
Feature activation+3.36e-02

Pos logit contributions
ags+0.117
c+0.114
aces+0.104
icer+0.097
my+0.095
Neg logit contributions
 possible-0.150
fire-0.145
 portal-0.144
 reality-0.143
 unexpected-0.139
Token was
Token ID373
Feature activation-3.12e-02

Pos logit contributions
 possible+0.064
 reality+0.060
fire+0.059
eware+0.059
lan+0.058
Neg logit contributions
c-0.055
ags-0.053
aces-0.049
my-0.048
icer-0.044
Token a
Token ID257
Feature activation+1.07e-01

Pos logit contributions
ags+0.059
c+0.056
aces+0.052
my+0.052
icer+0.048
Neg logit contributions
fire-0.050
 possible-0.050
eware-0.049
 reality-0.048
lan-0.048
Token portal
Token ID17898
Feature activation+3.45e-01

Pos logit contributions
eware+0.200
 possible+0.197
lan+0.184
uring+0.182
fire+0.180
Neg logit contributions
c-0.168
my-0.153
ags-0.141
 hair-0.140
icer-0.136
Token.
Token ID13
Feature activation-4.96e-03

Pos logit contributions
 reality+0.657
lan+0.638
eware+0.623
 possible+0.620
fire+0.607
Neg logit contributions
c-0.550
aces-0.482
my-0.472
ags-0.460
's-0.406
Token It
Token ID632
Feature activation-5.08e-02

Pos logit contributions
ags+0.008
c+0.008
aces+0.007
my+0.007
icer+0.007
Neg logit contributions
 possible-0.010
fire-0.009
 reality-0.009
 portal-0.009
eware-0.009
Token took
Token ID1718
Feature activation-4.86e-02

Pos logit contributions
c+0.078
ags+0.071
aces+0.066
my+0.066
icer+0.059
Neg logit contributions
 possible-0.104
fire-0.099
 reality-0.098
lan-0.098
eware-0.096
Token her
Token ID607
Feature activation-8.10e-02

Pos logit contributions
ags+0.082
c+0.080
aces+0.074
my+0.069
icer+0.065
Neg logit contributions
 possible-0.092
fire-0.089
 reality-0.086
lan-0.085
eware-0.085
Token to
Token ID284
Feature activation+5.07e-02

Pos logit contributions
ags+0.159
c+0.149
icer+0.142
aces+0.141
my+0.134
Neg logit contributions
 possible-0.135
 portal-0.124
fire-0.123
 reality-0.122
eware-0.120
Token.
Token ID13
Feature activation-5.51e-03

Pos logit contributions
my+0.123
ags+0.122
c+0.113
aces+0.112
 hair+0.098
Neg logit contributions
fire-0.195
 reality-0.187
 possible-0.184
 portal-0.181
 traveler-0.180
Token She
Token ID1375
Feature activation+4.10e-02

Pos logit contributions
ags+0.008
c+0.008
aces+0.007
my+0.007
icer+0.007
Neg logit contributions
 possible-0.011
fire-0.011
 reality-0.011
 unexpected-0.010
lan-0.010
Token skipped
Token ID26684
Feature activation+8.45e-02

Pos logit contributions
 possible+0.082
fire+0.075
 reality+0.073
lan+0.072
eware+0.071
Neg logit contributions
c-0.067
ags-0.065
aces-0.062
my-0.059
icer-0.054
Token into
Token ID656
Feature activation+4.08e-02

Pos logit contributions
 possible+0.164
fire+0.160
 reality+0.160
lan+0.154
eware+0.154
Neg logit contributions
ags-0.141
c-0.131
aces-0.123
my-0.117
rell-0.105
Token the
Token ID262
Feature activation-6.74e-02

Pos logit contributions
 possible+0.079
fire+0.079
eware+0.077
lan+0.074
fall+0.072
Neg logit contributions
ags-0.066
c-0.064
aces-0.057
my-0.055
icer-0.053
Token portal
Token ID17898
Feature activation+3.44e-01

Pos logit contributions
c+0.111
ags+0.111
my+0.094
icer+0.089
aces+0.088
Neg logit contributions
 possible-0.136
fire-0.128
eware-0.127
lan-0.123
fall-0.119
Token and
Token ID290
Feature activation-9.30e-02

Pos logit contributions
 possible+0.656
fire+0.641
eware+0.635
 reality+0.635
 unexpected+0.620
Neg logit contributions
c-0.560
ags-0.535
aces-0.511
my-0.481
aters-0.417
Token began
Token ID2540
Feature activation+6.11e-02

Pos logit contributions
c+0.155
ags+0.140
my+0.136
aces+0.128
 hair+0.116
Neg logit contributions
 possible-0.178
fire-0.169
 reality-0.166
lan-0.162
eware-0.162
Token her
Token ID607
Feature activation-8.16e-02

Pos logit contributions
fire+0.107
 possible+0.100
eware+0.096
 reality+0.095
 unexpected+0.094
Neg logit contributions
c-0.113
ags-0.104
 hair-0.096
aces-0.096
 arms-0.094
Token story
Token ID1621
Feature activation-7.43e-03

Pos logit contributions
ags+0.150
c+0.149
aces+0.137
my+0.128
icer+0.124
Neg logit contributions
 possible-0.135
fire-0.132
 reality-0.127
eware-0.126
lan-0.123
Token.
Token ID13
Feature activation-3.51e-03

Pos logit contributions
c+0.012
aces+0.012
ags+0.012
my+0.010
 hair+0.009
Neg logit contributions
lan-0.013
eware-0.013
 reality-0.013
 possible-0.013
 traveler-0.013
Token top
Token ID1353
Feature activation+4.60e-02

Pos logit contributions
ags+0.128
c+0.122
aces+0.114
icer+0.108
my+0.108
Neg logit contributions
 possible-0.134
fire-0.126
eware-0.126
 reality-0.125
lan-0.121
Token had
Token ID550
Feature activation+2.47e-03

Pos logit contributions
 possible+0.087
 reality+0.082
lan+0.080
eware+0.080
fire+0.079
Neg logit contributions
c-0.078
ags-0.069
aces-0.068
my-0.067
icer-0.061
Token become
Token ID1716
Feature activation+2.70e-02

Pos logit contributions
fire+0.004
 possible+0.004
 traveler+0.004
lan+0.004
eware+0.004
Neg logit contributions
c-0.004
ags-0.004
aces-0.004
my-0.004
rell-0.004
Token a
Token ID257
Feature activation+1.09e-01

Pos logit contributions
fire+0.044
 possible+0.042
eware+0.042
 portal+0.041
lan+0.041
Neg logit contributions
c-0.052
ags-0.051
aces-0.048
my-0.045
icer-0.043
Token magic
Token ID5536
Feature activation+8.50e-02

Pos logit contributions
eware+0.204
 possible+0.184
lan+0.183
fire+0.183
uring+0.176
Neg logit contributions
c-0.164
 hair-0.144
 doll-0.142
my-0.141
icer-0.140
Token portal
Token ID17898
Feature activation+3.43e-01

Pos logit contributions
 possible+0.143
eware+0.143
lan+0.138
fire+0.132
 unexpected+0.130
Neg logit contributions
c-0.141
aces-0.130
ags-0.126
 hair-0.121
my-0.119
Token that
Token ID326
Feature activation+1.90e-01

Pos logit contributions
lan+0.591
fire+0.590
eware+0.583
 reality+0.580
 possible+0.575
Neg logit contributions
c-0.607
aces-0.549
ags-0.531
my-0.530
rell-0.463
Token took
Token ID1718
Feature activation-4.70e-02

Pos logit contributions
 possible+0.301
fire+0.300
 reality+0.288
eware+0.287
lan+0.287
Neg logit contributions
c-0.357
ags-0.351
aces-0.330
my-0.323
icer-0.299
Token her
Token ID607
Feature activation-7.77e-02

Pos logit contributions
ags+0.072
c+0.069
aces+0.066
my+0.060
 hair+0.055
Neg logit contributions
 possible-0.092
lan-0.092
fire-0.091
eware-0.090
 reality-0.089
Token straight
Token ID3892
Feature activation+8.20e-02

Pos logit contributions
ags+0.149
c+0.144
aces+0.134
icer+0.128
my+0.127
Neg logit contributions
 possible-0.130
fire-0.123
 reality-0.121
 portal-0.118
eware-0.117
Token to
Token ID284
Feature activation+5.41e-02

Pos logit contributions
 possible+0.152
fire+0.146
lan+0.140
 reality+0.140
eware+0.138
Neg logit contributions
c-0.142
ags-0.142
aces-0.133
my-0.118
 hair-0.113
Token temper
Token ID4124
Feature activation-2.27e-02

Pos logit contributions
ags+0.073
c+0.071
aces+0.065
icer+0.062
my+0.062
Neg logit contributions
 possible-0.082
fire-0.078
 reality-0.077
 portal-0.075
eware-0.075
Token and
Token ID290
Feature activation-8.34e-02

Pos logit contributions
ags+0.040
c+0.039
aces+0.036
my+0.034
icer+0.034
Neg logit contributions
 possible-0.041
fire-0.039
 reality-0.039
 portal-0.038
eware-0.037
Token often
Token ID1690
Feature activation-6.91e-02

Pos logit contributions
ags+0.150
c+0.148
aces+0.138
my+0.131
icer+0.129
Neg logit contributions
 possible-0.145
fire-0.138
 reality-0.135
 portal-0.133
eware-0.132
Token threw
Token ID9617
Feature activation-6.23e-02

Pos logit contributions
c+0.106
aces+0.096
my+0.095
ags+0.094
 hair+0.082
Neg logit contributions
 possible-0.140
fire-0.131
 portal-0.129
eware-0.129
 reality-0.128
Token temper
Token ID4124
Feature activation-2.27e-02

Pos logit contributions
ags+0.121
c+0.115
aces+0.108
icer+0.106
aters+0.100
Neg logit contributions
 possible-0.101
 unexpected-0.096
 reality-0.093
fire-0.091
 miracle-0.087
Token tant
Token ID24246
Feature activation-3.06e-01

Pos logit contributions
ags+0.038
c+0.036
icer+0.033
aces+0.032
my+0.032
Neg logit contributions
 possible-0.042
fire-0.041
eware-0.040
 reality-0.040
 unexpected-0.039
Tokenrums
Token ID45241
Feature activation+2.84e-02

Pos logit contributions
ags+0.489
c+0.479
icer+0.437
my+0.411
aces+0.399
Neg logit contributions
 possible-0.602
eware-0.550
fire-0.550
 unexpected-0.539
lan-0.537
Token.
Token ID13
Feature activation+2.29e-02

Pos logit contributions
 possible+0.037
fire+0.034
 reality+0.033
eware+0.031
lan+0.031
Neg logit contributions
ags-0.066
c-0.063
icer-0.059
aces-0.058
my-0.055
Token One
Token ID1881
Feature activation-1.87e-02

Pos logit contributions
 possible+0.039
 reality+0.036
fire+0.036
 unexpected+0.035
lan+0.034
Neg logit contributions
ags-0.044
c-0.042
aces-0.039
icer-0.036
ishes-0.035
Token day
Token ID1110
Feature activation+2.47e-02

Pos logit contributions
ags+0.038
c+0.036
aces+0.034
icer+0.032
my+0.031
Neg logit contributions
 possible-0.030
fire-0.028
 reality-0.028
 portal-0.027
eware-0.027
Token,
Token ID11
Feature activation+6.54e-02

Pos logit contributions
 possible+0.043
 portal+0.042
 reality+0.041
fire+0.041
lan+0.040
Neg logit contributions
c-0.039
my-0.039
ags-0.038
aces-0.037
 hands-0.035
Token surprise
Token ID5975
Feature activation+1.60e-01

Pos logit contributions
ags+0.121
c+0.117
my+0.109
icer+0.108
aces+0.106
Neg logit contributions
 possible-0.121
fire-0.116
eware-0.114
 portal-0.113
 reality-0.113
Token.
Token ID13
Feature activation-1.35e-02

Pos logit contributions
 possible+0.269
eware+0.264
 portal+0.260
fire+0.259
 reality+0.258
Neg logit contributions
c-0.284
ags-0.283
aces-0.266
my-0.263
icer-0.247
Token He
Token ID679
Feature activation-6.76e-03

Pos logit contributions
ags+0.023
c+0.022
aces+0.020
icer+0.019
my+0.019
Neg logit contributions
 possible-0.026
fire-0.025
 reality-0.024
 portal-0.024
eware-0.024
Token threw
Token ID9617
Feature activation-6.51e-02

Pos logit contributions
ags+0.012
c+0.012
aces+0.011
icer+0.010
my+0.010
Neg logit contributions
 possible-0.012
fire-0.012
 reality-0.011
 portal-0.011
 unexpected-0.011
Token a
Token ID257
Feature activation+1.02e-01

Pos logit contributions
ags+0.150
c+0.145
aces+0.139
aters+0.134
icer+0.130
Neg logit contributions
 possible-0.077
 unexpected-0.072
fire-0.068
 reality-0.068
 clues-0.067
Token tant
Token ID24246
Feature activation-3.05e-01

Pos logit contributions
fire+0.184
 possible+0.181
 portal+0.179
 miracle+0.175
 unexpected+0.175
Neg logit contributions
ags-0.184
aces-0.165
c-0.164
aters-0.150
icer-0.146
Tokenrum
Token ID6582
Feature activation-1.46e-01

Pos logit contributions
ags+0.421
c+0.406
icer+0.360
aces+0.347
my+0.337
Neg logit contributions
 possible-0.658
fire-0.629
eware-0.624
lan-0.604
 unexpected-0.601
Token.
Token ID13
Feature activation-8.30e-03

Pos logit contributions
ags+0.251
aces+0.240
c+0.239
my+0.223
icer+0.208
Neg logit contributions
 possible-0.271
 reality-0.259
fire-0.257
 portal-0.254
 unexpected-0.254
Token He
Token ID679
Feature activation+5.74e-04

Pos logit contributions
ags+0.014
c+0.013
aces+0.012
icer+0.011
my+0.011
Neg logit contributions
 possible-0.016
fire-0.015
 reality-0.015
 portal-0.015
eware-0.015
Token kicked
Token ID12165
Feature activation-7.92e-02

Pos logit contributions
 possible+0.001
fire+0.001
 reality+0.001
 unexpected+0.001
lan+0.001
Neg logit contributions
ags-0.001
c-0.001
aces-0.001
icer-0.001
my-0.001
Token and
Token ID290
Feature activation-9.59e-02

Pos logit contributions
ags+0.182
c+0.179
aces+0.167
aters+0.163
icer+0.157
Neg logit contributions
 possible-0.092
 unexpected-0.088
fire-0.082
 reality-0.082
 clues-0.082
Token stubborn
Token ID24444
Feature activation+2.65e-03

Pos logit contributions
c+0.100
ags+0.097
icer+0.094
aces+0.091
aters+0.088
Neg logit contributions
 possible-0.079
 unexpected-0.069
fire-0.068
 reality-0.068
lan-0.066
Token.
Token ID13
Feature activation-2.19e-02

Pos logit contributions
 possible+0.005
fire+0.005
 reality+0.005
 portal+0.005
eware+0.004
Neg logit contributions
ags-0.005
c-0.004
aces-0.004
my-0.004
icer-0.004
Token She
Token ID1375
Feature activation+2.82e-02

Pos logit contributions
ags+0.038
c+0.038
aces+0.034
icer+0.033
my+0.032
Neg logit contributions
 possible-0.039
fire-0.037
 reality-0.037
eware-0.037
 portal-0.036
Token throws
Token ID12542
Feature activation-6.08e-03

Pos logit contributions
 possible+0.054
 unexpected+0.053
 reality+0.052
 portal+0.052
fire+0.051
Neg logit contributions
ags-0.043
c-0.042
aces-0.037
icer-0.037
aters-0.035
Token a
Token ID257
Feature activation+1.01e-01

Pos logit contributions
ags+0.014
c+0.014
aces+0.013
aters+0.013
icer+0.013
Neg logit contributions
 possible-0.007
 unexpected-0.006
 that-0.006
 clues-0.006
 miracle-0.006
Token tant
Token ID24246
Feature activation-3.03e-01

Pos logit contributions
 possible+0.208
 miracle+0.200
fire+0.200
 unexpected+0.198
 portal+0.197
Neg logit contributions
ags-0.155
aces-0.137
c-0.131
aters-0.121
icer-0.118
Tokenrum
Token ID6582
Feature activation-1.47e-01

Pos logit contributions
ags+0.433
c+0.419
icer+0.381
aces+0.352
my+0.351
Neg logit contributions
 possible-0.640
eware-0.606
fire-0.599
 unexpected-0.578
lan-0.576
Token.
Token ID13
Feature activation-1.98e-02

Pos logit contributions
ags+0.276
c+0.265
aces+0.256
my+0.242
icer+0.230
Neg logit contributions
 possible-0.250
fire-0.240
 reality-0.237
 portal-0.231
 unexpected-0.229
Token She
Token ID1375
Feature activation+3.24e-02

Pos logit contributions
c+0.035
ags+0.035
aces+0.031
icer+0.031
my+0.030
Neg logit contributions
 possible-0.034
fire-0.032
 reality-0.032
eware-0.032
 portal-0.031
Token kicks
Token ID17364
Feature activation-1.55e-02

Pos logit contributions
 possible+0.057
 unexpected+0.055
 reality+0.054
fire+0.053
lan+0.053
Neg logit contributions
ags-0.056
c-0.052
aces-0.049
aters-0.046
icer-0.045
Token and
Token ID290
Feature activation-1.01e-01

Pos logit contributions
c+0.036
ags+0.036
aces+0.033
icer+0.033
aters+0.033
Neg logit contributions
 possible-0.016
 unexpected-0.016
 that-0.014
 clues-0.014
 soon-0.013
Token.
Token ID13
Feature activation-1.11e-02

Pos logit contributions
 possible+0.184
 portal+0.181
 reality+0.181
fire+0.181
 traveler+0.175
Neg logit contributions
ags-0.162
c-0.156
my-0.152
aces-0.146
 hair-0.128
Token He
Token ID679
Feature activation-2.32e-03

Pos logit contributions
ags+0.018
c+0.017
aces+0.016
icer+0.015
my+0.015
Neg logit contributions
 possible-0.022
fire-0.021
 reality-0.021
 portal-0.021
eware-0.021
Token kept
Token ID4030
Feature activation+2.17e-02

Pos logit contributions
ags+0.004
c+0.004
aces+0.004
icer+0.003
my+0.003
Neg logit contributions
 possible-0.004
fire-0.004
 reality-0.004
 portal-0.004
eware-0.004
Token throwing
Token ID9644
Feature activation-5.78e-02

Pos logit contributions
 possible+0.034
fire+0.031
 reality+0.031
 unexpected+0.030
lan+0.029
Neg logit contributions
ags-0.043
c-0.043
aces-0.041
icer-0.039
my-0.038
Token a
Token ID257
Feature activation+1.06e-01

Pos logit contributions
ags+0.114
c+0.112
aces+0.103
icer+0.102
aters+0.100
Neg logit contributions
 unexpected-0.081
 possible-0.080
fire-0.079
 clues-0.078
 reality-0.076
Token tant
Token ID24246
Feature activation-3.02e-01

Pos logit contributions
fire+0.196
 possible+0.195
 portal+0.190
 reality+0.188
 unexpected+0.188
Neg logit contributions
ags-0.182
c-0.168
aces-0.165
icer-0.147
my-0.143
Tokenrum
Token ID6582
Feature activation-1.46e-01

Pos logit contributions
ags+0.430
c+0.411
icer+0.374
aces+0.361
aters+0.343
Neg logit contributions
 possible-0.637
fire-0.611
eware-0.607
lan-0.582
 unexpected-0.581
Token.
Token ID13
Feature activation-6.87e-03

Pos logit contributions
ags+0.250
aces+0.237
c+0.233
my+0.226
icer+0.205
Neg logit contributions
 possible-0.265
fire-0.258
 reality-0.257
 portal-0.255
eware-0.255
Token His
Token ID2399
Feature activation-1.74e-01

Pos logit contributions
ags+0.011
c+0.010
aces+0.009
icer+0.009
my+0.009
Neg logit contributions
 possible-0.014
fire-0.014
 reality-0.013
 portal-0.013
eware-0.013
Token mom
Token ID1995
Feature activation-5.63e-02

Pos logit contributions
ags+0.349
c+0.345
aces+0.319
icer+0.303
my+0.300
Neg logit contributions
 possible-0.270
fire-0.256
 reality-0.254
 portal-0.246
lan-0.245
Token was
Token ID373
Feature activation-3.34e-02

Pos logit contributions
ags+0.111
c+0.101
aces+0.097
icer+0.093
aters+0.090
Neg logit contributions
fire-0.091
 possible-0.089
 unexpected-0.086
 reality-0.086
 portal-0.085
Token his
Token ID465
Feature activation-1.26e-01

Pos logit contributions
 possible+0.048
fire+0.046
 reality+0.046
 unexpected+0.045
eware+0.045
Neg logit contributions
ags-0.038
c-0.038
aces-0.035
icer-0.033
my-0.032
Token feet
Token ID3625
Feature activation-6.57e-02

Pos logit contributions
c+0.163
ags+0.162
icer+0.139
aces+0.138
my+0.133
Neg logit contributions
 possible-0.281
fire-0.271
 reality-0.264
 traveler-0.262
 portal-0.262
Token and
Token ID290
Feature activation-9.18e-02

Pos logit contributions
ags+0.115
c+0.108
aces+0.099
icer+0.098
my+0.090
Neg logit contributions
 possible-0.123
fire-0.116
 reality-0.112
 portal-0.110
eware-0.108
Token threw
Token ID9617
Feature activation-6.20e-02

Pos logit contributions
ags+0.161
c+0.158
aces+0.145
my+0.137
icer+0.136
Neg logit contributions
 possible-0.169
fire-0.158
 reality-0.158
 portal-0.154
eware-0.153
Token a
Token ID257
Feature activation+1.07e-01

Pos logit contributions
c+0.121
ags+0.120
aces+0.113
icer+0.110
aters+0.106
Neg logit contributions
 possible-0.095
 unexpected-0.087
fire-0.087
 reality-0.084
 miracle-0.082
Token tant
Token ID24246
Feature activation-3.02e-01

Pos logit contributions
 possible+0.179
fire+0.178
 portal+0.173
 miracle+0.172
 traveler+0.171
Neg logit contributions
ags-0.202
aces-0.181
c-0.176
icer-0.172
ates-0.165
Tokenrum
Token ID6582
Feature activation-1.43e-01

Pos logit contributions
c+0.418
ags+0.413
icer+0.363
my+0.349
aces+0.341
Neg logit contributions
 possible-0.652
fire-0.609
eware-0.605
lan-0.585
 unexpected-0.584
Token.
Token ID13
Feature activation-6.27e-03

Pos logit contributions
ags+0.263
c+0.253
aces+0.247
my+0.232
icer+0.220
Neg logit contributions
 possible-0.246
 reality-0.235
fire-0.234
 portal-0.233
 unexpected-0.230
Token
Token ID198
Feature activation-1.91e-02

Pos logit contributions
ags+0.011
c+0.010
aces+0.009
icer+0.009
my+0.009
Neg logit contributions
 possible-0.012
fire-0.011
 reality-0.011
 portal-0.011
eware-0.011
Token
Token ID198
Feature activation-2.06e-02

Pos logit contributions
ags+0.033
c+0.032
aces+0.029
icer+0.028
my+0.027
Neg logit contributions
 possible-0.036
fire-0.034
 reality-0.034
 portal-0.033
eware-0.033
Token"
Token ID1
Feature activation-2.72e-02

Pos logit contributions
ags+0.038
c+0.037
my+0.033
aces+0.033
icer+0.033
Neg logit contributions
 possible-0.036
 reality-0.035
 portal-0.033
fire-0.033
eware-0.032
Token kept
Token ID4030
Feature activation+2.04e-02

Pos logit contributions
 possible+0.078
fire+0.075
 reality+0.075
 unexpected+0.074
eware+0.073
Neg logit contributions
ags-0.106
c-0.098
aces-0.093
icer-0.090
aters-0.085
Token crying
Token ID13774
Feature activation-3.29e-02

Pos logit contributions
 possible+0.030
 reality+0.027
 unexpected+0.027
fire+0.026
 miracle+0.026
Neg logit contributions
ags-0.042
c-0.042
aces-0.041
icer-0.039
aters-0.037
Token and
Token ID290
Feature activation-8.21e-02

Pos logit contributions
ags+0.052
c+0.050
aces+0.046
my+0.044
icer+0.043
Neg logit contributions
 possible-0.066
fire-0.063
 reality-0.062
eware-0.061
 portal-0.061
Token throwing
Token ID9644
Feature activation-5.93e-02

Pos logit contributions
ags+0.141
c+0.141
my+0.131
aces+0.127
icer+0.118
Neg logit contributions
 possible-0.150
fire-0.144
eware-0.140
 reality-0.139
 portal-0.138
Token a
Token ID257
Feature activation+1.10e-01

Pos logit contributions
ags+0.123
c+0.119
aces+0.116
icer+0.116
aters+0.110
Neg logit contributions
 unexpected-0.076
 possible-0.072
fire-0.068
 clues-0.066
 that-0.066
Token tant
Token ID24246
Feature activation-3.00e-01

Pos logit contributions
fire+0.211
 unexpected+0.207
 miracle+0.202
 possible+0.196
 portal+0.195
Neg logit contributions
ags-0.191
aces-0.180
c-0.156
ates-0.153
icer-0.147
Tokenrum
Token ID6582
Feature activation-1.41e-01

Pos logit contributions
ags+0.429
c+0.406
icer+0.383
aces+0.353
my+0.344
Neg logit contributions
 possible-0.623
eware-0.595
fire-0.587
 unexpected-0.572
 portal-0.564
Token.
Token ID13
Feature activation+1.28e-02

Pos logit contributions
ags+0.246
c+0.241
aces+0.238
my+0.225
aters+0.200
Neg logit contributions
 possible-0.252
fire-0.246
 reality-0.238
 traveler-0.236
 portal-0.235
Token
Token ID198
Feature activation+7.92e-05

Pos logit contributions
 possible+0.019
eware+0.019
 portal+0.018
 reality+0.018
fire+0.018
Neg logit contributions
ags-0.025
c-0.024
aces-0.023
icer-0.022
wy-0.021
Token
Token ID198
Feature activation-7.68e-04

Pos logit contributions
 possible+0.000
fire+0.000
 reality+0.000
 unexpected+0.000
eware+0.000
Neg logit contributions
ags-0.000
c-0.000
aces-0.000
icer-0.000
aters-0.000
TokenFinally
Token ID11158
Feature activation+1.16e-01

Pos logit contributions
ags+0.002
c+0.002
aces+0.002
icer+0.002
my+0.002
Neg logit contributions
 possible-0.001
fire-0.001
 reality-0.001
 portal-0.001
eware-0.001
Token listen
Token ID6004
Feature activation+9.99e-02

Pos logit contributions
 possible+0.101
fire+0.099
 reality+0.097
 portal+0.096
lan+0.096
Neg logit contributions
ags-0.092
c-0.088
my-0.087
aces-0.080
rell-0.072
Token.
Token ID13
Feature activation-1.28e-02

Pos logit contributions
 possible+0.167
fire+0.166
 reality+0.164
 portal+0.161
 traveler+0.159
Neg logit contributions
ags-0.180
c-0.178
my-0.171
aces-0.168
 hair-0.148
Token She
Token ID1375
Feature activation+3.61e-02

Pos logit contributions
ags+0.020
c+0.019
aces+0.017
my+0.016
icer+0.016
Neg logit contributions
 possible-0.027
fire-0.026
 reality-0.025
 portal-0.024
 unexpected-0.024
Token threw
Token ID9617
Feature activation-6.21e-02

Pos logit contributions
 possible+0.066
fire+0.063
 reality+0.062
 portal+0.060
eware+0.060
Neg logit contributions
ags-0.064
c-0.061
aces-0.057
icer-0.054
my-0.053
Token a
Token ID257
Feature activation+1.04e-01

Pos logit contributions
ags+0.134
c+0.129
icer+0.125
aces+0.124
aters+0.119
Neg logit contributions
 possible-0.081
 unexpected-0.076
 reality-0.073
 clues-0.070
 miracle-0.070
Token tant
Token ID24246
Feature activation-3.00e-01

Pos logit contributions
 possible+0.242
fire+0.242
 portal+0.237
 reality+0.234
 unexpected+0.233
Neg logit contributions
ags-0.131
c-0.117
aces-0.115
icer-0.098
my-0.097
Tokenrum
Token ID6582
Feature activation-1.44e-01

Pos logit contributions
ags+0.425
c+0.407
icer+0.378
aces+0.357
my+0.344
Neg logit contributions
 possible-0.619
eware-0.601
fire-0.592
 unexpected-0.576
lan-0.573
Token in
Token ID287
Feature activation-4.31e-02

Pos logit contributions
ags+0.215
aces+0.203
c+0.203
my+0.190
icer+0.165
Neg logit contributions
 possible-0.303
fire-0.293
 reality-0.290
 unexpected-0.284
eware-0.283
Token the
Token ID262
Feature activation-6.83e-02

Pos logit contributions
ags+0.084
c+0.081
aces+0.077
icer+0.074
my+0.071
Neg logit contributions
 possible-0.069
 reality-0.064
 unexpected-0.064
fire-0.062
 portal-0.062
Token store
Token ID3650
Feature activation-2.38e-02

Pos logit contributions
ags+0.120
c+0.117
aces+0.108
icer+0.102
my+0.101
Neg logit contributions
 possible-0.124
fire-0.119
 reality-0.117
 portal-0.114
eware-0.113
Token.
Token ID13
Feature activation-6.93e-03

Pos logit contributions
c+0.037
ags+0.036
aces+0.035
my+0.032
icer+0.029
Neg logit contributions
 possible-0.048
 reality-0.045
fire-0.045
eware-0.044
 unexpected-0.044
Token sad
Token ID6507
Feature activation-1.02e-01

Pos logit contributions
ags+0.142
c+0.138
aces+0.127
my+0.121
icer+0.117
Neg logit contributions
 possible-0.162
fire-0.155
 reality-0.153
 portal-0.149
eware-0.148
Token.
Token ID13
Feature activation+1.00e-02

Pos logit contributions
ags+0.163
aces+0.154
c+0.151
my+0.148
 hair+0.134
Neg logit contributions
 possible-0.199
fire-0.197
 portal-0.190
 traveler-0.190
 reality-0.188
Token She
Token ID1375
Feature activation+4.55e-02

Pos logit contributions
 possible+0.015
eware+0.015
 unexpected+0.014
 portal+0.014
fall+0.014
Neg logit contributions
ags-0.019
c-0.019
aces-0.018
icer-0.018
igh-0.017
Token threw
Token ID9617
Feature activation-6.10e-02

Pos logit contributions
 possible+0.074
fire+0.070
 reality+0.070
 unexpected+0.068
 portal+0.068
Neg logit contributions
ags-0.088
c-0.085
aces-0.080
icer-0.077
my-0.074
Token a
Token ID257
Feature activation+1.10e-01

Pos logit contributions
ags+0.147
aces+0.143
c+0.141
icer+0.139
aters+0.139
Neg logit contributions
 unexpected-0.059
 possible-0.056
 clues-0.054
 that-0.052
 miracle-0.050
Token tant
Token ID24246
Feature activation-2.99e-01

Pos logit contributions
 unexpected+0.219
 portal+0.218
 miracle+0.215
fire+0.213
 possible+0.213
Neg logit contributions
ags-0.182
aces-0.163
c-0.144
aters-0.142
ates-0.142
Tokenrum
Token ID6582
Feature activation-1.40e-01

Pos logit contributions
ags+0.428
c+0.408
icer+0.386
aces+0.353
my+0.346
Neg logit contributions
 possible-0.620
eware-0.596
fire-0.585
 unexpected-0.571
lan-0.564
Token and
Token ID290
Feature activation-8.54e-02

Pos logit contributions
ags+0.202
c+0.194
aces+0.185
my+0.175
icer+0.157
Neg logit contributions
 possible-0.300
fire-0.291
 reality-0.285
 unexpected-0.279
 portal-0.278
Token cried
Token ID16896
Feature activation-5.16e-02

Pos logit contributions
c+0.148
ags+0.147
aces+0.137
my+0.133
icer+0.125
Neg logit contributions
 possible-0.155
fire-0.148
 reality-0.144
eware-0.143
 portal-0.142
Token loudly
Token ID23112
Feature activation+9.37e-02

Pos logit contributions
ags+0.103
c+0.100
aces+0.094
my+0.089
icer+0.088
Neg logit contributions
 possible-0.082
fire-0.078
 reality-0.076
 portal-0.075
eware-0.074
Token.
Token ID13
Feature activation+1.27e-02

Pos logit contributions
 possible+0.170
 reality+0.163
fire+0.157
 portal+0.155
 unexpected+0.154
Neg logit contributions
ags-0.161
aces-0.159
c-0.157
my-0.142
 hair-0.134
Token her
Token ID607
Feature activation-8.06e-02

Pos logit contributions
 possible+0.200
fire+0.191
 reality+0.190
 unexpected+0.189
eware+0.184
Neg logit contributions
ags-0.209
c-0.200
aces-0.189
icer-0.184
my-0.172
Token feet
Token ID3625
Feature activation-6.05e-02

Pos logit contributions
ags+0.117
c+0.115
aces+0.098
icer+0.096
my+0.094
Neg logit contributions
 possible-0.168
fire-0.162
 reality-0.162
 portal-0.159
 traveler-0.157
Token and
Token ID290
Feature activation-8.64e-02

Pos logit contributions
ags+0.107
c+0.100
icer+0.092
aces+0.092
aters+0.084
Neg logit contributions
 possible-0.111
fire-0.105
 reality-0.101
 portal-0.100
 unexpected-0.100
Token threw
Token ID9617
Feature activation-5.92e-02

Pos logit contributions
ags+0.163
c+0.156
aces+0.145
icer+0.140
my+0.136
Neg logit contributions
 possible-0.146
fire-0.140
 reality-0.137
 portal-0.134
 unexpected-0.133
Token a
Token ID257
Feature activation+1.09e-01

Pos logit contributions
ags+0.121
c+0.115
aces+0.110
icer+0.110
aters+0.104
Neg logit contributions
 possible-0.087
 unexpected-0.081
 reality-0.077
 miracle-0.075
fire-0.075
Token tant
Token ID24246
Feature activation-2.99e-01

Pos logit contributions
 possible+0.259
fire+0.257
 portal+0.255
 unexpected+0.254
 reality+0.251
Neg logit contributions
ags-0.133
aces-0.109
c-0.104
aters-0.094
icer-0.092
Tokenrum
Token ID6582
Feature activation-1.41e-01

Pos logit contributions
ags+0.412
c+0.391
icer+0.354
aces+0.341
my+0.328
Neg logit contributions
 possible-0.645
fire-0.620
eware-0.612
 unexpected-0.594
lan-0.592
Token.
Token ID13
Feature activation+5.89e-03

Pos logit contributions
ags+0.243
c+0.238
aces+0.237
my+0.220
aters+0.199
Neg logit contributions
 possible-0.255
fire-0.248
 reality-0.242
 portal-0.241
 unexpected-0.238
Token She
Token ID1375
Feature activation+4.83e-02

Pos logit contributions
 possible+0.011
eware+0.011
 reality+0.011
fire+0.011
lan+0.011
Neg logit contributions
ags-0.009
c-0.009
aces-0.008
icer-0.008
my-0.008
Token grabbed
Token ID13176
Feature activation-5.35e-03

Pos logit contributions
 possible+0.071
fire+0.071
 unexpected+0.070
 reality+0.069
 portal+0.067
Neg logit contributions
ags-0.100
c-0.092
aces-0.087
icer-0.084
aters-0.082
Token the
Token ID262
Feature activation-6.45e-02

Pos logit contributions
ags+0.011
c+0.011
aces+0.010
icer+0.010
aters+0.010
Neg logit contributions
 possible-0.008
 unexpected-0.007
 reality-0.007
fire-0.007
lan-0.006
Token and
Token ID290
Feature activation-8.53e-02

Pos logit contributions
ags+0.043
c+0.040
aces+0.039
my+0.038
icer+0.030
Neg logit contributions
 possible-0.092
fire-0.090
 reality-0.088
 portal-0.087
eware-0.086
Token started
Token ID2067
Feature activation+1.98e-02

Pos logit contributions
ags+0.156
c+0.151
aces+0.141
icer+0.135
my+0.131
Neg logit contributions
 possible-0.149
fire-0.142
 reality-0.140
 portal-0.136
eware-0.136
Token to
Token ID284
Feature activation+5.44e-02

Pos logit contributions
 possible+0.021
fire+0.019
 reality+0.018
eware+0.018
lan+0.018
Neg logit contributions
ags-0.050
c-0.049
aces-0.047
icer-0.045
my-0.044
Token throw
Token ID3714
Feature activation-6.02e-02

Pos logit contributions
 possible+0.103
fire+0.099
 reality+0.098
 unexpected+0.096
eware+0.095
Neg logit contributions
ags-0.091
c-0.087
aces-0.082
icer-0.077
my-0.072
Token a
Token ID257
Feature activation+1.12e-01

Pos logit contributions
ags+0.136
aces+0.131
c+0.123
aters+0.120
opl+0.120
Neg logit contributions
 unexpected-0.065
 possible-0.061
 clues-0.059
fire-0.058
 that-0.057
Token tant
Token ID24246
Feature activation-2.99e-01

Pos logit contributions
fire+0.195
 possible+0.192
 unexpected+0.186
 miracle+0.186
 reality+0.186
Neg logit contributions
ags-0.216
aces-0.197
c-0.176
aters-0.172
rell-0.165
Tokenrum
Token ID6582
Feature activation-1.40e-01

Pos logit contributions
ags+0.421
c+0.399
icer+0.364
aces+0.353
my+0.334
Neg logit contributions
 possible-0.638
fire-0.608
eware-0.604
 unexpected-0.584
lan-0.582
Token.
Token ID13
Feature activation+1.25e-02

Pos logit contributions
ags+0.243
c+0.232
aces+0.232
my+0.217
aters+0.193
Neg logit contributions
 possible-0.256
fire-0.250
 reality-0.245
 traveler-0.240
 portal-0.240
Token Suddenly
Token ID24975
Feature activation+7.49e-02

Pos logit contributions
eware+0.022
 possible+0.021
 portal+0.021
fall+0.020
 reality+0.020
Neg logit contributions
ags-0.021
c-0.021
icer-0.020
aces-0.019
wy-0.018
Token,
Token ID11
Feature activation+6.20e-02

Pos logit contributions
fire+0.148
 possible+0.146
 reality+0.136
lan+0.136
 traveler+0.132
Neg logit contributions
c-0.121
my-0.111
ags-0.111
aces-0.108
 hair-0.105
Token she
Token ID673
Feature activation+1.15e-01

Pos logit contributions
fire+0.113
 possible+0.111
eware+0.105
 portal+0.104
lan+0.104
Neg logit contributions
c-0.103
ags-0.102
my-0.096
aces-0.091
 hair-0.091
Token to
Token ID284
Feature activation+5.61e-02

Pos logit contributions
 possible+0.024
fire+0.022
 reality+0.022
 portal+0.021
eware+0.021
Neg logit contributions
ags-0.053
c-0.051
aces-0.049
icer-0.047
my-0.046
Token cry
Token ID3960
Feature activation-8.23e-02

Pos logit contributions
 possible+0.101
 reality+0.096
fire+0.096
 portal+0.093
 unexpected+0.092
Neg logit contributions
ags-0.101
c-0.096
aces-0.091
icer-0.086
my-0.082
Token and
Token ID290
Feature activation-8.19e-02

Pos logit contributions
ags+0.121
c+0.116
my+0.107
aces+0.106
icer+0.092
Neg logit contributions
 possible-0.176
fire-0.168
 portal-0.162
eware-0.162
lan-0.161
Token threw
Token ID9617
Feature activation-5.68e-02

Pos logit contributions
c+0.111
my+0.105
ags+0.102
 hair+0.088
 hands+0.087
Neg logit contributions
 possible-0.185
fire-0.180
eware-0.173
lan-0.172
 reality-0.167
Token a
Token ID257
Feature activation+1.13e-01

Pos logit contributions
ags+0.117
c+0.112
icer+0.111
aces+0.110
aters+0.105
Neg logit contributions
 possible-0.080
 unexpected-0.078
 clues-0.071
 miracle-0.071
 reality-0.070
Token tant
Token ID24246
Feature activation-2.97e-01

Pos logit contributions
 possible+0.224
 portal+0.223
 unexpected+0.220
 reality+0.218
 miracle+0.217
Neg logit contributions
ags-0.188
aces-0.169
c-0.159
icer-0.151
aters-0.147
Tokenrum
Token ID6582
Feature activation-1.38e-01

Pos logit contributions
ags+0.414
c+0.394
icer+0.361
aces+0.345
my+0.329
Neg logit contributions
 possible-0.634
eware-0.602
fire-0.601
 unexpected-0.583
 portal-0.581
Token.
Token ID13
Feature activation+1.45e-02

Pos logit contributions
ags+0.234
c+0.229
aces+0.226
my+0.213
 hair+0.188
Neg logit contributions
 possible-0.256
fire-0.250
 reality-0.241
 unexpected-0.238
 portal-0.236
Token She
Token ID1375
Feature activation+5.59e-02

Pos logit contributions
 possible+0.030
 reality+0.029
fire+0.029
eware+0.029
 portal+0.029
Neg logit contributions
ags-0.021
c-0.021
aces-0.018
icer-0.018
my-0.017
Token didn
Token ID1422
Feature activation-5.14e-02

Pos logit contributions
 unexpected+0.077
 reality+0.077
 possible+0.076
fire+0.075
 portal+0.073
Neg logit contributions
ags-0.123
c-0.110
aces-0.107
icer-0.104
aters-0.100
Token't
Token ID470
Feature activation+4.61e-02

Pos logit contributions
ags+0.114
c+0.112
aces+0.104
icer+0.100
my+0.099
Neg logit contributions
 possible-0.071
fire-0.065
 reality-0.063
 portal-0.062
eware-0.061
Token and
Token ID290
Feature activation-7.51e-02

Pos logit contributions
 possible+0.026
fire+0.026
 reality+0.025
 portal+0.025
lan+0.024
Neg logit contributions
ags-0.017
c-0.016
aces-0.016
my-0.015
icer-0.013
Token so
Token ID523
Feature activation+2.56e-02

Pos logit contributions
ags+0.144
c+0.136
aces+0.127
icer+0.126
aters+0.118
Neg logit contributions
 possible-0.127
fire-0.119
 reality-0.117
 unexpected-0.117
eware-0.116
Token she
Token ID673
Feature activation+1.23e-01

Pos logit contributions
 possible+0.031
fire+0.029
 reality+0.028
 portal+0.027
eware+0.027
Neg logit contributions
ags-0.060
c-0.060
aces-0.056
my-0.055
icer-0.054
Token threw
Token ID9617
Feature activation-5.23e-02

Pos logit contributions
 possible+0.190
fire+0.187
 reality+0.180
 unexpected+0.178
 traveler+0.176
Neg logit contributions
ags-0.249
c-0.236
aces-0.222
icer-0.213
my-0.208
Token a
Token ID257
Feature activation+1.20e-01

Pos logit contributions
icer+0.110
c+0.108
ags+0.108
aces+0.100
aters+0.099
Neg logit contributions
 unexpected-0.069
 possible-0.069
 clues-0.062
 miracle-0.061
 that-0.057
Token tant
Token ID24246
Feature activation-2.96e-01

Pos logit contributions
 possible+0.200
 unexpected+0.194
fire+0.193
 reality+0.192
 portal+0.190
Neg logit contributions
ags-0.231
aces-0.216
c-0.215
aters-0.192
icer-0.191
Tokenrum
Token ID6582
Feature activation-1.35e-01

Pos logit contributions
ags+0.398
c+0.379
icer+0.340
aces+0.329
my+0.317
Neg logit contributions
 possible-0.660
fire-0.628
eware-0.620
lan-0.606
 reality-0.606
Token when
Token ID618
Feature activation+5.38e-02

Pos logit contributions
ags+0.246
c+0.237
aces+0.233
my+0.215
icer+0.203
Neg logit contributions
 possible-0.235
fire-0.228
 reality-0.223
 unexpected-0.220
 portal-0.219
Token her
Token ID607
Feature activation-6.88e-02

Pos logit contributions
fire+0.085
 possible+0.078
eware+0.078
 traveler+0.077
 portal+0.076
Neg logit contributions
c-0.104
ags-0.102
my-0.096
aces-0.094
 hair-0.092
Token parents
Token ID3397
Feature activation+5.18e-02

Pos logit contributions
ags+0.121
c+0.115
aces+0.109
icer+0.101
my+0.101
Neg logit contributions
 possible-0.126
fire-0.120
 reality-0.117
 portal-0.116
eware-0.114
Token said
Token ID531
Feature activation-7.26e-02

Pos logit contributions
 possible+0.088
fire+0.086
 reality+0.083
eware+0.081
 unexpected+0.081
Neg logit contributions
ags-0.097
c-0.092
aces-0.086
icer-0.082
my-0.079
Token She
Token ID1375
Feature activation+5.43e-02

Pos logit contributions
 possible+0.018
eware+0.018
 portal+0.017
 reality+0.017
lan+0.017
Neg logit contributions
ags-0.020
c-0.019
aces-0.018
icer-0.018
ates-0.016
Token cried
Token ID16896
Feature activation-5.26e-02

Pos logit contributions
 possible+0.088
fire+0.084
 reality+0.083
 unexpected+0.081
 portal+0.081
Neg logit contributions
ags-0.107
c-0.101
aces-0.095
icer-0.091
my-0.087
Token and
Token ID290
Feature activation-8.76e-02

Pos logit contributions
ags+0.088
c+0.087
aces+0.079
icer+0.078
my+0.072
Neg logit contributions
 possible-0.097
fire-0.094
 reality-0.093
 portal-0.090
 unexpected-0.090
Token threw
Token ID9617
Feature activation-5.88e-02

Pos logit contributions
ags+0.141
c+0.136
aces+0.126
my+0.118
icer+0.113
Neg logit contributions
 possible-0.174
fire-0.166
 reality-0.163
 portal-0.160
eware-0.160
Token a
Token ID257
Feature activation+1.09e-01

Pos logit contributions
ags+0.113
icer+0.109
c+0.108
aces+0.104
aters+0.099
Neg logit contributions
 unexpected-0.089
 possible-0.085
 miracle-0.081
 clues-0.080
 reality-0.080
Token tant
Token ID24246
Feature activation-2.96e-01

Pos logit contributions
fire+0.235
 portal+0.235
 possible+0.232
 unexpected+0.230
 reality+0.229
Neg logit contributions
ags-0.160
aces-0.138
c-0.133
icer-0.122
aters-0.120
Tokenrum
Token ID6582
Feature activation-1.40e-01

Pos logit contributions
ags+0.426
c+0.403
icer+0.374
aces+0.356
my+0.338
Neg logit contributions
 possible-0.612
eware-0.592
fire-0.585
 unexpected-0.568
lan-0.563
Token.
Token ID13
Feature activation+1.13e-02

Pos logit contributions
ags+0.249
c+0.242
aces+0.240
my+0.225
icer+0.202
Neg logit contributions
 possible-0.251
fire-0.240
 reality-0.234
 portal-0.231
 traveler-0.230
Token Her
Token ID2332
Feature activation-1.18e-01

Pos logit contributions
eware+0.020
 possible+0.019
 portal+0.019
lan+0.019
 reality+0.019
Neg logit contributions
ags-0.019
c-0.018
aces-0.017
icer-0.017
wy-0.015
Token mom
Token ID1995
Feature activation-4.79e-02

Pos logit contributions
ags+0.229
c+0.224
aces+0.207
icer+0.198
my+0.195
Neg logit contributions
 possible-0.193
fire-0.182
 reality-0.181
 portal-0.176
eware-0.174
Token tried
Token ID3088
Feature activation-5.31e-02

Pos logit contributions
ags+0.097
c+0.087
aces+0.085
icer+0.082
aters+0.077
Neg logit contributions
 possible-0.074
fire-0.074
 reality-0.072
 unexpected-0.072
 portal-0.070
Token them
Token ID606
Feature activation-9.59e-02

Pos logit contributions
eware+0.015
 possible+0.015
 portal+0.014
fire+0.014
 traveler+0.014
Neg logit contributions
ags-0.014
c-0.014
aces-0.013
my-0.012
icer-0.012
Token and
Token ID290
Feature activation-9.08e-02

Pos logit contributions
ags+0.188
c+0.185
aces+0.174
my+0.166
icer+0.160
Neg logit contributions
 possible-0.149
fire-0.142
 reality-0.141
 portal-0.140
eware-0.138
Token never
Token ID1239
Feature activation+1.34e-01

Pos logit contributions
ags+0.186
c+0.186
aces+0.171
my+0.169
icer+0.156
Neg logit contributions
 possible-0.131
fire-0.125
eware-0.123
 portal-0.121
 reality-0.121
Token threw
Token ID9617
Feature activation-6.13e-02

Pos logit contributions
fire+0.243
 possible+0.237
 portal+0.232
 traveler+0.230
 reality+0.230
Neg logit contributions
c-0.227
my-0.219
ags-0.206
aces-0.201
 hair-0.181
Token a
Token ID257
Feature activation+1.09e-01

Pos logit contributions
ags+0.116
c+0.112
aces+0.105
icer+0.100
my+0.099
Neg logit contributions
 possible-0.103
fire-0.097
 reality-0.096
 unexpected-0.093
 portal-0.092
Token tant
Token ID24246
Feature activation-2.96e-01

Pos logit contributions
 possible+0.209
fire+0.202
 reality+0.199
 portal+0.194
 unexpected+0.194
Neg logit contributions
ags-0.185
c-0.167
aces-0.165
icer-0.151
my-0.145
Tokenrum
Token ID6582
Feature activation-1.38e-01

Pos logit contributions
ags+0.419
c+0.410
icer+0.374
my+0.352
aces+0.346
Neg logit contributions
 possible-0.631
fire-0.580
eware-0.576
lan-0.566
 reality-0.565
Token again
Token ID757
Feature activation+5.15e-02

Pos logit contributions
ags+0.256
c+0.252
aces+0.242
my+0.233
icer+0.213
Neg logit contributions
 possible-0.229
fire-0.224
 reality-0.222
 portal-0.220
 traveler-0.217
Token.
Token ID13
Feature activation-2.68e-03

Pos logit contributions
 portal+0.093
eware+0.092
 traveler+0.092
fire+0.092
 possible+0.092
Neg logit contributions
c-0.088
ags-0.080
my-0.078
aces-0.076
aters-0.068
Token<|endoftext|>
Token ID50256
Feature activation-4.78e-02

Pos logit contributions
ags+0.005
c+0.005
aces+0.005
icer+0.004
my+0.004
Neg logit contributions
 possible-0.004
 reality-0.004
fire-0.004
lan-0.004
 portal-0.004
TokenOnce
Token ID7454
Feature activation+7.81e-04

Pos logit contributions
ags+0.103
c+0.097
aces+0.096
ishes+0.090
ish+0.088
Neg logit contributions
 possible-0.063
fire-0.062
 portal-0.058
 reality-0.055
 unexpected-0.055
Token Mom
Token ID11254
Feature activation-5.82e-02

Pos logit contributions
ags+0.076
c+0.076
aces+0.069
my+0.068
icer+0.064
Neg logit contributions
 possible-0.064
fire-0.061
 reality-0.060
 portal-0.059
eware-0.059
Token said
Token ID531
Feature activation-9.04e-02

Pos logit contributions
ags+0.121
c+0.108
icer+0.107
aces+0.105
aters+0.098
Neg logit contributions
 possible-0.087
fire-0.086
lan-0.080
eware-0.079
 unexpected-0.079
Token,
Token ID11
Feature activation+5.11e-02

Pos logit contributions
ags+0.175
c+0.170
aces+0.161
my+0.152
icer+0.144
Neg logit contributions
 possible-0.148
 reality-0.141
 portal-0.138
fire-0.137
eware-0.134
Token holding
Token ID4769
Feature activation-1.32e-01

Pos logit contributions
 possible+0.079
fire+0.076
 reality+0.074
 portal+0.071
 traveler+0.070
Neg logit contributions
ags-0.103
c-0.100
aces-0.094
icer-0.091
my-0.088
Token a
Token ID257
Feature activation+1.01e-01

Pos logit contributions
ags+0.259
c+0.256
aces+0.244
icer+0.231
rell+0.231
Neg logit contributions
 possible-0.195
fire-0.186
 unexpected-0.182
 reality-0.180
lan-0.176
Token spoon
Token ID24556
Feature activation-2.87e-01

Pos logit contributions
 possible+0.174
 reality+0.167
fire+0.166
 portal+0.166
 unexpected+0.165
Neg logit contributions
ags-0.190
aces-0.179
c-0.174
icer-0.165
rell-0.157
Tokenful
Token ID913
Feature activation+1.94e-01

Pos logit contributions
ags+0.632
c+0.605
aces+0.568
icer+0.559
my+0.538
Neg logit contributions
 possible-0.404
fire-0.379
 reality-0.361
lan-0.350
 portal-0.347
Token of
Token ID286
Feature activation+4.01e-02

Pos logit contributions
 possible+0.270
fire+0.260
 reality+0.242
 unexpected+0.240
lan+0.240
Neg logit contributions
ags-0.417
c-0.405
icer-0.378
aces-0.377
my-0.358
Token soup
Token ID17141
Feature activation-6.40e-02

Pos logit contributions
 possible+0.068
 unexpected+0.062
 reality+0.061
 miracle+0.058
fire+0.058
Neg logit contributions
c-0.076
aces-0.072
ags-0.072
icer-0.067
aters-0.064
Token to
Token ID284
Feature activation+4.68e-02

Pos logit contributions
ags+0.127
c+0.123
aces+0.115
icer+0.110
my+0.109
Neg logit contributions
 possible-0.102
fire-0.096
 reality-0.094
 portal-0.092
eware-0.091
Token his
Token ID465
Feature activation-1.30e-01

Pos logit contributions
 possible+0.078
fire+0.073
 reality+0.073
 portal+0.072
eware+0.072
Neg logit contributions
ags-0.090
c-0.087
aces-0.080
my-0.077
icer-0.077
TokenThat
Token ID2504
Feature activation+2.24e-02

Pos logit contributions
ags+0.080
c+0.077
aces+0.072
my+0.069
icer+0.068
Neg logit contributions
 possible-0.067
fire-0.063
 reality-0.063
 portal-0.061
eware-0.061
Token is
Token ID318
Feature activation-5.28e-02

Pos logit contributions
 possible+0.050
fire+0.047
 reality+0.047
 portal+0.046
lan+0.046
Neg logit contributions
ags-0.033
c-0.028
aces-0.027
icer-0.025
aters-0.024
Token a
Token ID257
Feature activation+9.63e-02

Pos logit contributions
ags+0.100
c+0.093
aces+0.089
my+0.086
icer+0.082
Neg logit contributions
fire-0.085
 reality-0.084
 possible-0.083
eware-0.083
 portal-0.082
Token very
Token ID845
Feature activation-2.57e-02

Pos logit contributions
 possible+0.162
eware+0.161
fire+0.154
 portal+0.146
 reality+0.145
Neg logit contributions
ags-0.170
c-0.165
icer-0.152
my-0.149
aces-0.143
Token special
Token ID2041
Feature activation+1.83e-02

Pos logit contributions
ags+0.039
c+0.035
my+0.031
aces+0.031
 hair+0.031
Neg logit contributions
 reality-0.048
fire-0.047
 possible-0.047
eware-0.046
 portal-0.045
Token spoon
Token ID24556
Feature activation-2.86e-01

Pos logit contributions
 possible+0.041
fire+0.039
 reality+0.039
eware+0.039
 portal+0.039
Neg logit contributions
ags-0.023
c-0.023
aces-0.020
icer-0.020
my-0.019
Token.
Token ID13
Feature activation-1.39e-02

Pos logit contributions
c+0.529
ags+0.515
aces+0.500
my+0.475
icer+0.439
Neg logit contributions
 reality-0.468
 possible-0.461
 portal-0.456
eware-0.437
 unexpected-0.432
Token Maybe
Token ID6674
Feature activation+1.19e-01

Pos logit contributions
ags+0.020
c+0.019
aces+0.018
icer+0.016
my+0.016
Neg logit contributions
 possible-0.029
fire-0.028
 reality-0.028
 unexpected-0.027
 portal-0.027
Token it
Token ID340
Feature activation-6.43e-02

Pos logit contributions
fire+0.210
 portal+0.206
eware+0.202
 traveler+0.196
 reality+0.190
Neg logit contributions
c-0.199
my-0.184
ags-0.181
 hands-0.178
 hair-0.174
Token will
Token ID481
Feature activation-6.05e-02

Pos logit contributions
ags+0.115
c+0.095
aces+0.094
icer+0.092
aters+0.091
Neg logit contributions
 possible-0.122
fire-0.117
 unexpected-0.113
 reality-0.111
 portal-0.110
Token make
Token ID787
Feature activation-6.31e-02

Pos logit contributions
ags+0.107
c+0.103
aces+0.094
my+0.094
icer+0.085
Neg logit contributions
 possible-0.101
eware-0.100
lan-0.098
fire-0.098
 portal-0.097
Token.
Token ID13
Feature activation-1.57e-02

Pos logit contributions
 possible+0.028
eware+0.027
 reality+0.027
fire+0.027
lan+0.026
Neg logit contributions
ags-0.026
c-0.025
aces-0.025
my-0.022
icer-0.020
Token They
Token ID1119
Feature activation+1.81e-02

Pos logit contributions
ags+0.025
c+0.025
aces+0.022
icer+0.021
my+0.021
Neg logit contributions
 possible-0.031
fire-0.030
 reality-0.029
 portal-0.029
eware-0.028
Token found
Token ID1043
Feature activation+5.39e-02

Pos logit contributions
 possible+0.032
fire+0.031
 reality+0.030
 portal+0.030
 unexpected+0.029
Neg logit contributions
ags-0.033
c-0.031
aces-0.029
icer-0.028
my-0.027
Token a
Token ID257
Feature activation+1.01e-01

Pos logit contributions
 possible+0.085
fire+0.081
 reality+0.080
 portal+0.078
eware+0.077
Neg logit contributions
ags-0.108
c-0.105
aces-0.098
my-0.093
icer-0.092
Token big
Token ID1263
Feature activation+6.11e-02

Pos logit contributions
 possible+0.186
fire+0.177
 reality+0.173
eware+0.173
lan+0.172
Neg logit contributions
c-0.166
ags-0.165
my-0.146
aces-0.145
icer-0.142
Token spoon
Token ID24556
Feature activation-2.86e-01

Pos logit contributions
 possible+0.127
fire+0.120
 reality+0.119
eware+0.117
lan+0.117
Neg logit contributions
ags-0.091
c-0.089
aces-0.079
my-0.077
icer-0.076
Token and
Token ID290
Feature activation-1.03e-01

Pos logit contributions
c+0.483
ags+0.477
aces+0.446
my+0.423
icer+0.404
Neg logit contributions
 possible-0.528
 reality-0.510
 portal-0.494
fire-0.492
eware-0.491
Token a
Token ID257
Feature activation+1.01e-01

Pos logit contributions
c+0.167
ags+0.159
aces+0.156
my+0.146
 hair+0.139
Neg logit contributions
 possible-0.199
 reality-0.185
fire-0.184
 portal-0.183
eware-0.181
Token bowl
Token ID9396
Feature activation-2.44e-01

Pos logit contributions
 possible+0.202
fire+0.193
 reality+0.192
 portal+0.187
eware+0.185
Neg logit contributions
ags-0.161
c-0.154
aces-0.142
icer-0.132
my-0.131
Token.
Token ID13
Feature activation-1.66e-02

Pos logit contributions
ags+0.408
c+0.408
aces+0.377
my+0.364
icer+0.338
Neg logit contributions
 possible-0.449
 reality-0.441
eware-0.428
 portal-0.423
 unexpected-0.419
Token They
Token ID1119
Feature activation+1.87e-02

Pos logit contributions
ags+0.024
c+0.023
aces+0.021
icer+0.020
my+0.019
Neg logit contributions
 possible-0.036
 reality-0.034
fire-0.034
 portal-0.033
 unexpected-0.033
Token a
Token ID257
Feature activation+1.05e-01

Pos logit contributions
ags+0.201
icer+0.189
aces+0.183
c+0.182
rell+0.174
Neg logit contributions
 possible-0.094
 reality-0.088
 unexpected-0.084
lan-0.078
 that-0.078
Token band
Token ID4097
Feature activation-1.16e-01

Pos logit contributions
 possible+0.174
 reality+0.164
 portal+0.163
 miracle+0.161
fire+0.160
Neg logit contributions
ags-0.204
aces-0.192
c-0.185
icer-0.182
aters-0.167
Tokenage
Token ID496
Feature activation+4.51e-02

Pos logit contributions
ags+0.219
c+0.200
icer+0.199
aces+0.192
my+0.180
Neg logit contributions
fire-0.194
 possible-0.194
eware-0.181
lan-0.180
 traveler-0.178
Token and
Token ID290
Feature activation-9.37e-02

Pos logit contributions
 possible+0.085
fire+0.082
 reality+0.079
 portal+0.078
lan+0.078
Neg logit contributions
ags-0.075
c-0.072
icer-0.067
aces-0.065
my-0.061
Token a
Token ID257
Feature activation+1.03e-01

Pos logit contributions
c+0.157
ags+0.153
aces+0.143
my+0.137
icer+0.124
Neg logit contributions
 possible-0.175
fire-0.167
 portal-0.163
eware-0.162
 reality-0.162
Token spoon
Token ID24556
Feature activation-2.85e-01

Pos logit contributions
 possible+0.196
 reality+0.189
fire+0.187
 portal+0.184
 miracle+0.182
Neg logit contributions
ags-0.173
c-0.159
aces-0.158
icer-0.150
my-0.136
Token.
Token ID13
Feature activation-7.14e-04

Pos logit contributions
ags+0.579
c+0.564
aces+0.528
icer+0.502
my+0.502
Neg logit contributions
 possible-0.439
fire-0.413
 reality-0.408
 portal-0.397
eware-0.393
Token They
Token ID1119
Feature activation+2.84e-02

Pos logit contributions
ags+0.001
c+0.001
aces+0.001
icer+0.001
my+0.001
Neg logit contributions
 possible-0.001
fire-0.001
 reality-0.001
 portal-0.001
lan-0.001
Token said
Token ID531
Feature activation-9.32e-02

Pos logit contributions
fire+0.042
 reality+0.040
 unexpected+0.040
 possible+0.040
lan+0.039
Neg logit contributions
ags-0.062
c-0.054
icer-0.052
aces-0.052
aters-0.048
Token she
Token ID673
Feature activation+1.12e-01

Pos logit contributions
ags+0.190
c+0.185
aces+0.180
my+0.165
 hair+0.160
Neg logit contributions
 possible-0.138
fire-0.133
 portal-0.128
 reality-0.128
eware-0.125
Token was
Token ID373
Feature activation-3.58e-02

Pos logit contributions
 possible+0.194
fire+0.189
 reality+0.186
 unexpected+0.183
eware+0.181
Neg logit contributions
ags-0.211
c-0.187
aces-0.176
icer-0.175
rell-0.163
Token out
Token ID503
Feature activation+2.70e-02

Pos logit contributions
ags+0.111
c+0.108
aces+0.101
icer+0.096
my+0.096
Neg logit contributions
 possible-0.087
fire-0.082
 reality-0.080
 portal-0.078
eware-0.078
Token a
Token ID257
Feature activation+1.05e-01

Pos logit contributions
 possible+0.034
 reality+0.033
lan+0.032
 unexpected+0.031
 miracle+0.031
Neg logit contributions
ags-0.059
c-0.059
aces-0.057
icer-0.053
aters-0.052
Token bottle
Token ID9294
Feature activation-1.30e-01

Pos logit contributions
fire+0.190
 possible+0.190
 portal+0.180
 unexpected+0.179
 reality+0.179
Neg logit contributions
ags-0.189
aces-0.177
c-0.170
icer-0.158
aters-0.154
Token and
Token ID290
Feature activation-9.87e-02

Pos logit contributions
ags+0.253
c+0.245
aces+0.229
icer+0.219
my+0.216
Neg logit contributions
 possible-0.211
fire-0.201
 reality-0.196
 portal-0.191
eware-0.190
Token a
Token ID257
Feature activation+1.04e-01

Pos logit contributions
c+0.155
ags+0.150
aces+0.137
my+0.132
 hands+0.120
Neg logit contributions
 possible-0.198
 reality-0.184
 portal-0.183
fire-0.182
eware-0.181
Token spoon
Token ID24556
Feature activation-2.85e-01

Pos logit contributions
 possible+0.206
fire+0.201
 reality+0.197
 portal+0.193
 unexpected+0.190
Neg logit contributions
ags-0.167
c-0.156
aces-0.150
icer-0.139
my-0.135
Token from
Token ID422
Feature activation+9.76e-02

Pos logit contributions
ags+0.600
c+0.579
aces+0.543
icer+0.525
my+0.515
Neg logit contributions
 possible-0.422
fire-0.400
 reality-0.387
 portal-0.375
eware-0.375
Token his
Token ID465
Feature activation-1.24e-01

Pos logit contributions
 possible+0.192
fire+0.186
eware+0.186
 portal+0.183
 reality+0.182
Neg logit contributions
ags-0.161
c-0.148
aces-0.134
my-0.133
icer-0.130
Token bag
Token ID6131
Feature activation-1.17e-01

Pos logit contributions
ags+0.188
c+0.182
aces+0.165
icer+0.155
my+0.154
Neg logit contributions
 possible-0.258
fire-0.246
 reality-0.244
 portal-0.239
eware-0.238
Token.
Token ID13
Feature activation-1.36e-02

Pos logit contributions
ags+0.248
c+0.240
aces+0.223
icer+0.219
my+0.212
Neg logit contributions
 possible-0.168
fire-0.161
 reality-0.153
 portal-0.150
eware-0.149
Token He
Token ID679
Feature activation-5.06e-03

Pos logit contributions
ags+0.020
c+0.019
aces+0.018
my+0.017
icer+0.017
Neg logit contributions
 possible-0.029
fire-0.027
 reality-0.027
 unexpected-0.027
 portal-0.026
Token big
Token ID1263
Feature activation+6.75e-02

Pos logit contributions
 possible+0.184
eware+0.177
fire+0.176
lan+0.171
 portal+0.168
Neg logit contributions
c-0.181
ags-0.173
my-0.163
 hair-0.155
icer-0.153
Token impact
Token ID2928
Feature activation+7.72e-02

Pos logit contributions
 possible+0.127
fire+0.120
eware+0.118
lan+0.116
 reality+0.114
Neg logit contributions
ags-0.112
c-0.111
my-0.100
icer-0.098
aces-0.098
Token.
Token ID13
Feature activation+3.08e-03

Pos logit contributions
 possible+0.123
fire+0.116
 reality+0.114
 portal+0.111
eware+0.110
Neg logit contributions
ags-0.153
c-0.149
aces-0.139
icer-0.133
my-0.132
Token The
Token ID383
Feature activation-8.31e-02

Pos logit contributions
 possible+0.005
fire+0.005
 reality+0.005
 portal+0.005
eware+0.005
Neg logit contributions
ags-0.006
c-0.005
aces-0.005
icer-0.005
my-0.005
Token small
Token ID1402
Feature activation-2.05e-02

Pos logit contributions
ags+0.151
c+0.146
aces+0.136
icer+0.132
my+0.131
Neg logit contributions
 possible-0.145
fire-0.138
 reality-0.134
 portal-0.134
eware-0.133
Token spoon
Token ID24556
Feature activation-2.85e-01

Pos logit contributions
ags+0.029
c+0.029
aces+0.026
icer+0.025
my+0.025
Neg logit contributions
 possible-0.044
fire-0.042
 reality-0.041
 portal-0.041
eware-0.040
Token may
Token ID743
Feature activation-6.81e-02

Pos logit contributions
ags+0.479
c+0.447
aces+0.420
icer+0.404
my+0.385
Neg logit contributions
 possible-0.544
fire-0.522
 reality-0.508
 portal-0.499
 unexpected-0.496
Token have
Token ID423
Feature activation-1.84e-02

Pos logit contributions
c+0.117
ags+0.114
my+0.108
aces+0.103
rell+0.098
Neg logit contributions
fire-0.113
lan-0.113
eware-0.113
 portal-0.111
 possible-0.110
Token seemed
Token ID3947
Feature activation-3.54e-02

Pos logit contributions
c+0.033
ags+0.033
aces+0.031
my+0.030
rell+0.028
Neg logit contributions
fire-0.030
 portal-0.029
 possible-0.029
 traveler-0.028
eware-0.028
Token insignificant
Token ID32081
Feature activation+1.07e-01

Pos logit contributions
c+0.060
ags+0.059
aces+0.057
my+0.055
 hair+0.049
Neg logit contributions
fire-0.059
 portal-0.057
 traveler-0.055
lan-0.054
eware-0.054
Token,
Token ID11
Feature activation+6.10e-02

Pos logit contributions
fire+0.201
eware+0.197
 portal+0.195
lan+0.192
 traveler+0.192
Neg logit contributions
c-0.164
aces-0.156
ags-0.154
my-0.150
 hands-0.131
Token villagers
Token ID25263
Feature activation+1.70e-01

Pos logit contributions
ags+0.119
c+0.112
aces+0.105
my+0.104
icer+0.102
Neg logit contributions
 possible-0.085
eware-0.079
 reality-0.077
lan-0.076
fire-0.076
Token ran
Token ID4966
Feature activation+7.77e-02

Pos logit contributions
 possible+0.308
fire+0.297
 reality+0.290
 portal+0.284
 unexpected+0.283
Neg logit contributions
ags-0.299
c-0.286
aces-0.266
icer-0.256
my-0.248
Token away
Token ID1497
Feature activation+2.19e-02

Pos logit contributions
 reality+0.189
 possible+0.183
lan+0.178
 miracle+0.170
 portal+0.170
Neg logit contributions
ags-0.096
my-0.079
c-0.078
aces-0.064
rell-0.064
Token to
Token ID284
Feature activation+5.80e-02

Pos logit contributions
 reality+0.046
eware+0.045
lan+0.045
 possible+0.045
 traveler+0.044
Neg logit contributions
ags-0.025
c-0.024
aces-0.021
 arms-0.021
 hands-0.021
Token find
Token ID1064
Feature activation+4.00e-02

Pos logit contributions
eware+0.110
 possible+0.106
lan+0.105
fire+0.100
 portal+0.099
Neg logit contributions
ags-0.100
c-0.087
my-0.079
 hands-0.078
 hair-0.077
Token safety
Token ID3747
Feature activation+1.92e-01

Pos logit contributions
 possible+0.068
fire+0.066
eware+0.066
 portal+0.065
 reality+0.065
Neg logit contributions
ags-0.075
c-0.071
my-0.062
aces-0.062
 hands-0.059
Token.
Token ID13
Feature activation+1.81e-02

Pos logit contributions
 possible+0.316
 reality+0.307
fire+0.303
 portal+0.298
eware+0.296
Neg logit contributions
ags-0.360
c-0.357
aces-0.333
my-0.316
icer-0.296
Token
Token ID198
Feature activation+7.10e-03

Pos logit contributions
 possible+0.038
 reality+0.036
fire+0.035
 portal+0.034
 unexpected+0.034
Neg logit contributions
ags-0.028
c-0.026
aces-0.024
my-0.023
icer-0.023
Token
Token ID198
Feature activation+5.71e-03

Pos logit contributions
 possible+0.014
 reality+0.014
fire+0.013
 portal+0.013
eware+0.013
Neg logit contributions
ags-0.011
c-0.011
aces-0.010
my-0.009
icer-0.009
TokenBut
Token ID1537
Feature activation+2.45e-02

Pos logit contributions
 possible+0.011
 reality+0.011
lan+0.010
eware+0.009
 portal+0.009
Neg logit contributions
c-0.009
ags-0.009
my-0.008
icer-0.008
aces-0.008
Token one
Token ID530
Feature activation-1.33e-02

Pos logit contributions
 possible+0.042
 reality+0.040
eware+0.040
 portal+0.038
lan+0.038
Neg logit contributions
c-0.043
ags-0.040
my-0.038
aces-0.035
 hands-0.035
Token Then
Token ID3244
Feature activation+8.20e-02

Pos logit contributions
 possible+0.042
eware+0.040
fall+0.040
fire+0.037
 portal+0.037
Neg logit contributions
ags-0.066
c-0.062
ishes-0.059
aces-0.059
wy-0.058
Token he
Token ID339
Feature activation+7.75e-02

Pos logit contributions
fire+0.148
 possible+0.142
 reality+0.141
 portal+0.139
lan+0.133
Neg logit contributions
c-0.143
my-0.139
ags-0.128
 hair-0.126
aces-0.125
Token noticed
Token ID6810
Feature activation+7.19e-02

Pos logit contributions
 possible+0.141
fire+0.132
 reality+0.127
 portal+0.125
eware+0.124
Neg logit contributions
c-0.143
my-0.130
ags-0.127
aces-0.123
icer-0.116
Token a
Token ID257
Feature activation+1.23e-01

Pos logit contributions
 possible+0.114
fire+0.113
 portal+0.110
eware+0.107
 traveler+0.104
Neg logit contributions
c-0.138
my-0.128
ags-0.128
aces-0.125
icer-0.114
Token spark
Token ID9009
Feature activation+5.03e-03

Pos logit contributions
 possible+0.212
eware+0.198
lan+0.193
fire+0.193
 reality+0.181
Neg logit contributions
c-0.214
ags-0.196
my-0.191
icer-0.190
 tant-0.176
Tokenly
Token ID306
Feature activation+2.19e-01

Pos logit contributions
 possible+0.009
fire+0.009
 reality+0.009
eware+0.009
lan+0.009
Neg logit contributions
c-0.008
ags-0.008
aces-0.008
my-0.007
icer-0.007
Token one
Token ID530
Feature activation-7.52e-03

Pos logit contributions
 possible+0.409
fire+0.379
eware+0.378
 reality+0.374
lan+0.370
Neg logit contributions
c-0.369
ags-0.353
aces-0.327
my-0.318
icer-0.312
Token floating
Token ID12462
Feature activation+3.57e-02

Pos logit contributions
c+0.016
ags+0.016
aces+0.015
my+0.015
icer+0.014
Neg logit contributions
 possible-0.010
fire-0.010
 reality-0.010
eware-0.010
 portal-0.009
Token in
Token ID287
Feature activation-3.02e-02

Pos logit contributions
 possible+0.068
 reality+0.064
fire+0.064
eware+0.063
 unexpected+0.062
Neg logit contributions
ags-0.059
c-0.059
aces-0.053
my-0.050
icer-0.050
Token the
Token ID262
Feature activation-4.71e-02

Pos logit contributions
ags+0.048
c+0.047
my+0.042
icer+0.041
aces+0.038
Neg logit contributions
eware-0.061
 possible-0.059
fire-0.058
lan-0.055
 portal-0.055
Token sea
Token ID5417
Feature activation+1.25e-01

Pos logit contributions
ags+0.089
c+0.086
aces+0.082
icer+0.077
my+0.076
Neg logit contributions
 possible-0.079
fire-0.076
 reality-0.074
 portal-0.073
lan-0.071
Token's
Token ID338
Feature activation-2.05e-01

Pos logit contributions
c+0.036
ags+0.035
aces+0.031
my+0.030
icer+0.029
Neg logit contributions
 possible-0.065
fire-0.062
 reality-0.061
 portal-0.060
lan-0.059
Token mom
Token ID1995
Feature activation-4.14e-02

Pos logit contributions
ags+0.342
c+0.325
my+0.300
aces+0.299
icer+0.288
Neg logit contributions
 possible-0.397
fire-0.376
 reality-0.360
 portal-0.357
eware-0.352
Tokenmy
Token ID1820
Feature activation-1.79e-01

Pos logit contributions
ags+0.050
aces+0.043
icer+0.033
c+0.031
 tant+0.029
Neg logit contributions
 planned-0.100
 possible-0.099
 soon-0.097
 that-0.094
 granted-0.093
Token told
Token ID1297
Feature activation-7.23e-02

Pos logit contributions
ags+0.303
c+0.300
aces+0.272
my+0.266
icer+0.259
Neg logit contributions
 possible-0.333
fire-0.315
 reality-0.312
 portal-0.305
eware-0.301
Token her
Token ID607
Feature activation-7.50e-02

Pos logit contributions
ags+0.099
c+0.093
my+0.082
aces+0.081
icer+0.075
Neg logit contributions
 possible-0.164
fire-0.160
 reality-0.155
 portal-0.153
lan-0.151
Token that
Token ID326
Feature activation+1.96e-01

Pos logit contributions
c+0.137
my+0.134
aces+0.130
 hands+0.124
 hair+0.124
Neg logit contributions
fire-0.117
 possible-0.105
 portal-0.103
 reality-0.100
 traveler-0.100
Token they
Token ID484
Feature activation+8.49e-02

Pos logit contributions
 possible+0.217
fire+0.200
 reality+0.195
 portal+0.187
eware+0.185
Neg logit contributions
ags-0.487
c-0.475
aces-0.450
icer-0.434
my-0.431
Token were
Token ID547
Feature activation-7.78e-02

Pos logit contributions
 possible+0.143
 unexpected+0.136
 reality+0.136
fire+0.135
eware+0.131
Neg logit contributions
ags-0.163
c-0.155
aces-0.144
icer-0.139
aters-0.130
Token going
Token ID1016
Feature activation+7.19e-02

Pos logit contributions
ags+0.158
c+0.152
aces+0.141
my+0.138
icer+0.136
Neg logit contributions
 possible-0.120
fire-0.116
 reality-0.112
 portal-0.112
eware-0.112
Token to
Token ID284
Feature activation+5.72e-02

Pos logit contributions
 reality+0.118
 possible+0.118
fire+0.116
 portal+0.114
 unexpected+0.112
Neg logit contributions
ags-0.141
c-0.136
my-0.125
aces-0.121
aters-0.112
Token have
Token ID423
Feature activation-1.05e-02

Pos logit contributions
 possible+0.098
eware+0.096
fire+0.095
 portal+0.094
lan+0.093
Neg logit contributions
c-0.101
ags-0.099
my-0.090
icer-0.082
aces-0.081
Token something
Token ID1223
Feature activation+1.55e-01

Pos logit contributions
 possible+0.206
fire+0.196
 reality+0.193
 portal+0.188
eware+0.187
Neg logit contributions
ags-0.219
c-0.212
aces-0.197
my-0.186
icer-0.185
Token important
Token ID1593
Feature activation+6.72e-02

Pos logit contributions
eware+0.279
fire+0.278
 portal+0.275
 reality+0.269
 possible+0.268
Neg logit contributions
ags-0.223
c-0.214
 hair-0.199
aces-0.193
 hands-0.193
Token.
Token ID13
Feature activation+2.46e-02

Pos logit contributions
 possible+0.107
 reality+0.105
fire+0.100
 portal+0.100
 unexpected+0.099
Neg logit contributions
ags-0.128
c-0.124
aces-0.124
my-0.111
aters-0.105
Token She
Token ID1375
Feature activation+5.67e-02

Pos logit contributions
 possible+0.041
 reality+0.039
 portal+0.039
eware+0.038
fire+0.038
Neg logit contributions
ags-0.046
c-0.043
aces-0.041
icer-0.041
ates-0.037
Token knew
Token ID2993
Feature activation+4.89e-02

Pos logit contributions
 possible+0.111
fire+0.104
 reality+0.103
eware+0.101
 portal+0.101
Neg logit contributions
ags-0.092
c-0.091
aces-0.083
my-0.079
icer-0.078
Token that
Token ID326
Feature activation+1.98e-01

Pos logit contributions
fire+0.062
eware+0.057
 possible+0.054
 portal+0.053
fall+0.053
Neg logit contributions
c-0.112
ags-0.107
aces-0.105
my-0.105
 hair-0.096
Token there
Token ID612
Feature activation+2.35e-02

Pos logit contributions
 possible+0.265
fire+0.249
 reality+0.243
 portal+0.236
eware+0.235
Neg logit contributions
ags-0.441
c-0.434
aces-0.407
my-0.391
icer-0.390
Token was
Token ID373
Feature activation-1.91e-02

Pos logit contributions
 possible+0.044
fire+0.041
 reality+0.041
 unexpected+0.040
 portal+0.040
Neg logit contributions
ags-0.043
c-0.039
aces-0.036
icer-0.035
my-0.033
Token a
Token ID257
Feature activation+1.17e-01

Pos logit contributions
c+0.034
ags+0.031
aces+0.030
my+0.030
 hair+0.027
Neg logit contributions
fire-0.033
 portal-0.031
fall-0.030
 possible-0.030
lan-0.030
Token war
Token ID1175
Feature activation+1.12e-01

Pos logit contributions
 possible+0.213
eware+0.213
lan+0.198
uring+0.195
fire+0.192
Neg logit contributions
c-0.172
my-0.154
icer-0.151
 mustache-0.149
 doll-0.148
Token going
Token ID1016
Feature activation+7.12e-02

Pos logit contributions
lan+0.213
 possible+0.212
eware+0.205
 reality+0.204
 traveler+0.194
Neg logit contributions
c-0.169
aces-0.157
ags-0.155
my-0.152
rell-0.127
Token would
Token ID561
Feature activation-7.28e-02

Pos logit contributions
 possible+0.136
fire+0.130
 reality+0.128
eware+0.126
 portal+0.125
Neg logit contributions
ags-0.138
c-0.136
aces-0.126
my-0.121
icer-0.116
Token never
Token ID1239
Feature activation+1.29e-01

Pos logit contributions
ags+0.138
c+0.135
aces+0.125
my+0.124
icer+0.115
Neg logit contributions
eware-0.114
 possible-0.114
fire-0.111
lan-0.110
 portal-0.105
Token forget
Token ID6044
Feature activation+9.13e-02

Pos logit contributions
fire+0.207
lan+0.200
 portal+0.198
 reality+0.190
 traveler+0.189
Neg logit contributions
c-0.247
my-0.241
aces-0.218
ags-0.216
rell-0.203
Token.
Token ID13
Feature activation-1.69e-03

Pos logit contributions
 possible+0.138
fire+0.132
 reality+0.130
eware+0.129
 portal+0.128
Neg logit contributions
ags-0.183
c-0.180
aces-0.168
my-0.160
icer-0.156
Token<|endoftext|>
Token ID50256
Feature activation-5.40e-02

Pos logit contributions
ags+0.003
c+0.003
aces+0.003
icer+0.003
my+0.003
Neg logit contributions
 possible-0.003
fire-0.003
 reality-0.003
lan-0.003
 portal-0.003
TokenA
Token ID32
Feature activation+2.44e-01

Pos logit contributions
ags+0.108
c+0.102
aces+0.097
icer+0.091
my+0.089
Neg logit contributions
 possible-0.084
fire-0.081
 portal-0.077
 reality-0.077
eware-0.075
Token human
Token ID1692
Feature activation+9.03e-02

Pos logit contributions
 reality+0.439
 possible+0.436
 portal+0.431
 clues+0.428
eware+0.423
Neg logit contributions
c-0.381
my-0.375
icer-0.297
 -0.293
ags-0.278
Token was
Token ID373
Feature activation-1.80e-02

Pos logit contributions
 possible+0.184
 reality+0.169
fire+0.168
eware+0.165
 portal+0.161
Neg logit contributions
c-0.149
my-0.134
ags-0.130
aces-0.124
icer-0.115
Token walking
Token ID6155
Feature activation+7.73e-02

Pos logit contributions
ags+0.035
c+0.033
aces+0.032
icer+0.030
aters+0.029
Neg logit contributions
 possible-0.029
fire-0.027
 reality-0.027
 unexpected-0.026
 portal-0.026
Token through
Token ID832
Feature activation+3.09e-02

Pos logit contributions
 possible+0.151
fire+0.145
 reality+0.142
 portal+0.139
eware+0.138
Neg logit contributions
ags-0.125
c-0.121
aces-0.112
icer-0.105
my-0.103
Token a
Token ID257
Feature activation+1.21e-01

Pos logit contributions
 possible+0.068
eware+0.067
lan+0.063
fire+0.063
 traveler+0.062
Neg logit contributions
c-0.043
ags-0.037
my-0.037
 hands-0.033
icer-0.033
Token
Token ID251
Feature activation+2.07e-02

Pos logit contributions
ags+0.025
aces+0.024
c+0.024
my+0.021
aters+0.021
Neg logit contributions
fire-0.019
 possible-0.019
 reality-0.018
 portal-0.018
 unexpected-0.018
Token but
Token ID475
Feature activation-1.03e-01

Pos logit contributions
 possible+0.042
fire+0.040
 reality+0.040
 portal+0.039
eware+0.039
Neg logit contributions
ags-0.032
c-0.032
aces-0.028
my-0.027
icer-0.026
Token this
Token ID428
Feature activation+1.11e-02

Pos logit contributions
c+0.193
ags+0.187
my+0.171
aces+0.171
icer+0.158
Neg logit contributions
 possible-0.167
fire-0.161
eware-0.159
 reality-0.158
lan-0.156
Token sunset
Token ID26428
Feature activation+7.18e-02

Pos logit contributions
 possible+0.020
fire+0.019
 reality+0.018
eware+0.018
lan+0.018
Neg logit contributions
c-0.020
ags-0.019
aces-0.017
my-0.017
icer-0.017
Token was
Token ID373
Feature activation-2.71e-02

Pos logit contributions
 possible+0.141
 reality+0.134
fire+0.131
eware+0.130
lan+0.130
Neg logit contributions
c-0.112
ags-0.105
aces-0.102
my-0.097
icer-0.089
Token not
Token ID407
Feature activation+6.20e-02

Pos logit contributions
ags+0.054
c+0.053
aces+0.049
my+0.048
icer+0.046
Neg logit contributions
 possible-0.042
fire-0.040
eware-0.039
 portal-0.039
 reality-0.039
Token beautiful
Token ID4950
Feature activation+6.10e-02

Pos logit contributions
eware+0.098
fire+0.097
lan+0.097
 portal+0.094
 possible+0.093
Neg logit contributions
c-0.113
ags-0.113
my-0.106
aces-0.105
icer-0.094
Token
Token ID6184
Feature activation+3.94e-02

Pos logit contributions
fire+0.109
eware+0.109
 portal+0.107
 traveler+0.105
lan+0.105
Neg logit contributions
c-0.098
aces-0.093
ags-0.091
my-0.090
 hair-0.079
Token
Token ID95
Feature activation+5.36e-02

Pos logit contributions
 possible+0.072
 reality+0.069
 unexpected+0.065
 clues+0.064
 traveler+0.064
Neg logit contributions
ags-0.067
c-0.066
aces-0.063
icer-0.062
rell-0.054
Token
Token ID26391
Feature activation-1.47e-02

Pos logit contributions
 possible+0.098
fire+0.094
 reality+0.092
 portal+0.090
eware+0.090
Neg logit contributions
ags-0.094
c-0.091
aces-0.084
icer-0.079
my-0.079
Token
Token ID447
Feature activation-1.11e-02

Pos logit contributions
ags+0.027
c+0.026
aces+0.024
icer+0.023
my+0.023
Neg logit contributions
 possible-0.026
fire-0.024
 reality-0.024
 portal-0.023
eware-0.023
Token to
Token ID284
Feature activation+5.25e-02

Pos logit contributions
ags+0.062
c+0.059
my+0.057
aces+0.054
 hair+0.050
Neg logit contributions
fire-0.048
 reality-0.047
 portal-0.046
 possible-0.046
 traveler-0.044
Token laugh
Token ID6487
Feature activation-1.70e-01

Pos logit contributions
fire+0.094
 portal+0.090
eware+0.090
 possible+0.089
 reality+0.089
Neg logit contributions
ags-0.091
c-0.087
my-0.081
aces-0.073
 hair-0.073
Token with
Token ID351
Feature activation-4.43e-02

Pos logit contributions
ags+0.316
c+0.307
aces+0.285
icer+0.272
my+0.269
Neg logit contributions
 possible-0.292
fire-0.277
 reality-0.274
 portal-0.267
eware-0.264
Token me
Token ID502
Feature activation+3.27e-02

Pos logit contributions
ags+0.080
c+0.078
aces+0.072
icer+0.068
my+0.068
Neg logit contributions
 possible-0.078
fire-0.075
 reality-0.073
 portal-0.072
eware-0.071
Token?"
Token ID1701
Feature activation-2.92e-02

Pos logit contributions
 possible+0.047
 reality+0.043
fire+0.043
eware+0.042
 portal+0.040
Neg logit contributions
ags-0.069
c-0.066
aces-0.064
icer-0.062
rell-0.059
Token Mary
Token ID5335
Feature activation+4.91e-02

Pos logit contributions
ags+0.054
c+0.054
aces+0.049
my+0.047
icer+0.046
Neg logit contributions
 possible-0.050
 reality-0.047
fire-0.047
 portal-0.045
eware-0.045
Token smiled
Token ID13541
Feature activation-1.05e-01

Pos logit contributions
fire+0.077
 possible+0.075
 traveler+0.074
 unexpected+0.073
 portal+0.073
Neg logit contributions
ags-0.098
icer-0.085
c-0.084
aces-0.083
aters-0.078
Token and
Token ID290
Feature activation-9.52e-02

Pos logit contributions
ags+0.139
c+0.134
aces+0.120
my+0.117
icer+0.105
Neg logit contributions
 possible-0.236
fire-0.229
 reality-0.222
eware-0.222
 portal-0.220
Token she
Token ID673
Feature activation+1.06e-01

Pos logit contributions
c+0.125
ags+0.122
aces+0.111
my+0.106
rell+0.093
Neg logit contributions
 possible-0.211
fire-0.206
 reality-0.203
eware-0.197
 portal-0.196
Token and
Token ID290
Feature activation-9.53e-02

Pos logit contributions
 possible+0.195
fire+0.186
 reality+0.184
 portal+0.180
eware+0.179
Neg logit contributions
ags-0.186
c-0.176
aces-0.164
icer-0.158
my-0.153
Token her
Token ID607
Feature activation-8.24e-02

Pos logit contributions
c+0.153
ags+0.152
aces+0.139
my+0.138
 hair+0.122
Neg logit contributions
 possible-0.184
fire-0.175
eware-0.170
 reality-0.170
 portal-0.166
Token never
Token ID1239
Feature activation+1.33e-01

Pos logit contributions
c+0.085
ags+0.079
my+0.076
aces+0.074
mine+0.071
Neg logit contributions
eware-0.083
fire-0.075
 clues-0.075
 possible-0.074
lan-0.073
Token forget
Token ID6044
Feature activation+9.40e-02

Pos logit contributions
fire+0.250
lan+0.241
 portal+0.231
eware+0.226
 traveler+0.221
Neg logit contributions
my-0.215
c-0.214
aces-0.181
rell-0.177
's-0.173
Token his
Token ID465
Feature activation-1.19e-01

Pos logit contributions
fire+0.153
 possible+0.150
eware+0.150
 portal+0.146
 traveler+0.142
Neg logit contributions
ags-0.173
c-0.172
aces-0.159
my-0.154
icer-0.143
Token journey
Token ID7002
Feature activation+1.75e-01

Pos logit contributions
ags+0.238
c+0.236
aces+0.214
my+0.206
 hands+0.203
Neg logit contributions
fire-0.165
 possible-0.164
eware-0.164
lan-0.148
fall-0.147
Token to
Token ID284
Feature activation+5.55e-02

Pos logit contributions
eware+0.292
fire+0.272
lan+0.270
 portal+0.268
 unexpected+0.265
Neg logit contributions
c-0.310
ags-0.298
aces-0.296
my-0.281
rell-0.242
Token a
Token ID257
Feature activation+1.12e-01

Pos logit contributions
eware+0.100
fire+0.091
lan+0.090
 possible+0.084
fall+0.083
Neg logit contributions
ags-0.100
c-0.099
my-0.085
aces-0.082
 hair-0.077
Token distant
Token ID12899
Feature activation+9.10e-02

Pos logit contributions
eware+0.206
 occurred+0.181
ross+0.172
 endings+0.169
fall+0.166
Neg logit contributions
c-0.191
my-0.164
 nic-0.142
icer-0.139
 -0.137
Token land
Token ID1956
Feature activation+1.11e-01

Pos logit contributions
eware+0.165
 possible+0.143
 occurred+0.140
lan+0.136
fire+0.132
Neg logit contributions
c-0.171
my-0.138
icer-0.123
 hair-0.119
ags-0.118
Token.
Token ID13
Feature activation+9.15e-04

Pos logit contributions
eware+0.181
 unexpected+0.170
 possible+0.169
 portal+0.166
 reality+0.165
Neg logit contributions
c-0.210
ags-0.191
aces-0.185
my-0.183
icer-0.166
Token<|endoftext|>
Token ID50256
Feature activation-5.01e-02

Pos logit contributions
 possible+0.002
fire+0.002
 reality+0.002
 portal+0.002
eware+0.002
Neg logit contributions
ags-0.002
c-0.002
aces-0.001
icer-0.001
my-0.001
TokenOnce
Token ID7454
Feature activation-1.34e-03

Pos logit contributions
ags+0.101
c+0.095
aces+0.092
icer+0.085
my+0.083
Neg logit contributions
 possible-0.076
fire-0.074
 portal-0.070
 reality-0.070
eware-0.068
Token
Token ID198
Feature activation-2.55e-02

Pos logit contributions
 possible+0.011
fire+0.010
 reality+0.010
 portal+0.010
eware+0.010
Neg logit contributions
ags-0.010
c-0.010
aces-0.009
icer-0.009
my-0.008
Token
Token ID198
Feature activation-2.07e-02

Pos logit contributions
ags+0.047
c+0.044
aces+0.043
icer+0.040
rell+0.038
Neg logit contributions
 possible-0.044
fire-0.042
 reality-0.041
 portal-0.040
 unexpected-0.040
TokenWhen
Token ID2215
Feature activation+1.02e-01

Pos logit contributions
ags+0.043
c+0.042
aces+0.039
icer+0.037
my+0.037
Neg logit contributions
 possible-0.031
fire-0.029
 reality-0.029
 portal-0.028
eware-0.028
Token they
Token ID484
Feature activation+8.52e-02

Pos logit contributions
eware+0.168
fire+0.167
 portal+0.162
 unexpected+0.161
 possible+0.158
Neg logit contributions
my-0.170
c-0.169
 hands-0.151
 hair-0.150
 -0.149
Token got
Token ID1392
Feature activation+5.23e-02

Pos logit contributions
 possible+0.139
fire+0.128
 reality+0.126
eware+0.125
lan+0.123
Neg logit contributions
ags-0.167
c-0.165
my-0.150
aces-0.150
icer-0.146
Token to
Token ID284
Feature activation+6.02e-02

Pos logit contributions
 possible+0.079
fire+0.076
 reality+0.075
 portal+0.073
eware+0.072
Neg logit contributions
ags-0.109
c-0.105
aces-0.097
my-0.096
icer-0.093
Token grandma
Token ID49890
Feature activation+1.65e-02

Pos logit contributions
eware+0.116
fire+0.115
 possible+0.114
 portal+0.109
lan+0.108
Neg logit contributions
ags-0.095
c-0.092
my-0.084
icer-0.077
aces-0.076
Token's
Token ID338
Feature activation-2.00e-01

Pos logit contributions
 possible+0.043
fire+0.042
 reality+0.042
 portal+0.041
 unexpected+0.041
Neg logit contributions
ags-0.016
c-0.014
aces-0.013
icer-0.011
my-0.010
Token house
Token ID2156
Feature activation+1.48e-02

Pos logit contributions
ags+0.326
c+0.317
aces+0.285
my+0.274
icer+0.273
Neg logit contributions
 possible-0.395
fire-0.373
 reality-0.364
eware-0.363
 portal-0.357
Token,
Token ID11
Feature activation+6.31e-02

Pos logit contributions
 possible+0.024
 portal+0.023
eware+0.023
fire+0.023
 reality+0.023
Neg logit contributions
c-0.028
ags-0.027
my-0.026
aces-0.025
icer-0.023
Token Lily
Token ID20037
Feature activation-2.70e-02

Pos logit contributions
 possible+0.093
fire+0.090
 portal+0.087
eware+0.087
 reality+0.084
Neg logit contributions
c-0.127
ags-0.123
my-0.118
aces-0.115
icer-0.110
Token it
Token ID340
Feature activation-5.36e-02

Pos logit contributions
fire+0.053
eware+0.052
lan+0.049
 portal+0.048
 diver+0.047
Neg logit contributions
c-0.094
ags-0.088
my-0.087
aces-0.086
 hair-0.075
Token would
Token ID561
Feature activation-7.01e-02

Pos logit contributions
c+0.079
ags+0.071
my+0.070
aces+0.070
icer+0.060
Neg logit contributions
 possible-0.108
fire-0.106
lan-0.106
 reality-0.105
eware-0.103
Token always
Token ID1464
Feature activation-9.06e-04

Pos logit contributions
c+0.118
ags+0.117
my+0.112
aces+0.102
rell+0.096
Neg logit contributions
lan-0.122
eware-0.122
fire-0.119
 traveler-0.113
 portal-0.113
Token remember
Token ID3505
Feature activation+4.36e-02

Pos logit contributions
c+0.002
ags+0.002
aces+0.001
my+0.001
rell+0.001
Neg logit contributions
lan-0.001
fire-0.001
eware-0.001
 possible-0.001
 reality-0.001
Token the
Token ID262
Feature activation-6.25e-02

Pos logit contributions
fire+0.071
eware+0.069
 portal+0.068
 possible+0.068
lan+0.065
Neg logit contributions
c-0.081
ags-0.080
my-0.073
aces-0.073
icer-0.064
Token great
Token ID1049
Feature activation+8.37e-02

Pos logit contributions
c+0.120
ags+0.119
aces+0.109
my+0.108
icer+0.102
Neg logit contributions
 possible-0.096
fire-0.094
eware-0.092
 reality-0.091
lan-0.091
Token wings
Token ID12098
Feature activation-1.30e-02

Pos logit contributions
fire+0.130
lan+0.129
 possible+0.127
eware+0.125
 reality+0.123
Neg logit contributions
c-0.162
ags-0.153
aces-0.146
my-0.145
 hair-0.131
Token that
Token ID326
Feature activation+1.89e-01

Pos logit contributions
c+0.027
ags+0.026
aces+0.025
my+0.024
icer+0.023
Neg logit contributions
 possible-0.018
fire-0.018
 reality-0.017
eware-0.017
 portal-0.017
Token had
Token ID550
Feature activation+4.24e-03

Pos logit contributions
 possible+0.283
fire+0.265
 reality+0.260
 portal+0.253
eware+0.250
Neg logit contributions
ags-0.397
c-0.384
aces-0.361
icer-0.346
my-0.343
Token taken
Token ID2077
Feature activation+1.06e-01

Pos logit contributions
fire+0.007
eware+0.007
 reality+0.007
 possible+0.007
 portal+0.007
Neg logit contributions
c-0.008
ags-0.008
aces-0.007
my-0.007
rell-0.006
Token it
Token ID340
Feature activation-5.26e-02

Pos logit contributions
 possible+0.173
fire+0.168
 reality+0.166
lan+0.162
 portal+0.161
Neg logit contributions
ags-0.201
c-0.198
aces-0.182
my-0.175
rell-0.161
Token It
Token ID632
Feature activation-5.19e-02

Pos logit contributions
ags+0.011
c+0.010
aces+0.010
icer+0.009
my+0.009
Neg logit contributions
 possible-0.015
fire-0.015
 reality-0.014
 portal-0.014
eware-0.014
Token had
Token ID550
Feature activation-1.10e-03

Pos logit contributions
ags+0.114
c+0.097
aces+0.097
icer+0.093
aters+0.091
Neg logit contributions
 possible-0.079
fire-0.075
 portal-0.074
 unexpected-0.074
 reality-0.072
Token her
Token ID607
Feature activation-8.26e-02

Pos logit contributions
ags+0.002
c+0.002
aces+0.002
my+0.002
 hair+0.002
Neg logit contributions
fire-0.002
 possible-0.002
eware-0.002
 reality-0.002
 portal-0.002
Token name
Token ID1438
Feature activation-3.90e-02

Pos logit contributions
ags+0.143
c+0.135
aces+0.124
icer+0.124
my+0.122
Neg logit contributions
 possible-0.157
 reality-0.146
 portal-0.141
 unexpected-0.141
 traveler-0.140
Token,
Token ID11
Feature activation+4.94e-02

Pos logit contributions
ags+0.075
c+0.072
aces+0.066
icer+0.066
my+0.063
Neg logit contributions
 possible-0.065
fire-0.061
 reality-0.059
 portal-0.059
eware-0.058
Token her
Token ID607
Feature activation-8.25e-02

Pos logit contributions
 possible+0.068
 reality+0.065
fire+0.062
 portal+0.061
 unexpected+0.060
Neg logit contributions
ags-0.110
c-0.103
aces-0.097
icer-0.096
rell-0.092
Token picture
Token ID4286
Feature activation-7.08e-02

Pos logit contributions
ags+0.146
c+0.137
aces+0.126
icer+0.123
my+0.121
Neg logit contributions
 possible-0.151
 reality-0.144
fire-0.142
 portal-0.141
 traveler-0.138
Token,
Token ID11
Feature activation+4.92e-02

Pos logit contributions
ags+0.133
c+0.130
aces+0.122
my+0.115
icer+0.113
Neg logit contributions
 possible-0.119
fire-0.113
 reality-0.112
 portal-0.108
eware-0.108
Token and
Token ID290
Feature activation-9.36e-02

Pos logit contributions
 possible+0.083
 reality+0.079
fire+0.077
 portal+0.076
 traveler+0.074
Neg logit contributions
ags-0.093
c-0.085
icer-0.080
aces-0.080
rell-0.075
Token some
Token ID617
Feature activation-3.11e-02

Pos logit contributions
c+0.204
ags+0.204
aces+0.194
my+0.186
icer+0.177
Neg logit contributions
 possible-0.123
fire-0.122
eware-0.114
 reality-0.114
 portal-0.111
Token words
Token ID2456
Feature activation+1.51e-02

Pos logit contributions
ags+0.048
c+0.046
aces+0.043
my+0.040
icer+0.039
Neg logit contributions
 possible-0.063
fire-0.060
 reality-0.059
 portal-0.058
eware-0.058
TokenWhy
Token ID5195
Feature activation+1.01e-01

Pos logit contributions
ags+0.144
c+0.138
aces+0.131
icer+0.123
my+0.121
Neg logit contributions
 possible-0.113
fire-0.108
 reality-0.103
lan-0.102
 portal-0.101
Token are
Token ID389
Feature activation-1.04e-01

Pos logit contributions
 possible+0.202
eware+0.193
 reality+0.191
 unexpected+0.186
fire+0.185
Neg logit contributions
ags-0.165
c-0.156
aces-0.139
icer-0.138
aters-0.128
Token you
Token ID345
Feature activation+9.00e-02

Pos logit contributions
ags+0.213
c+0.197
aces+0.194
my+0.189
icer+0.183
Neg logit contributions
fire-0.156
 portal-0.152
 reality-0.148
 possible-0.145
 traveler-0.140
Token fighting
Token ID4330
Feature activation-8.11e-02

Pos logit contributions
 possible+0.133
 reality+0.124
 unexpected+0.120
fire+0.117
eware+0.115
Neg logit contributions
ags-0.185
c-0.181
aces-0.171
icer-0.161
aters-0.157
Token over
Token ID625
Feature activation-6.12e-02

Pos logit contributions
c+0.161
ags+0.158
aces+0.152
icer+0.144
aters+0.134
Neg logit contributions
 possible-0.129
 unexpected-0.117
 reality-0.116
fire-0.114
 portal-0.112
Token the
Token ID262
Feature activation-6.34e-02

Pos logit contributions
ags+0.125
c+0.121
aces+0.114
icer+0.108
my+0.107
Neg logit contributions
 possible-0.094
fire-0.088
 reality-0.088
 unexpected-0.084
 portal-0.084
Token beads
Token ID36116
Feature activation-3.91e-02

Pos logit contributions
ags+0.099
c+0.094
aces+0.085
icer+0.082
my+0.079
Neg logit contributions
 possible-0.128
 reality-0.122
fire-0.122
 portal-0.120
 unexpected-0.118
Token?
Token ID30
Feature activation+2.04e-02

Pos logit contributions
ags+0.075
c+0.072
aces+0.068
icer+0.066
my+0.063
Neg logit contributions
 possible-0.065
fire-0.060
 reality-0.059
 portal-0.057
 unexpected-0.057
Token You
Token ID921
Feature activation-2.41e-02

Pos logit contributions
 possible+0.038
fire+0.036
 reality+0.036
 portal+0.035
eware+0.035
Neg logit contributions
ags-0.035
c-0.033
aces-0.031
icer-0.029
my-0.029
Token have
Token ID423
Feature activation-2.23e-02

Pos logit contributions
ags+0.042
c+0.039
aces+0.036
icer+0.035
aters+0.033
Neg logit contributions
 possible-0.045
fire-0.042
 unexpected-0.042
 reality-0.042
eware-0.041
Token enough
Token ID1576
Feature activation+2.09e-01

Pos logit contributions
ags+0.047
c+0.046
aces+0.043
my+0.042
icer+0.041
Neg logit contributions
 possible-0.032
fire-0.031
 reality-0.030
 portal-0.029
eware-0.029
Token straight
Token ID3892
Feature activation+8.65e-02

Pos logit contributions
 possible+0.151
fire+0.140
 reality+0.140
 portal+0.135
 unexpected+0.133
Neg logit contributions
ags-0.211
c-0.205
aces-0.194
icer-0.185
my-0.179
Token away
Token ID1497
Feature activation+2.42e-02

Pos logit contributions
 possible+0.164
fire+0.162
 reality+0.155
eware+0.150
lan+0.149
Neg logit contributions
ags-0.153
c-0.144
aces-0.131
my-0.126
icer-0.123
Token to
Token ID284
Feature activation+5.96e-02

Pos logit contributions
eware+0.050
fire+0.049
 reality+0.047
lan+0.047
 traveler+0.047
Neg logit contributions
ags-0.033
aces-0.030
 hands-0.029
c-0.029
my-0.029
Token ask
Token ID1265
Feature activation+3.54e-03

Pos logit contributions
 possible+0.107
eware+0.105
lan+0.102
fire+0.098
 reality+0.096
Neg logit contributions
ags-0.109
c-0.101
my-0.095
aces-0.088
 hands-0.084
Token her
Token ID607
Feature activation-7.00e-02

Pos logit contributions
fire+0.008
 possible+0.008
eware+0.007
 portal+0.007
 reality+0.007
Neg logit contributions
ags-0.005
c-0.004
my-0.004
aces-0.004
icer-0.003
Token mum
Token ID25682
Feature activation+2.54e-02

Pos logit contributions
ags+0.080
c+0.080
my+0.079
 hands+0.073
 hair+0.069
Neg logit contributions
fire-0.156
 possible-0.152
 portal-0.149
eware-0.146
fall-0.145
Token a
Token ID257
Feature activation+1.17e-01

Pos logit contributions
 possible+0.038
fire+0.036
 reality+0.035
 portal+0.034
eware+0.034
Neg logit contributions
ags-0.053
c-0.051
aces-0.048
icer-0.046
my-0.045
Token question
Token ID1808
Feature activation-2.51e-02

Pos logit contributions
 possible+0.208
lan+0.191
 reality+0.185
fire+0.184
eware+0.184
Neg logit contributions
c-0.187
my-0.174
ags-0.159
 whip-0.150
 -0.149
Token.
Token ID13
Feature activation+1.73e-02

Pos logit contributions
ags+0.049
c+0.048
aces+0.046
my+0.044
icer+0.042
Neg logit contributions
 possible-0.039
fire-0.038
 reality-0.037
 portal-0.036
eware-0.036
Token "
Token ID366
Feature activation-5.73e-02

Pos logit contributions
 possible+0.035
fire+0.034
 reality+0.033
 portal+0.033
eware+0.033
Neg logit contributions
ags-0.027
c-0.026
aces-0.024
icer-0.022
my-0.022
TokenWhat
Token ID2061
Feature activation+1.40e-01

Pos logit contributions
ags+0.109
c+0.105
aces+0.098
icer+0.093
my+0.093
Neg logit contributions
 possible-0.097
fire-0.092
 reality-0.091
 portal-0.088
eware-0.088
Token I
Token ID314
Feature activation+1.33e-01

Pos logit contributions
fire+0.081
 possible+0.077
 reality+0.070
 traveler+0.069
fall+0.069
Neg logit contributions
c-0.100
aces-0.092
ags-0.092
my-0.090
 hair-0.084
Token have
Token ID423
Feature activation-1.48e-02

Pos logit contributions
 possible+0.259
fire+0.251
 reality+0.243
lan+0.237
 portal+0.235
Neg logit contributions
c-0.217
ags-0.211
aces-0.191
my-0.189
icer-0.182
Token a
Token ID257
Feature activation+1.10e-01

Pos logit contributions
c+0.029
ags+0.028
aces+0.027
my+0.025
icer+0.024
Neg logit contributions
fire-0.025
 possible-0.024
 traveler-0.022
 portal-0.022
eware-0.022
Token new
Token ID649
Feature activation+3.23e-03

Pos logit contributions
 possible+0.212
fire+0.209
 reality+0.197
eware+0.195
lan+0.193
Neg logit contributions
c-0.177
ags-0.176
aces-0.155
my-0.154
icer-0.149
Token pen
Token ID3112
Feature activation-1.15e-01

Pos logit contributions
 possible+0.007
 reality+0.006
fire+0.006
 portal+0.006
eware+0.006
Neg logit contributions
ags-0.005
c-0.005
aces-0.004
icer-0.004
my-0.004
Token.
Token ID13
Feature activation-1.30e-03

Pos logit contributions
c+0.206
ags+0.206
aces+0.203
my+0.185
aters+0.173
Neg logit contributions
 possible-0.190
 reality-0.187
 unexpected-0.186
fire-0.184
eware-0.179
Token It
Token ID632
Feature activation-5.48e-02

Pos logit contributions
ags+0.002
c+0.002
aces+0.002
icer+0.002
my+0.002
Neg logit contributions
 possible-0.003
fire-0.002
 reality-0.002
 portal-0.002
eware-0.002
Token is
Token ID318
Feature activation-3.74e-02

Pos logit contributions
ags+0.084
c+0.074
aces+0.070
icer+0.067
my+0.063
Neg logit contributions
 possible-0.114
 portal-0.108
fire-0.107
 reality-0.106
 unexpected-0.105
Token very
Token ID845
Feature activation-1.79e-02

Pos logit contributions
ags+0.068
c+0.066
aces+0.061
my+0.058
icer+0.057
Neg logit contributions
fire-0.064
 possible-0.064
eware-0.061
 reality-0.061
 portal-0.060
Token good
Token ID922
Feature activation+2.36e-02

Pos logit contributions
ags+0.028
c+0.027
aces+0.024
my+0.023
icer+0.022
Neg logit contributions
fire-0.035
 possible-0.034
 reality-0.033
eware-0.033
 traveler-0.032
Token.
Token ID13
Feature activation-2.19e-03

Pos logit contributions
fire+0.044
 possible+0.043
 reality+0.043
 portal+0.042
 unexpected+0.042
Neg logit contributions
ags-0.040
aces-0.038
c-0.037
my-0.035
aters-0.033
Tokenkin
Token ID5116
Feature activation-6.84e-02

Pos logit contributions
ags+0.057
c+0.056
aces+0.050
icer+0.050
my+0.049
Neg logit contributions
 possible-0.057
fire-0.056
 reality-0.054
 portal-0.053
lan-0.052
Token and
Token ID290
Feature activation-9.93e-02

Pos logit contributions
ags+0.122
c+0.117
aces+0.109
icer+0.105
my+0.101
Neg logit contributions
 possible-0.123
fire-0.117
 reality-0.114
 portal-0.113
eware-0.111
Token tries
Token ID8404
Feature activation-1.43e-02

Pos logit contributions
ags+0.161
c+0.156
aces+0.143
my+0.135
icer+0.134
Neg logit contributions
 possible-0.194
fire-0.185
 reality-0.182
 portal-0.179
eware-0.178
Token to
Token ID284
Feature activation+4.78e-02

Pos logit contributions
ags+0.035
c+0.034
aces+0.033
icer+0.032
my+0.031
Neg logit contributions
 possible-0.016
fire-0.014
 reality-0.014
eware-0.013
 portal-0.013
Token wipe
Token ID19916
Feature activation-1.24e-01

Pos logit contributions
 possible+0.090
 reality+0.084
fire+0.084
eware+0.083
 portal+0.083
Neg logit contributions
ags-0.083
c-0.079
my-0.072
aces-0.071
icer-0.068
Token her
Token ID607
Feature activation-8.23e-02

Pos logit contributions
ags+0.302
c+0.292
aces+0.291
icer+0.269
aters+0.264
Neg logit contributions
 possible-0.134
 reality-0.112
 unexpected-0.111
fire-0.108
lan-0.104
Token shirt
Token ID10147
Feature activation-2.01e-01

Pos logit contributions
c+0.119
ags+0.114
aces+0.110
my+0.109
igh+0.109
Neg logit contributions
 possible-0.147
 unexpected-0.145
 clues-0.145
fire-0.143
 reality-0.143
Token,
Token ID11
Feature activation+5.07e-02

Pos logit contributions
ags+0.448
c+0.427
icer+0.411
aces+0.391
aters+0.381
Neg logit contributions
 possible-0.270
fire-0.249
 reality-0.232
 unexpected-0.229
eware-0.223
Token but
Token ID475
Feature activation-1.10e-01

Pos logit contributions
 possible+0.051
 miracle+0.045
 unexpected+0.044
 reality+0.044
 that+0.044
Neg logit contributions
ags-0.127
c-0.120
icer-0.119
aces-0.118
aters-0.114
Token it
Token ID340
Feature activation-5.60e-02

Pos logit contributions
ags+0.268
c+0.259
aces+0.245
icer+0.238
my+0.233
Neg logit contributions
 possible-0.131
fire-0.117
 reality-0.116
 portal-0.111
 unexpected-0.111
Token is
Token ID318
Feature activation-3.97e-02

Pos logit contributions
ags+0.127
c+0.105
aces+0.104
aters+0.101
icer+0.101
Neg logit contributions
 possible-0.079
 unexpected-0.076
fire-0.075
eware-0.073
 reality-0.073
Token pieces
Token ID5207
Feature activation-1.19e-01

Pos logit contributions
 possible+0.147
fire+0.142
 reality+0.140
 portal+0.137
 unexpected+0.136
Neg logit contributions
ags-0.118
c-0.110
aces-0.103
icer-0.097
my-0.093
Token and
Token ID290
Feature activation-8.18e-02

Pos logit contributions
ags+0.245
c+0.238
aces+0.222
icer+0.214
my+0.211
Neg logit contributions
 possible-0.184
fire-0.172
 reality-0.168
 portal-0.163
 unexpected-0.161
Token they
Token ID484
Feature activation+8.56e-02

Pos logit contributions
c+0.151
ags+0.143
my+0.138
aces+0.136
icer+0.125
Neg logit contributions
 possible-0.138
 portal-0.129
eware-0.129
fire-0.129
 reality-0.127
Token made
Token ID925
Feature activation-4.16e-02

Pos logit contributions
fire+0.129
 possible+0.129
 reality+0.125
 unexpected+0.121
lan+0.121
Neg logit contributions
ags-0.185
c-0.161
aces-0.156
icer-0.154
aters-0.145
Token a
Token ID257
Feature activation+1.14e-01

Pos logit contributions
ags+0.086
c+0.083
aces+0.078
icer+0.075
my+0.074
Neg logit contributions
 possible-0.063
fire-0.059
 reality-0.058
 unexpected-0.056
lan-0.056
Token y
Token ID331
Feature activation-1.54e-01

Pos logit contributions
 possible+0.190
fire+0.180
 reality+0.174
eware+0.174
 portal+0.172
Neg logit contributions
ags-0.214
c-0.211
aces-0.191
icer-0.187
my-0.187
Tokenummy
Token ID13513
Feature activation+1.01e-01

Pos logit contributions
ags+0.309
c+0.291
icer+0.273
my+0.267
aces+0.266
Neg logit contributions
 possible-0.247
fire-0.235
 portal-0.227
 reality-0.227
eware-0.224
Token apple
Token ID17180
Feature activation-8.32e-02

Pos logit contributions
 possible+0.173
fire+0.166
 reality+0.161
 unexpected+0.157
 portal+0.156
Neg logit contributions
ags-0.190
c-0.180
aces-0.172
icer-0.162
my-0.156
Token pie
Token ID2508
Feature activation-8.42e-02

Pos logit contributions
c+0.132
ags+0.131
aces+0.120
my+0.114
icer+0.109
Neg logit contributions
 possible-0.159
 reality-0.155
 portal-0.152
fire-0.152
eware-0.151
Token.
Token ID13
Feature activation+1.33e-02

Pos logit contributions
c+0.154
ags+0.154
aces+0.144
my+0.138
icer+0.131
Neg logit contributions
 possible-0.140
 reality-0.138
 portal-0.135
eware-0.134
fire-0.131
Token But
Token ID887
Feature activation-8.45e-03

Pos logit contributions
 possible+0.017
fire+0.017
eware+0.016
 reality+0.016
 portal+0.016
Neg logit contributions
ags-0.029
c-0.028
aces-0.026
icer-0.026
my-0.025
Token One
Token ID1881
Feature activation-2.06e-02

Pos logit contributions
 Halloween+0.012
 we+0.012
fall+0.012
 they+0.012
 university+0.011
Neg logit contributions
aces-0.035
ags-0.034
wy-0.034
igh-0.033
ez-0.033
Token day
Token ID1110
Feature activation+2.89e-02

Pos logit contributions
ags+0.057
aces+0.055
aters+0.049
c+0.048
icer+0.048
Neg logit contributions
 possible-0.020
 unexpected-0.019
fire-0.019
 miracle-0.018
eware-0.018
Token,
Token ID11
Feature activation+6.29e-02

Pos logit contributions
 possible+0.043
fire+0.041
 reality+0.041
 portal+0.039
 unexpected+0.038
Neg logit contributions
ags-0.060
c-0.059
aces-0.055
my-0.054
icer-0.053
Token her
Token ID607
Feature activation-7.35e-02

Pos logit contributions
 possible+0.086
fire+0.081
 reality+0.080
 portal+0.077
eware+0.076
Neg logit contributions
ags-0.139
c-0.136
aces-0.128
icer-0.122
my-0.121
Token mom
Token ID1995
Feature activation-4.10e-02

Pos logit contributions
ags+0.153
c+0.151
aces+0.141
icer+0.134
my+0.129
Neg logit contributions
 possible-0.108
 reality-0.103
fire-0.100
 unexpected-0.099
eware-0.099
Tokenmy
Token ID1820
Feature activation-1.78e-01

Pos logit contributions
ags+0.051
aces+0.040
icer+0.031
c+0.029
agged+0.027
Neg logit contributions
 planned-0.102
 possible-0.101
 soon-0.099
 unexpected-0.097
 granted-0.096
Token asked
Token ID1965
Feature activation-4.02e-02

Pos logit contributions
ags+0.341
c+0.318
aces+0.304
icer+0.286
my+0.269
Neg logit contributions
 possible-0.302
fire-0.289
 reality-0.282
eware-0.276
 unexpected-0.274
Token her
Token ID607
Feature activation-7.31e-02

Pos logit contributions
ags+0.059
c+0.057
aces+0.050
my+0.050
icer+0.047
Neg logit contributions
 possible-0.087
fire-0.084
 reality-0.081
 portal-0.081
 traveler-0.079
Token to
Token ID284
Feature activation+5.49e-02

Pos logit contributions
my+0.129
c+0.125
ags+0.113
aces+0.111
 hands+0.105
Neg logit contributions
fire-0.129
 possible-0.123
 portal-0.121
 reality-0.117
fall-0.112
Token add
Token ID751
Feature activation-2.62e-02

Pos logit contributions
 possible+0.090
 reality+0.083
fire+0.083
 portal+0.082
eware+0.082
Neg logit contributions
ags-0.108
c-0.103
my-0.095
aces-0.094
icer-0.090
Token some
Token ID617
Feature activation-2.83e-02

Pos logit contributions
ags+0.054
c+0.053
aces+0.050
icer+0.048
my+0.047
Neg logit contributions
 possible-0.039
 unexpected-0.035
 reality-0.034
lan-0.033
fire-0.033
Token.
Token ID13
Feature activation-1.71e-02

Pos logit contributions
 possible+0.171
 reality+0.168
fire+0.167
 portal+0.165
 traveler+0.162
Neg logit contributions
ags-0.172
c-0.169
my-0.157
aces-0.156
icer-0.137
Token She
Token ID1375
Feature activation+3.14e-02

Pos logit contributions
ags+0.028
c+0.028
aces+0.025
icer+0.024
my+0.024
Neg logit contributions
 possible-0.032
fire-0.031
 reality-0.030
 portal-0.030
eware-0.030
Token takes
Token ID2753
Feature activation+9.41e-03

Pos logit contributions
 possible+0.058
 reality+0.057
 portal+0.056
 unexpected+0.056
eware+0.055
Neg logit contributions
ags-0.051
c-0.048
aces-0.045
icer-0.042
my-0.041
Token her
Token ID607
Feature activation-8.47e-02

Pos logit contributions
 possible+0.014
 reality+0.013
fire+0.013
lan+0.013
 unexpected+0.013
Neg logit contributions
ags-0.019
c-0.019
aces-0.018
icer-0.017
my-0.017
Token new
Token ID649
Feature activation+2.23e-03

Pos logit contributions
ags+0.113
c+0.100
icer+0.099
aces+0.098
rell+0.090
Neg logit contributions
 possible-0.182
 reality-0.174
 portal-0.173
 unexpected-0.173
 clues-0.169
Token doll
Token ID3654
Feature activation-1.92e-01

Pos logit contributions
 possible+0.005
 portal+0.005
 reality+0.005
 unexpected+0.005
fire+0.005
Neg logit contributions
ags-0.002
c-0.002
aces-0.002
icer-0.002
my-0.002
Token and
Token ID290
Feature activation-1.07e-01

Pos logit contributions
ags+0.355
c+0.330
aces+0.312
icer+0.306
my+0.287
Neg logit contributions
 possible-0.334
fire-0.321
 reality-0.310
 portal-0.309
eware-0.304
Token runs
Token ID4539
Feature activation+9.62e-02

Pos logit contributions
ags+0.169
c+0.164
aces+0.150
my+0.141
icer+0.140
Neg logit contributions
 possible-0.213
fire-0.204
 reality-0.201
 portal-0.196
eware-0.196
Token away
Token ID1497
Feature activation+1.57e-02

Pos logit contributions
 reality+0.213
 possible+0.211
fire+0.201
 portal+0.193
 unexpected+0.191
Neg logit contributions
ags-0.141
c-0.125
my-0.121
aces-0.110
 hair-0.106
Token.
Token ID13
Feature activation-2.03e-02

Pos logit contributions
 reality+0.034
eware+0.033
fire+0.033
 traveler+0.032
 possible+0.032
Neg logit contributions
ags-0.020
c-0.018
aces-0.017
my-0.016
 hair-0.016
Token She
Token ID1375
Feature activation+2.89e-02

Pos logit contributions
ags+0.033
c+0.032
aces+0.029
icer+0.028
my+0.028
Neg logit contributions
 possible-0.040
fire-0.038
 reality-0.037
 portal-0.037
eware-0.036
Token,
Token ID11
Feature activation+6.41e-02

Pos logit contributions
 possible+0.063
eware+0.062
fire+0.062
 portal+0.061
 reality+0.060
Neg logit contributions
ags-0.068
c-0.067
aces-0.061
my-0.060
icer-0.055
Token and
Token ID290
Feature activation-9.33e-02

Pos logit contributions
 possible+0.110
fire+0.106
 reality+0.102
 portal+0.099
lan+0.098
Neg logit contributions
ags-0.120
c-0.114
aces-0.108
icer-0.106
aters-0.097
Token it
Token ID340
Feature activation-5.35e-02

Pos logit contributions
c+0.190
ags+0.178
my+0.174
aces+0.172
 hands+0.155
Neg logit contributions
 possible-0.135
eware-0.129
 portal-0.127
 reality-0.125
fire-0.125
Token looked
Token ID3114
Feature activation+1.97e-02

Pos logit contributions
ags+0.094
c+0.092
aces+0.085
icer+0.080
my+0.080
Neg logit contributions
 possible-0.097
fire-0.092
 reality-0.091
 portal-0.089
eware-0.088
Token so
Token ID523
Feature activation+1.97e-02

Pos logit contributions
 possible+0.035
fire+0.035
eware+0.034
 traveler+0.033
 portal+0.033
Neg logit contributions
ags-0.035
c-0.033
aces-0.031
my-0.030
icer-0.028
Token y
Token ID331
Feature activation-1.54e-01

Pos logit contributions
eware+0.035
fire+0.035
 traveler+0.035
 possible+0.034
 portal+0.034
Neg logit contributions
ags-0.033
c-0.031
aces-0.029
my-0.028
icer-0.025
Tokenummy
Token ID13513
Feature activation+9.82e-02

Pos logit contributions
ags+0.268
c+0.233
aces+0.232
aters+0.221
icer+0.221
Neg logit contributions
 portal-0.294
 traveler-0.293
 reality-0.291
 possible-0.288
fire-0.281
Token.
Token ID13
Feature activation-3.09e-03

Pos logit contributions
eware+0.174
 portal+0.172
 reality+0.165
 traveler+0.159
fire+0.158
Neg logit contributions
c-0.156
aces-0.151
ags-0.150
my-0.148
 hands-0.134
Token
Token ID198
Feature activation-1.70e-02

Pos logit contributions
ags+0.005
c+0.005
aces+0.005
icer+0.005
my+0.005
Neg logit contributions
 possible-0.006
fire-0.005
 reality-0.005
 portal-0.005
eware-0.005
Token
Token ID198
Feature activation-1.81e-02

Pos logit contributions
ags+0.032
c+0.030
aces+0.029
icer+0.027
my+0.026
Neg logit contributions
 possible-0.029
fire-0.028
 reality-0.027
 unexpected-0.026
 portal-0.026
TokenSuddenly
Token ID38582
Feature activation+1.47e-01

Pos logit contributions
ags+0.037
c+0.036
aces+0.034
my+0.033
icer+0.032
Neg logit contributions
 possible-0.027
fire-0.026
 reality-0.025
 portal-0.025
eware-0.025
Token dragged
Token ID17901
Feature activation+4.52e-02

Pos logit contributions
 possible+0.066
fire+0.064
 reality+0.062
 portal+0.062
eware+0.061
Neg logit contributions
ags-0.077
c-0.072
aces-0.067
icer-0.065
my-0.062
Token the
Token ID262
Feature activation-6.00e-02

Pos logit contributions
 possible+0.067
fire+0.062
 reality+0.060
eware+0.059
lan+0.059
Neg logit contributions
ags-0.094
c-0.092
aces-0.087
icer-0.082
my-0.081
Token stool
Token ID38753
Feature activation-1.17e-01

Pos logit contributions
ags+0.110
c+0.107
aces+0.097
my+0.090
icer+0.090
Neg logit contributions
 possible-0.107
fire-0.102
 reality-0.098
eware-0.098
 portal-0.097
Token to
Token ID284
Feature activation+5.30e-02

Pos logit contributions
ags+0.273
c+0.264
aces+0.251
icer+0.244
my+0.233
Neg logit contributions
 possible-0.147
fire-0.135
 reality-0.128
 portal-0.128
eware-0.126
Token the
Token ID262
Feature activation-5.97e-02

Pos logit contributions
 possible+0.096
fire+0.092
eware+0.091
 reality+0.091
 portal+0.090
Neg logit contributions
ags-0.094
c-0.090
my-0.081
aces-0.080
icer-0.079
Token cup
Token ID6508
Feature activation-1.55e-01

Pos logit contributions
ags+0.086
c+0.085
my+0.076
icer+0.076
aces+0.076
Neg logit contributions
 possible-0.125
 reality-0.118
fire-0.117
eware-0.115
 portal-0.114
Tokenboard
Token ID3526
Feature activation-3.20e-02

Pos logit contributions
ags+0.245
c+0.235
aces+0.215
icer+0.203
my+0.199
Neg logit contributions
 possible-0.309
fire-0.299
 reality-0.291
 portal-0.288
eware-0.285
Token and
Token ID290
Feature activation-9.08e-02

Pos logit contributions
ags+0.057
c+0.056
aces+0.052
my+0.049
icer+0.048
Neg logit contributions
 possible-0.057
fire-0.054
 reality-0.054
eware-0.052
 portal-0.052
Token climbed
Token ID19952
Feature activation+1.25e-02

Pos logit contributions
ags+0.156
c+0.155
aces+0.143
my+0.138
icer+0.133
Neg logit contributions
 possible-0.168
 reality-0.158
fire-0.157
 portal-0.152
eware-0.152
Token on
Token ID319
Feature activation-9.82e-02

Pos logit contributions
 possible+0.024
fire+0.023
 reality+0.022
 portal+0.022
lan+0.022
Neg logit contributions
ags-0.021
c-0.020
aces-0.019
icer-0.017
my-0.017
Token it
Token ID340
Feature activation-5.21e-02

Pos logit contributions
ags+0.205
c+0.200
aces+0.187
icer+0.180
my+0.180
Neg logit contributions
 possible-0.144
fire-0.138
 reality-0.135
eware-0.132
 portal-0.132