TOP ACTIVATIONS
MAX = 0.168

mad and said , " U gh , what are you
for you ." U ma carefully took the key
first friend said , " U gh , this part is
stream ers . U ma jumped up and down
it out . U nexpected ly , Jenny discovered
sprang open ! U ma was amazed at the
it . U ps et about the poor
colorful door . U ma was a 3 year
. They say , " U gly !" " Pretty !"
it ! Enc our aged by her success
!" " Pretty !" " U gly !" " Pretty !"
! â Jim said anx iously . His
Then , Jol ene , Little Johnny 's
no time ." Enc our aged by the eagle
smile . Mary anx iously opened up the pass
Emily , you look AM AZ ING !" Emily was so
, a gentle girl named Ci ara stumbled upon a lovely
disappear ?" His mother smiled knowingly and said , " Well
are you ?" she shouted anx iously .
The old man smiled knowingly and said , " I

MIN ACTIVATIONS
MIN = -0.144

all the books on the books he lf .
hidden behind the old broken books he lf .
the books back on the books he lf , smiling proudly
build a nest on this roof , but I can 't
knocked some books off the books he lf . Mom my
Jane wanted to fill the pages with her amazing journey .
saw many books on the books he lf . He wanted
big red ball behind the books he lf . Tom tried
put it back on the books he lf so we can
be up high on the roof , but they both enjoyed
he was sorting out the books they were holding . "
and accidentally knocked over the books he lf .
cat is hiding behind the books he lf . The cat
cozy nest on the green roof . From that day on
behind the tall pile of books in the corner . Quick
take a march around the books he lf like he had
something interesting happening on the books he lf near the river
all the books off the books he lf and started to
match the pictures on the pages . Tom saw
to start un loading the books he noticed a little girl

INTERVAL 0.090 - 0.168
CONTAINS 1.324%

, " Don 't worry , I found a lot
safe and sound . <|endoftext|> L ily and Ben were going
lot of fun . <|endoftext|> Tim and Tom were twins who
worry disappeared , and she smiled . Da isy
Mom said . She was shocked and worried . She took

INTERVAL 0.012 - 0.090
CONTAINS 36.020%

's mom brought it to her and asked , " Here
box , and told her the box contained her treasure .
L ily started to cry , but a kind lady
noise . Tim said , " Max , please
. " Ju ice is y ummy ," Anna

INTERVAL -0.066 - 0.012
CONTAINS 58.575%

that it was no longer making noise . "
She hugged her t eddy bear and went to sleep in
a broken button ! The little frog looked up
Ben had a shield . " Look , there
, even if it 's just a little bird . <|endoftext|>

INTERVAL -0.144 - -0.066
CONTAINS 4.081%

so happy that he had a friend . He was glad
startled and quickly curled into a ball . The monkey smiled
a th ud in the sand ! Amy was
a big pool of water to play in ! Everyone had
bird wanted to cross the river , but it did not
Token mad
Token ID8805
Feature activation+1.12e-01

Pos logit contributions
elling+0.088
lee+0.088
ishment+0.087
ealing+0.085
atted+0.085
Neg logit contributions
fire-0.077
 distance-0.075
 usual-0.072
 he-0.071
|-0.071
Token and
Token ID290
Feature activation-2.44e-02

Pos logit contributions
elling+0.264
lee+0.262
atted+0.261
els+0.256
ishment+0.255
Neg logit contributions
 he-0.154
fire-0.149
 distance-0.141
 ride-0.136
 usual-0.129
Token said
Token ID531
Feature activation+7.28e-02

Pos logit contributions
fire+0.040
 distance+0.038
 usual+0.036
|+0.035
 ride+0.035
Neg logit contributions
lee-0.052
elling-0.052
atted-0.051
ealing-0.051
ishment-0.050
Token,
Token ID11
Feature activation+2.57e-02

Pos logit contributions
elling+0.174
ishment+0.173
lee+0.171
atted+0.170
ealing+0.167
Neg logit contributions
fire-0.102
 he-0.089
 usual-0.089
 distance-0.088
|-0.084
Token "
Token ID366
Feature activation+4.34e-02

Pos logit contributions
lee+0.053
atted+0.053
elling+0.052
ealing+0.051
ishment+0.051
Neg logit contributions
fire-0.045
 distance-0.042
 usual-0.040
|-0.040
 ride-0.039
TokenU
Token ID52
Feature activation+1.68e-01

Pos logit contributions
lee+0.104
atted+0.103
elling+0.101
ealing+0.101
ishment+0.099
Neg logit contributions
fire-0.062
 distance-0.058
|-0.055
 usual-0.054
To-0.053
Tokengh
Token ID456
Feature activation+1.39e-01

Pos logit contributions
ishment+0.362
lee+0.355
atted+0.347
elling+0.344
arted+0.340
Neg logit contributions
fire-0.236
 ride-0.221
 rest-0.220
 usual-0.214
 he-0.205
Token,
Token ID11
Feature activation+4.08e-02

Pos logit contributions
elling+0.320
ishment+0.315
lee+0.313
atted+0.307
 opinions+0.300
Neg logit contributions
fire-0.202
 usual-0.183
 ride-0.178
 he-0.173
 distance-0.171
Token what
Token ID644
Feature activation-1.46e-02

Pos logit contributions
atted+0.095
lee+0.094
elling+0.092
ishment+0.092
ealing+0.092
Neg logit contributions
fire-0.061
 distance-0.054
 he-0.053
 ride-0.053
 usual-0.052
Token are
Token ID389
Feature activation+2.35e-02

Pos logit contributions
fire+0.023
 distance+0.022
 usual+0.021
|+0.021
 ride+0.020
Neg logit contributions
lee-0.032
atted-0.032
elling-0.032
ealing-0.031
ishment-0.031
Token you
Token ID345
Feature activation+1.99e-02

Pos logit contributions
lee+0.054
atted+0.054
elling+0.053
ealing+0.052
ishment+0.052
Neg logit contributions
fire-0.036
 distance-0.033
 usual-0.031
|-0.031
 ride-0.030
Token for
Token ID329
Feature activation-3.32e-02

Pos logit contributions
fire+0.165
|+0.157
 distance+0.155
 usual+0.146
 hunt+0.139
Neg logit contributions
elling-0.260
lee-0.259
atted-0.257
ishment-0.250
ished-0.248
Token you
Token ID345
Feature activation-4.00e-03

Pos logit contributions
fire+0.054
 distance+0.053
|+0.051
tr+0.047
 ride+0.046
Neg logit contributions
lee-0.072
elling-0.067
ishment-0.066
els-0.066
 mistakes-0.066
Token."
Token ID526
Feature activation-1.12e-02

Pos logit contributions
fire+0.005
 distance+0.005
|+0.005
 usual+0.005
 ride+0.004
Neg logit contributions
lee-0.010
atted-0.009
elling-0.009
ealing-0.009
ished-0.009
Token
Token ID198
Feature activation-4.46e-02

Pos logit contributions
fire+0.011
 distance+0.010
 usual+0.009
 hunt+0.008
|+0.008
Neg logit contributions
lee-0.032
atted-0.032
elling-0.031
ealing-0.031
ishment-0.031
Token
Token ID198
Feature activation-3.54e-02

Pos logit contributions
fire+0.050
 distance+0.045
 usual+0.044
 hunt+0.041
 ride+0.038
Neg logit contributions
lee-0.122
atted-0.119
ealing-0.115
elling-0.113
els-0.111
TokenU
Token ID52
Feature activation+1.67e-01

Pos logit contributions
fire+0.047
 distance+0.044
 usual+0.042
|+0.039
 hunt+0.039
Neg logit contributions
lee-0.088
atted-0.087
ealing-0.086
elling-0.085
ishment-0.083
Tokenma
Token ID2611
Feature activation+5.03e-02

Pos logit contributions
lee+0.349
ishment+0.345
elling+0.334
atted+0.327
ubby+0.318
Neg logit contributions
fire-0.265
 usual-0.238
 ride-0.232
|-0.223
To-0.222
Token carefully
Token ID7773
Feature activation+3.48e-02

Pos logit contributions
lee+0.120
elling+0.119
atted+0.119
ealing+0.118
ishment+0.117
Neg logit contributions
fire-0.071
 distance-0.063
|-0.061
 ride-0.060
 usual-0.060
Token took
Token ID1718
Feature activation+4.45e-02

Pos logit contributions
lee+0.079
atted+0.079
elling+0.078
ealing+0.077
ishment+0.076
Neg logit contributions
fire-0.053
 distance-0.049
 usual-0.047
|-0.046
 ride-0.046
Token the
Token ID262
Feature activation-1.00e-02

Pos logit contributions
lee+0.113
atted+0.112
elling+0.111
ealing+0.110
ishment+0.108
Neg logit contributions
fire-0.057
 distance-0.052
 usual-0.048
|-0.048
 hunt-0.046
Token key
Token ID1994
Feature activation-6.09e-02

Pos logit contributions
fire+0.016
|+0.015
 distance+0.015
 usual+0.014
 hunt+0.014
Neg logit contributions
atted-0.021
lee-0.021
elling-0.021
ealing-0.021
ishment-0.020
Token first
Token ID717
Feature activation+6.16e-02

Pos logit contributions
lee+0.047
atted+0.046
elling+0.046
ealing+0.045
ishment+0.045
Neg logit contributions
fire-0.034
|-0.032
 distance-0.031
 usual-0.030
 hunt-0.029
Token friend
Token ID1545
Feature activation+9.92e-04

Pos logit contributions
lee+0.137
atted+0.136
elling+0.134
ealing+0.133
ishment+0.131
Neg logit contributions
fire-0.098
 distance-0.091
|-0.087
 usual-0.087
 ride-0.084
Token said
Token ID531
Feature activation+7.08e-02

Pos logit contributions
lee+0.002
atted+0.002
elling+0.002
ealing+0.002
ishment+0.002
Neg logit contributions
fire-0.002
 distance-0.001
 usual-0.001
|-0.001
 ride-0.001
Token,
Token ID11
Feature activation+2.88e-02

Pos logit contributions
lee+0.154
atted+0.154
elling+0.150
ealing+0.150
ishment+0.146
Neg logit contributions
fire-0.116
 distance-0.108
|-0.103
 usual-0.103
 ride-0.100
Token "
Token ID366
Feature activation+3.90e-02

Pos logit contributions
lee+0.054
atted+0.053
ealing+0.051
elling+0.051
ishment+0.050
Neg logit contributions
fire-0.056
 distance-0.053
|-0.052
 usual-0.051
 hunt-0.050
TokenU
Token ID52
Feature activation+1.65e-01

Pos logit contributions
elling+0.078
lee+0.078
atted+0.078
ealing+0.077
ishment+0.076
Neg logit contributions
fire-0.068
 usual-0.062
 distance-0.061
 hunt-0.060
|-0.060
Tokengh
Token ID456
Feature activation+1.39e-01

Pos logit contributions
lee+0.394
ishment+0.391
elling+0.388
atted+0.382
ished+0.364
Neg logit contributions
fire-0.211
 ride-0.195
 hunt-0.188
 saf-0.187
 usual-0.187
Token,
Token ID11
Feature activation+3.16e-02

Pos logit contributions
lee+0.305
atted+0.302
elling+0.300
ealing+0.295
ishment+0.293
Neg logit contributions
fire-0.222
 distance-0.204
 usual-0.198
|-0.194
 ride-0.191
Token this
Token ID428
Feature activation+1.80e-02

Pos logit contributions
lee+0.071
atted+0.070
elling+0.069
ealing+0.068
ishment+0.067
Neg logit contributions
fire-0.050
 distance-0.046
 usual-0.044
|-0.044
 hunt-0.042
Token part
Token ID636
Feature activation-3.37e-02

Pos logit contributions
lee+0.044
atted+0.044
elling+0.043
ealing+0.043
ishment+0.042
Neg logit contributions
fire-0.025
 distance-0.023
 usual-0.022
|-0.021
 ride-0.021
Token is
Token ID318
Feature activation+6.23e-03

Pos logit contributions
fire+0.062
 distance+0.058
|+0.057
 usual+0.056
 hunt+0.054
Neg logit contributions
atted-0.065
elling-0.064
lee-0.064
ealing-0.062
ished-0.061
Token stream
Token ID4269
Feature activation-3.55e-02

Pos logit contributions
fire+0.069
|+0.066
 distance+0.063
 usual+0.059
 hunt+0.059
Neg logit contributions
lee-0.079
atted-0.075
elling-0.073
 opinions-0.072
els-0.071
Tokeners
Token ID364
Feature activation-2.98e-02

Pos logit contributions
 distance+0.070
fire+0.070
|+0.069
 usual+0.068
 ride+0.066
Neg logit contributions
lee-0.062
atted-0.060
ealing-0.056
elling-0.056
ished-0.056
Token.
Token ID13
Feature activation-5.40e-03

Pos logit contributions
fire+0.043
|+0.043
 distance+0.042
 usual+0.038
To+0.037
Neg logit contributions
lee-0.070
atted-0.067
els-0.063
ealing-0.063
ished-0.063
Token
Token ID198
Feature activation-4.91e-02

Pos logit contributions
fire+0.009
 distance+0.009
 usual+0.008
 hunt+0.008
|+0.008
Neg logit contributions
lee-0.012
atted-0.011
elling-0.011
ealing-0.011
ishment-0.011
Token
Token ID198
Feature activation-3.89e-02

Pos logit contributions
fire+0.055
 distance+0.049
 usual+0.047
 hunt+0.046
 ride+0.041
Neg logit contributions
lee-0.135
atted-0.131
ealing-0.128
elling-0.125
oking-0.124
TokenU
Token ID52
Feature activation+1.62e-01

Pos logit contributions
fire+0.054
 distance+0.050
 usual+0.048
 hunt+0.047
|+0.045
Neg logit contributions
lee-0.094
atted-0.093
ealing-0.092
elling-0.090
oking-0.088
Tokenma
Token ID2611
Feature activation+4.92e-02

Pos logit contributions
lee+0.299
ishment+0.295
elling+0.280
atted+0.279
ealing+0.272
Neg logit contributions
fire-0.302
 usual-0.277
 ride-0.273
|-0.264
 rest-0.259
Token jumped
Token ID11687
Feature activation+6.96e-02

Pos logit contributions
lee+0.111
atted+0.111
elling+0.109
ealing+0.108
ishment+0.107
Neg logit contributions
fire-0.076
 distance-0.070
 usual-0.067
|-0.066
 ride-0.065
Token up
Token ID510
Feature activation+1.65e-02

Pos logit contributions
lee+0.157
atted+0.156
elling+0.151
ealing+0.151
ishment+0.148
Neg logit contributions
fire-0.109
 distance-0.100
 usual-0.097
|-0.097
 hunt-0.093
Token and
Token ID290
Feature activation-2.96e-02

Pos logit contributions
lee+0.031
atted+0.030
ishment+0.028
els+0.028
elling+0.028
Neg logit contributions
fire-0.032
|-0.030
 distance-0.029
 hunt-0.028
 usual-0.028
Token down
Token ID866
Feature activation+1.32e-02

Pos logit contributions
fire+0.056
 distance+0.051
|+0.050
 usual+0.049
 hunt+0.047
Neg logit contributions
lee-0.060
atted-0.059
ished-0.055
ishment-0.054
elling-0.054
Token it
Token ID340
Feature activation-3.61e-02

Pos logit contributions
fire+0.007
 distance+0.006
|+0.006
 usual+0.006
 hunt+0.006
Neg logit contributions
lee-0.013
atted-0.013
elling-0.013
ealing-0.013
ishment-0.013
Token out
Token ID503
Feature activation-5.54e-02

Pos logit contributions
fire+0.066
|+0.063
 distance+0.061
 usual+0.056
To+0.055
Neg logit contributions
ishment-0.069
lee-0.069
atted-0.068
ealing-0.068
ished-0.068
Token.
Token ID13
Feature activation-3.04e-02

Pos logit contributions
fire+0.066
|+0.058
 distance+0.057
 hunt+0.053
 usual+0.053
Neg logit contributions
lee-0.149
atted-0.146
elling-0.144
ealing-0.141
ishment-0.141
Token
Token ID198
Feature activation-6.18e-02

Pos logit contributions
To+0.017
 usual+0.016
 ride+0.016
 he+0.016
 distance+0.015
Neg logit contributions
elling-0.096
ubby-0.095
atted-0.094
ealing-0.093
ished-0.093
Token
Token ID198
Feature activation-4.89e-02

Pos logit contributions
fire+0.059
 distance+0.049
 usual+0.044
|+0.043
 hunt+0.042
Neg logit contributions
lee-0.179
atted-0.177
ealing-0.173
elling-0.173
ishment-0.168
TokenU
Token ID52
Feature activation+1.61e-01

Pos logit contributions
To+0.056
On+0.047
 distance+0.046
 ride+0.046
 usual+0.045
Neg logit contributions
atted-0.139
elling-0.136
ener-0.132
ishment-0.131
ished-0.131
Tokennexpected
Token ID42072
Feature activation+1.02e-01

Pos logit contributions
ishment+0.372
elling+0.361
lee+0.358
atted+0.353
ubby+0.346
Neg logit contributions
 usual-0.184
fire-0.181
 ride-0.180
 he-0.168
 rest-0.166
Tokenly
Token ID306
Feature activation+5.48e-02

Pos logit contributions
ealing+0.166
oking+0.161
 Deal+0.156
elling+0.152
atted+0.148
Neg logit contributions
 he-0.201
 usual-0.193
 ride-0.183
 far-0.178
 well-0.176
Token,
Token ID11
Feature activation+1.44e-02

Pos logit contributions
elling+0.158
ishment+0.152
 opinions+0.151
ished+0.145
atted+0.144
Neg logit contributions
 he-0.056
fire-0.056
 ride-0.047
 usual-0.044
 rest-0.042
Token Jenny
Token ID28437
Feature activation+8.55e-02

Pos logit contributions
elling+0.035
atted+0.034
ealing+0.033
lee+0.033
ished+0.033
Neg logit contributions
 usual-0.020
fire-0.019
 he-0.019
 distance-0.019
 ride-0.018
Token discovered
Token ID5071
Feature activation+3.01e-02

Pos logit contributions
elling+0.209
atted+0.204
lee+0.202
ishment+0.199
ealing+0.198
Neg logit contributions
fire-0.117
 distance-0.105
 he-0.103
 usual-0.102
 ride-0.102
Token sprang
Token ID43908
Feature activation+4.19e-02

Pos logit contributions
fire+0.128
|+0.120
 distance+0.119
 usual+0.113
 hunt+0.110
Neg logit contributions
lee-0.172
atted-0.171
elling-0.164
ealing-0.161
els-0.160
Token open
Token ID1280
Feature activation-6.48e-03

Pos logit contributions
lee+0.098
atted+0.095
elling+0.094
ealing+0.094
ishment+0.093
Neg logit contributions
fire-0.062
 distance-0.056
|-0.055
 usual-0.054
 hunt-0.052
Token!
Token ID0
Feature activation-1.43e-02

Pos logit contributions
fire+0.009
|+0.009
 distance+0.008
 usual+0.008
 hunt+0.008
Neg logit contributions
lee-0.015
atted-0.015
ishment-0.015
elling-0.015
els-0.015
Token
Token ID198
Feature activation-5.38e-02

Pos logit contributions
fire+0.021
 hunt+0.019
 distance+0.019
 usual+0.018
|+0.017
Neg logit contributions
lee-0.035
atted-0.034
elling-0.032
ealing-0.032
ishment-0.031
Token
Token ID198
Feature activation-4.20e-02

Pos logit contributions
fire+0.060
 distance+0.056
 usual+0.054
 hunt+0.053
 ride+0.047
Neg logit contributions
lee-0.146
atted-0.142
ealing-0.138
elling-0.135
oking-0.133
TokenU
Token ID52
Feature activation+1.61e-01

Pos logit contributions
fire+0.060
 distance+0.057
 usual+0.056
 hunt+0.054
|+0.051
Neg logit contributions
lee-0.099
atted-0.096
ealing-0.096
elling-0.093
oking-0.092
Tokenma
Token ID2611
Feature activation+4.78e-02

Pos logit contributions
lee+0.310
ishment+0.307
elling+0.293
atted+0.289
ealing+0.281
Neg logit contributions
fire-0.287
 usual-0.259
 ride-0.255
|-0.245
To-0.242
Token was
Token ID373
Feature activation+5.21e-02

Pos logit contributions
lee+0.114
atted+0.113
elling+0.110
ealing+0.109
ishment+0.107
Neg logit contributions
fire-0.069
 distance-0.064
 usual-0.062
|-0.060
 hunt-0.058
Token amazed
Token ID24562
Feature activation+7.94e-02

Pos logit contributions
lee+0.117
atted+0.116
ealing+0.112
elling+0.111
ishment+0.110
Neg logit contributions
fire-0.083
 distance-0.075
 usual-0.074
|-0.073
 hunt-0.071
Token at
Token ID379
Feature activation-9.29e-03

Pos logit contributions
atted+0.204
ealing+0.203
elling+0.203
lee+0.202
ished+0.197
Neg logit contributions
fire-0.089
 distance-0.087
 he-0.086
 usual-0.083
 ride-0.078
Token the
Token ID262
Feature activation-1.39e-02

Pos logit contributions
fire+0.016
 distance+0.015
 usual+0.015
|+0.015
 ride+0.014
Neg logit contributions
lee-0.019
atted-0.019
elling-0.019
ealing-0.019
ishment-0.018
Token it
Token ID340
Feature activation-3.53e-02

Pos logit contributions
fire+0.046
|+0.046
To+0.041
 distance+0.041
tr+0.039
Neg logit contributions
ishment-0.070
lee-0.069
els-0.068
atted-0.068
elling-0.066
Token.
Token ID13
Feature activation-2.19e-02

Pos logit contributions
fire+0.046
|+0.044
 distance+0.042
 usual+0.039
 ride+0.037
Neg logit contributions
lee-0.086
atted-0.085
ished-0.083
elling-0.082
ealing-0.082
Token
Token ID220
Feature activation+2.73e-02

Pos logit contributions
fire+0.029
 distance+0.026
 usual+0.026
 hunt+0.024
|+0.024
Neg logit contributions
lee-0.056
atted-0.055
ealing-0.053
elling-0.053
ishment-0.052
Token
Token ID198
Feature activation-5.04e-02

Pos logit contributions
lee+0.077
atted+0.076
ealing+0.076
elling+0.075
oking+0.073
Neg logit contributions
fire-0.027
 distance-0.025
 usual-0.022
 hunt-0.022
|-0.021
Token
Token ID198
Feature activation-3.79e-02

Pos logit contributions
fire+0.060
 usual+0.057
 hunt+0.056
 distance+0.055
 ride+0.051
Neg logit contributions
lee-0.127
ealing-0.124
atted-0.123
oking-0.118
elling-0.117
TokenU
Token ID52
Feature activation+1.59e-01

Pos logit contributions
 usual+0.049
fire+0.047
 hunt+0.046
 distance+0.045
|+0.042
Neg logit contributions
ealing-0.091
lee-0.090
atted-0.088
oking-0.087
elling-0.086
Tokenps
Token ID862
Feature activation+5.17e-02

Pos logit contributions
lee+0.270
atted+0.260
ishment+0.258
elling+0.255
ares+0.250
Neg logit contributions
fire-0.327
 usual-0.292
 ride-0.288
 he-0.280
 hunt-0.279
Tokenet
Token ID316
Feature activation+5.15e-02

Pos logit contributions
lee+0.118
atted+0.117
elling+0.113
ealing+0.112
ished+0.110
Neg logit contributions
fire-0.079
 distance-0.074
 usual-0.072
|-0.071
 hunt-0.069
Token about
Token ID546
Feature activation-1.69e-02

Pos logit contributions
lee+0.109
atted+0.108
elling+0.105
ealing+0.105
ishment+0.102
Neg logit contributions
fire-0.088
 distance-0.083
|-0.079
 usual-0.079
 hunt-0.076
Token the
Token ID262
Feature activation-1.33e-02

Pos logit contributions
fire+0.026
|+0.025
 distance+0.025
 usual+0.022
To+0.022
Neg logit contributions
lee-0.039
atted-0.038
ishment-0.036
ealing-0.036
elling-0.036
Token poor
Token ID3595
Feature activation+5.39e-02

Pos logit contributions
fire+0.029
 distance+0.028
 usual+0.027
 ride+0.026
 hunt+0.026
Neg logit contributions
atted-0.022
lee-0.022
ealing-0.021
elling-0.021
ishment-0.021
Token colorful
Token ID20239
Feature activation-6.45e-03

Pos logit contributions
|+0.171
fire+0.134
whe+0.132
 Zig+0.132
seek+0.132
Neg logit contributions
lee-0.171
ealing-0.170
elling-0.167
ubby-0.167
 mistakes-0.164
Token door
Token ID3420
Feature activation-1.14e-01

Pos logit contributions
|+0.011
fire+0.011
 distance+0.010
 usual+0.010
 Zig+0.009
Neg logit contributions
lee-0.013
elling-0.013
els-0.013
ishment-0.012
atted-0.012
Token.
Token ID13
Feature activation-3.52e-02

Pos logit contributions
|+0.166
fire+0.161
 usual+0.147
 distance+0.146
 Zig+0.131
Neg logit contributions
elling-0.255
lee-0.253
atted-0.251
els-0.244
ishment-0.239
Token
Token ID198
Feature activation-6.27e-02

Pos logit contributions
fire+0.072
 distance+0.066
 usual+0.064
|+0.063
 hunt+0.061
Neg logit contributions
lee-0.059
elling-0.058
 opinions-0.057
ishment-0.056
 requests-0.054
Token
Token ID198
Feature activation-5.10e-02

Pos logit contributions
fire+0.091
 distance+0.081
 usual+0.077
 hunt+0.077
tr+0.075
Neg logit contributions
lee-0.127
oking-0.122
 requests-0.121
inding-0.121
atted-0.121
TokenU
Token ID52
Feature activation+1.58e-01

Pos logit contributions
fire+0.067
 usual+0.064
|+0.060
 hunt+0.060
field+0.060
Neg logit contributions
ealing-0.104
ubby-0.104
oking-0.104
lee-0.102
elling-0.101
Tokenma
Token ID2611
Feature activation+4.47e-02

Pos logit contributions
lee+0.366
atted+0.360
elling+0.359
ishment+0.357
ealing+0.347
Neg logit contributions
fire-0.233
 usual-0.207
 ride-0.201
 hunt-0.194
|-0.192
Token was
Token ID373
Feature activation+4.69e-02

Pos logit contributions
lee+0.109
atted+0.108
elling+0.107
ealing+0.107
ishment+0.105
Neg logit contributions
fire-0.061
 distance-0.055
|-0.052
 usual-0.052
 ride-0.051
Token a
Token ID257
Feature activation-4.40e-02

Pos logit contributions
lee+0.119
atted+0.119
ealing+0.116
elling+0.115
ishment+0.112
Neg logit contributions
fire-0.058
 distance-0.053
|-0.052
 usual-0.049
 hunt-0.048
Token 3
Token ID513
Feature activation+3.83e-02

Pos logit contributions
|+0.093
fire+0.088
 distance+0.084
 usual+0.083
 hunt+0.082
Neg logit contributions
ubby-0.067
lee-0.066
ealing-0.066
atted-0.061
elling-0.060
Token year
Token ID614
Feature activation+2.97e-02

Pos logit contributions
lee+0.090
atted+0.089
elling+0.089
ealing+0.088
ishment+0.087
Neg logit contributions
fire-0.056
 distance-0.051
|-0.048
 usual-0.048
 ride-0.047
Token.
Token ID13
Feature activation-8.46e-03

Pos logit contributions
lee+0.078
atted+0.076
elling+0.075
ealing+0.074
ishment+0.073
Neg logit contributions
fire-0.051
 distance-0.047
 usual-0.046
 ride-0.045
|-0.045
Token They
Token ID1119
Feature activation-1.40e-02

Pos logit contributions
fire+0.010
 distance+0.009
|+0.009
 usual+0.009
 ride+0.009
Neg logit contributions
atted-0.022
lee-0.022
elling-0.021
ealing-0.021
ishment-0.021
Token say
Token ID910
Feature activation+7.55e-02

Pos logit contributions
fire+0.025
 distance+0.022
 usual+0.021
|+0.021
 hunt+0.020
Neg logit contributions
lee-0.029
atted-0.029
ealing-0.029
ishment-0.028
elling-0.028
Token,
Token ID11
Feature activation+2.83e-02

Pos logit contributions
elling+0.182
ishment+0.179
atted+0.179
lee+0.178
ished+0.176
Neg logit contributions
fire-0.103
 he-0.094
 usual-0.092
 distance-0.091
 ride-0.091
Token "
Token ID366
Feature activation+4.11e-02

Pos logit contributions
lee+0.057
atted+0.056
elling+0.055
ealing+0.055
ishment+0.053
Neg logit contributions
fire-0.051
 distance-0.048
 usual-0.046
|-0.046
 ride-0.045
TokenU
Token ID52
Feature activation+1.58e-01

Pos logit contributions
lee+0.089
atted+0.088
elling+0.086
ealing+0.086
ishment+0.084
Neg logit contributions
fire-0.069
 distance-0.064
 usual-0.062
|-0.061
 ride-0.060
Tokengly
Token ID10853
Feature activation+9.80e-02

Pos logit contributions
ishment+0.369
lee+0.365
elling+0.354
atted+0.347
arted+0.337
Neg logit contributions
fire-0.191
 ride-0.190
 saf-0.180
 he-0.179
 hunt-0.177
Token!"
Token ID2474
Feature activation+5.10e-03

Pos logit contributions
lee+0.236
atted+0.232
elling+0.228
ishment+0.227
ealing+0.224
Neg logit contributions
fire-0.135
 ride-0.122
 usual-0.119
 hunt-0.118
 distance-0.118
Token "
Token ID366
Feature activation+3.18e-02

Pos logit contributions
lee+0.013
atted+0.013
elling+0.013
ealing+0.013
ishment+0.013
Neg logit contributions
fire-0.006
 distance-0.005
 usual-0.005
|-0.005
 ride-0.005
TokenPretty
Token ID35700
Feature activation-5.14e-03

Pos logit contributions
lee+0.067
atted+0.066
ealing+0.066
elling+0.066
ishment+0.065
Neg logit contributions
fire-0.052
 distance-0.049
 usual-0.048
|-0.046
 hunt-0.046
Token!"
Token ID2474
Feature activation-2.63e-03

Pos logit contributions
fire+0.009
 distance+0.008
|+0.008
 usual+0.008
 hunt+0.007
Neg logit contributions
lee-0.011
elling-0.011
ealing-0.010
atted-0.010
ishment-0.010
Token it
Token ID340
Feature activation-1.96e-02

Pos logit contributions
lee+0.091
atted+0.089
ealing+0.087
elling+0.085
ishment+0.085
Neg logit contributions
fire-0.052
 distance-0.048
|-0.046
 usual-0.044
 hunt-0.043
Token!
Token ID0
Feature activation-2.33e-02

Pos logit contributions
fire+0.031
 distance+0.029
|+0.028
 usual+0.026
 hunt+0.026
Neg logit contributions
lee-0.044
atted-0.042
ealing-0.041
elling-0.041
ished-0.041
Token
Token ID220
Feature activation+2.41e-02

Pos logit contributions
fire+0.024
 distance+0.021
 usual+0.020
|+0.020
 ride+0.019
Neg logit contributions
lee-0.064
atted-0.064
elling-0.063
ealing-0.063
ishment-0.062
Token
Token ID198
Feature activation-5.55e-02

Pos logit contributions
lee+0.070
atted+0.069
ealing+0.069
elling+0.068
ished+0.066
Neg logit contributions
fire-0.023
 distance-0.021
 usual-0.018
 hunt-0.018
|-0.017
Token
Token ID198
Feature activation-4.25e-02

Pos logit contributions
fire+0.054
 distance+0.048
 usual+0.044
 hunt+0.044
 ride+0.042
Neg logit contributions
lee-0.159
atted-0.155
ealing-0.154
elling-0.152
oking-0.147
TokenEnc
Token ID27195
Feature activation+1.57e-01

Pos logit contributions
fire+0.058
 distance+0.053
 usual+0.051
|+0.050
 ride+0.048
Neg logit contributions
lee-0.104
atted-0.103
elling-0.102
ealing-0.101
ishment-0.100
Tokenour
Token ID454
Feature activation+4.17e-02

Pos logit contributions
lee+0.204
elling+0.197
atted+0.192
 opinions+0.188
els+0.187
Neg logit contributions
fire-0.391
 usual-0.360
 ride-0.356
 distance-0.353
|-0.350
Tokenaged
Token ID1886
Feature activation+6.26e-02

Pos logit contributions
elling+0.070
ishment+0.069
ealing+0.067
lee+0.067
atted+0.067
Neg logit contributions
fire-0.088
 distance-0.082
 usual-0.080
 ride-0.079
|-0.079
Token by
Token ID416
Feature activation+1.55e-02

Pos logit contributions
ishment+0.146
elling+0.146
lee+0.145
ealing+0.145
oking+0.142
Neg logit contributions
fire-0.082
 usual-0.078
 distance-0.077
 he-0.075
 far-0.069
Token her
Token ID607
Feature activation+3.60e-02

Pos logit contributions
ealing+0.035
elling+0.035
lee+0.035
atted+0.034
ished+0.034
Neg logit contributions
fire-0.023
 distance-0.022
 usual-0.021
|-0.020
 he-0.020
Token success
Token ID1943
Feature activation+1.66e-02

Pos logit contributions
elling+0.068
atted+0.066
lee+0.066
ealing+0.064
ished+0.064
Neg logit contributions
fire-0.070
 usual-0.067
 distance-0.065
 ride-0.064
 hunt-0.062
Token!"
Token ID2474
Feature activation+5.10e-03

Pos logit contributions
lee+0.236
atted+0.232
elling+0.228
ishment+0.227
ealing+0.224
Neg logit contributions
fire-0.135
 ride-0.122
 usual-0.119
 hunt-0.118
 distance-0.118
Token "
Token ID366
Feature activation+3.18e-02

Pos logit contributions
lee+0.013
atted+0.013
elling+0.013
ealing+0.013
ishment+0.013
Neg logit contributions
fire-0.006
 distance-0.005
 usual-0.005
|-0.005
 ride-0.005
TokenPretty
Token ID35700
Feature activation-5.14e-03

Pos logit contributions
lee+0.067
atted+0.066
ealing+0.066
elling+0.066
ishment+0.065
Neg logit contributions
fire-0.052
 distance-0.049
 usual-0.048
|-0.046
 hunt-0.046
Token!"
Token ID2474
Feature activation-2.63e-03

Pos logit contributions
fire+0.009
 distance+0.008
|+0.008
 usual+0.008
 hunt+0.007
Neg logit contributions
lee-0.011
elling-0.011
ealing-0.010
atted-0.010
ishment-0.010
Token "
Token ID366
Feature activation+2.94e-02

Pos logit contributions
fire+0.003
 distance+0.003
 he+0.002
 ride+0.002
|+0.002
Neg logit contributions
atted-0.007
lee-0.007
elling-0.007
ealing-0.007
ishment-0.007
TokenU
Token ID52
Feature activation+1.54e-01

Pos logit contributions
ealing+0.063
lee+0.063
elling+0.062
atted+0.062
ishment+0.061
Neg logit contributions
fire-0.047
 distance-0.044
 usual-0.043
 hunt-0.041
|-0.041
Tokengly
Token ID10853
Feature activation+9.51e-02

Pos logit contributions
ishment+0.344
lee+0.341
elling+0.331
atted+0.325
ubby+0.313
Neg logit contributions
fire-0.211
 ride-0.207
 hunt-0.194
 he-0.193
 saf-0.193
Token!"
Token ID2474
Feature activation-5.68e-03

Pos logit contributions
lee+0.212
atted+0.209
elling+0.205
ishment+0.202
ealing+0.201
Neg logit contributions
fire-0.150
 distance-0.135
 ride-0.133
 usual-0.132
|-0.131
Token "
Token ID366
Feature activation+2.87e-02

Pos logit contributions
fire+0.006
 distance+0.005
 he+0.005
 ride+0.005
To+0.005
Neg logit contributions
atted-0.015
lee-0.015
elling-0.015
ealing-0.015
ished-0.015
TokenPretty
Token ID35700
Feature activation-4.62e-03

Pos logit contributions
ealing+0.062
lee+0.061
elling+0.061
atted+0.060
oking+0.059
Neg logit contributions
fire-0.045
 distance-0.042
 usual-0.041
 hunt-0.040
|-0.039
Token!"
Token ID2474
Feature activation-9.64e-03

Pos logit contributions
fire+0.008
 distance+0.008
|+0.008
 usual+0.007
 hunt+0.007
Neg logit contributions
elling-0.009
lee-0.009
ealing-0.009
atted-0.009
 opinions-0.009
Token!
Token ID0
Feature activation-6.37e-03

Pos logit contributions
fire+0.042
 distance+0.039
|+0.038
 hunt+0.035
 usual+0.034
Neg logit contributions
lee-0.079
atted-0.076
elling-0.076
ealing-0.075
 opinions-0.072
Tokenâ
Token ID22940
Feature activation+1.03e-01

Pos logit contributions
fire+0.011
 distance+0.011
|+0.010
 usual+0.010
 ride+0.010
Neg logit contributions
lee-0.013
atted-0.013
elling-0.012
ealing-0.012
ishment-0.012
Token
Token ID26391
Feature activation+7.65e-02

Pos logit contributions
lee+0.215
atted+0.213
elling+0.203
ishment+0.202
els+0.202
Neg logit contributions
fire-0.183
 distance-0.168
|-0.167
 ride-0.162
 usual-0.154
Token Jim
Token ID5395
Feature activation+8.10e-02

Pos logit contributions
lee+0.173
ealing+0.170
atted+0.170
elling+0.162
ishment+0.161
Neg logit contributions
fire-0.120
 distance-0.112
 usual-0.108
|-0.104
 hunt-0.101
Token said
Token ID531
Feature activation+6.89e-02

Pos logit contributions
lee+0.177
atted+0.175
elling+0.174
ealing+0.172
ishment+0.170
Neg logit contributions
fire-0.130
 distance-0.120
 usual-0.115
|-0.114
 ride-0.112
Token anx
Token ID7296
Feature activation+1.53e-01

Pos logit contributions
lee+0.147
atted+0.147
ealing+0.144
elling+0.139
ished+0.134
Neg logit contributions
fire-0.116
 distance-0.112
|-0.108
 hunt-0.103
 usual-0.102
Tokeniously
Token ID6819
Feature activation+7.41e-02

Pos logit contributions
elling+0.187
ealing+0.183
oking+0.174
lee+0.164
atted+0.163
Neg logit contributions
fire-0.366
 gall-0.338
 ride-0.337
 usual-0.332
 seek-0.327
Token.
Token ID13
Feature activation-6.99e-03

Pos logit contributions
lee+0.184
atted+0.184
elling+0.182
ealing+0.180
ishment+0.178
Neg logit contributions
fire-0.097
 distance-0.088
 usual-0.084
|-0.081
 ride-0.081
Token
Token ID198
Feature activation-4.51e-02

Pos logit contributions
fire+0.010
 distance+0.009
 usual+0.009
 hunt+0.008
|+0.008
Neg logit contributions
lee-0.017
atted-0.017
ealing-0.016
elling-0.016
ishment-0.016
Token
Token ID198
Feature activation-3.46e-02

Pos logit contributions
fire+0.049
 distance+0.043
 hunt+0.043
 usual+0.043
|+0.038
Neg logit contributions
lee-0.123
atted-0.121
ealing-0.120
elling-0.115
ubby-0.114
TokenHis
Token ID6653
Feature activation+7.27e-02

Pos logit contributions
fire+0.051
 usual+0.048
 hunt+0.047
 distance+0.047
|+0.044
Neg logit contributions
lee-0.079
ealing-0.078
atted-0.077
elling-0.074
oking-0.074
Token
Token ID220
Feature activation+2.04e-02

Pos logit contributions
fire+0.046
 distance+0.043
 usual+0.042
|+0.041
 ride+0.041
Neg logit contributions
atted-0.060
lee-0.060
elling-0.059
ealing-0.058
ishment-0.058
Token
Token ID198
Feature activation-5.95e-02

Pos logit contributions
lee+0.058
atted+0.057
ealing+0.057
elling+0.056
oking+0.055
Neg logit contributions
fire-0.021
 distance-0.018
 hunt-0.016
|-0.016
 usual-0.016
Token
Token ID198
Feature activation-4.66e-02

Pos logit contributions
fire+0.064
 distance+0.056
 hunt+0.053
 usual+0.053
|+0.053
Neg logit contributions
lee-0.163
atted-0.159
ealing-0.158
elling-0.155
oking-0.152
TokenThen
Token ID6423
Feature activation+1.75e-02

Pos logit contributions
fire+0.075
 distance+0.069
 usual+0.066
|+0.066
 ride+0.064
Neg logit contributions
lee-0.103
atted-0.102
elling-0.101
ealing-0.100
ishment-0.098
Token,
Token ID11
Feature activation+1.34e-02

Pos logit contributions
lee+0.037
atted+0.036
ealing+0.035
ishment+0.035
elling+0.035
Neg logit contributions
fire-0.031
|-0.028
 distance-0.028
 usual-0.026
 hunt-0.026
Token Jol
Token ID39329
Feature activation+1.53e-01

Pos logit contributions
lee+0.023
atted+0.022
ealing+0.021
elling+0.021
ishment+0.021
Neg logit contributions
fire-0.029
|-0.028
 distance-0.026
 hunt-0.026
 usual-0.024
Tokenene
Token ID1734
Feature activation+5.42e-02

Pos logit contributions
 agrees+0.268
 opinions+0.266
elling+0.265
ealing+0.261
ishment+0.257
Neg logit contributions
fire-0.266
 usual-0.260
 he-0.257
 ride-0.241
ding-0.240
Token,
Token ID11
Feature activation+2.34e-02

Pos logit contributions
lee+0.122
atted+0.122
elling+0.121
ealing+0.119
ishment+0.118
Neg logit contributions
fire-0.083
 distance-0.077
 usual-0.073
|-0.072
 ride-0.072
Token Little
Token ID7703
Feature activation+1.05e-01

Pos logit contributions
atted+0.050
lee+0.050
ealing+0.046
ishment+0.045
 opinions+0.045
Neg logit contributions
fire-0.040
|-0.039
 distance-0.036
 hunt-0.036
 usual-0.033
Token Johnny
Token ID15470
Feature activation+8.19e-02

Pos logit contributions
atted+0.212
elling+0.208
lee+0.206
ealing+0.205
ubby+0.203
Neg logit contributions
 ride-0.176
fire-0.174
 he-0.173
 distance-0.169
 usual-0.167
Token's
Token ID338
Feature activation+3.84e-02

Pos logit contributions
lee+0.190
atted+0.189
elling+0.187
ealing+0.185
ishment+0.182
Neg logit contributions
fire-0.121
 distance-0.112
 usual-0.107
|-0.105
 ride-0.103
Token no
Token ID645
Feature activation+1.87e-02

Pos logit contributions
fire+0.033
|+0.029
tr+0.026
 hunt+0.026
 usual+0.026
Neg logit contributions
lee-0.049
atted-0.048
ishment-0.047
els-0.046
 opinions-0.046
Token time
Token ID640
Feature activation+2.70e-02

Pos logit contributions
atted+0.036
lee+0.036
ishment+0.035
elling+0.035
ealing+0.035
Neg logit contributions
fire-0.035
|-0.032
To-0.030
 distance-0.029
 usual-0.029
Token."
Token ID526
Feature activation-1.04e-02

Pos logit contributions
atted+0.064
lee+0.063
ealing+0.062
oking+0.059
ished+0.059
Neg logit contributions
fire-0.038
|-0.036
 distance-0.034
 hunt-0.032
 usual-0.032
Token
Token ID198
Feature activation-3.48e-02

Pos logit contributions
fire+0.013
 distance+0.012
 usual+0.011
 hunt+0.011
|+0.011
Neg logit contributions
lee-0.027
atted-0.026
ealing-0.026
ishment-0.025
elling-0.025
Token
Token ID198
Feature activation-2.96e-02

Pos logit contributions
fire+0.039
 distance+0.034
 usual+0.034
 hunt+0.033
 ride+0.030
Neg logit contributions
lee-0.096
atted-0.092
ealing-0.091
elling-0.088
oking-0.087
TokenEnc
Token ID27195
Feature activation+1.53e-01

Pos logit contributions
fire+0.046
 usual+0.044
 distance+0.043
 hunt+0.041
 ride+0.040
Neg logit contributions
lee-0.064
ealing-0.064
atted-0.062
elling-0.060
oking-0.060
Tokenour
Token ID454
Feature activation+4.62e-02

Pos logit contributions
lee+0.207
atted+0.203
elling+0.200
ishment+0.187
ealing+0.187
Neg logit contributions
fire-0.378
 distance-0.352
 usual-0.347
|-0.344
 ride-0.343
Tokenaged
Token ID1886
Feature activation+5.91e-02

Pos logit contributions
elling+0.091
lee+0.090
atted+0.089
ishment+0.088
ealing+0.088
Neg logit contributions
fire-0.084
 distance-0.077
 usual-0.075
 ride-0.074
|-0.074
Token by
Token ID416
Feature activation+6.04e-03

Pos logit contributions
elling+0.138
lee+0.138
ealing+0.137
atted+0.136
ishment+0.134
Neg logit contributions
fire-0.083
 distance-0.077
 usual-0.075
|-0.072
 he-0.072
Token the
Token ID262
Feature activation-7.54e-03

Pos logit contributions
lee+0.014
atted+0.014
elling+0.014
ealing+0.014
ishment+0.013
Neg logit contributions
fire-0.009
 distance-0.009
 usual-0.008
|-0.008
 ride-0.008
Token eagle
Token ID31176
Feature activation-1.65e-02

Pos logit contributions
fire+0.013
|+0.012
 distance+0.011
 hunt+0.011
To+0.011
Neg logit contributions
lee-0.015
elling-0.015
ealing-0.015
atted-0.015
ishment-0.015
Token smile
Token ID8212
Feature activation+1.06e-02

Pos logit contributions
elling+0.032
lee+0.031
atted+0.031
ished+0.030
ishment+0.030
Neg logit contributions
 distance-0.019
fire-0.019
 ride-0.018
 hunt-0.017
 he-0.017
Token.
Token ID13
Feature activation-1.54e-02

Pos logit contributions
lee+0.029
atted+0.029
ealing+0.028
elling+0.028
ishment+0.028
Neg logit contributions
fire-0.012
 distance-0.010
|-0.010
 usual-0.010
 hunt-0.009
Token
Token ID198
Feature activation-4.58e-02

Pos logit contributions
fire+0.017
 distance+0.014
 usual+0.014
|+0.013
 hunt+0.013
Neg logit contributions
lee-0.043
atted-0.042
ealing-0.041
elling-0.041
ishment-0.040
Token
Token ID198
Feature activation-3.49e-02

Pos logit contributions
fire+0.042
 distance+0.037
 usual+0.034
|+0.032
 hunt+0.032
Neg logit contributions
lee-0.135
atted-0.133
ealing-0.130
elling-0.130
ishment-0.126
TokenMary
Token ID24119
Feature activation+6.05e-02

Pos logit contributions
fire+0.056
 distance+0.052
 usual+0.049
|+0.049
 ride+0.048
Neg logit contributions
lee-0.078
atted-0.077
elling-0.076
ealing-0.075
ishment-0.074
Token anx
Token ID7296
Feature activation+1.50e-01

Pos logit contributions
elling+0.138
lee+0.135
ealing+0.135
atted+0.134
ishment+0.133
Neg logit contributions
fire-0.091
 distance-0.082
 ride-0.080
|-0.079
 usual-0.078
Tokeniously
Token ID6819
Feature activation+7.63e-02

Pos logit contributions
ealing+0.218
elling+0.211
oking+0.204
ubby+0.202
ours+0.194
Neg logit contributions
fire-0.299
 ride-0.296
 gall-0.293
 hunt-0.289
 drive-0.287
Token opened
Token ID4721
Feature activation+2.17e-02

Pos logit contributions
elling+0.173
lee+0.171
atted+0.169
ealing+0.167
ishment+0.166
Neg logit contributions
fire-0.116
 distance-0.107
 ride-0.104
 he-0.103
 usual-0.103
Token up
Token ID510
Feature activation+8.76e-03

Pos logit contributions
ealing+0.060
elling+0.059
atted+0.059
lee+0.058
oking+0.058
Neg logit contributions
 distance-0.021
fire-0.021
 usual-0.020
 he-0.020
 ride-0.019
Token the
Token ID262
Feature activation-2.11e-02

Pos logit contributions
lee+0.021
atted+0.021
elling+0.021
ishment+0.020
ealing+0.020
Neg logit contributions
fire-0.012
 distance-0.011
|-0.011
 usual-0.010
 hunt-0.010
Token pass
Token ID1208
Feature activation-3.52e-02

Pos logit contributions
fire+0.034
 distance+0.031
 usual+0.030
|+0.030
 ride+0.029
Neg logit contributions
lee-0.046
atted-0.046
elling-0.045
ealing-0.045
ishment-0.044
Token Emily
Token ID17608
Feature activation+6.98e-02

Pos logit contributions
fire+0.004
 distance+0.004
|+0.004
 usual+0.003
 ride+0.003
Neg logit contributions
atted-0.004
lee-0.004
elling-0.004
ealing-0.004
ishment-0.004
Token,
Token ID11
Feature activation+2.63e-02

Pos logit contributions
lee+0.167
atted+0.167
elling+0.165
ealing+0.163
ishment+0.162
Neg logit contributions
fire-0.097
 distance-0.088
 usual-0.084
|-0.083
 ride-0.082
Token you
Token ID345
Feature activation+2.15e-02

Pos logit contributions
lee+0.061
atted+0.060
elling+0.060
ealing+0.059
ishment+0.058
Neg logit contributions
fire-0.039
 distance-0.036
 usual-0.035
|-0.034
 ride-0.034
Token look
Token ID804
Feature activation+3.70e-02

Pos logit contributions
lee+0.051
atted+0.050
ealing+0.050
elling+0.049
ished+0.048
Neg logit contributions
fire-0.031
 distance-0.029
 usual-0.027
|-0.026
 hunt-0.025
Token AM
Token ID3001
Feature activation+5.48e-02

Pos logit contributions
lee+0.086
atted+0.085
elling+0.084
ealing+0.083
ishment+0.082
Neg logit contributions
fire-0.055
 distance-0.051
 usual-0.048
|-0.048
 ride-0.047
TokenAZ
Token ID22778
Feature activation+1.48e-01

Pos logit contributions
lee+0.113
atted+0.113
elling+0.110
ealing+0.109
ishment+0.109
Neg logit contributions
fire-0.094
 distance-0.088
 usual-0.085
|-0.085
 ride-0.083
TokenING
Token ID2751
Feature activation+2.44e-02

Pos logit contributions
lee+0.262
atted+0.262
elling+0.247
ishment+0.240
ealing+0.239
Neg logit contributions
fire-0.302
|-0.277
 distance-0.275
 ride-0.271
 usual-0.269
Token!"
Token ID2474
Feature activation-7.86e-03

Pos logit contributions
atted+0.055
lee+0.055
elling+0.054
ealing+0.054
ishment+0.053
Neg logit contributions
fire-0.037
 distance-0.034
 usual-0.033
|-0.033
 ride-0.032
Token Emily
Token ID17608
Feature activation+6.71e-02

Pos logit contributions
fire+0.013
 distance+0.012
|+0.012
 usual+0.011
To+0.011
Neg logit contributions
atted-0.017
elling-0.017
ealing-0.016
lee-0.016
ishment-0.016
Token was
Token ID373
Feature activation+4.54e-02

Pos logit contributions
elling+0.148
lee+0.148
atted+0.147
ealing+0.145
ishment+0.144
Neg logit contributions
fire-0.105
 distance-0.097
 usual-0.093
|-0.093
 ride-0.091
Token so
Token ID523
Feature activation+1.45e-02

Pos logit contributions
elling+0.107
atted+0.106
lee+0.106
ealing+0.104
ishment+0.104
Neg logit contributions
fire-0.065
 distance-0.061
 usual-0.058
|-0.057
 ride-0.055
Token,
Token ID11
Feature activation+2.29e-02

Pos logit contributions
lee+0.094
atted+0.093
elling+0.092
ealing+0.091
ishment+0.090
Neg logit contributions
fire-0.067
 distance-0.062
 usual-0.060
|-0.059
 ride-0.057
Token a
Token ID257
Feature activation-2.71e-02

Pos logit contributions
atted+0.057
lee+0.056
ealing+0.055
elling+0.055
ubby+0.054
Neg logit contributions
fire-0.030
 distance-0.028
 hunt-0.026
 usual-0.026
|-0.025
Token gentle
Token ID10296
Feature activation+5.07e-02

Pos logit contributions
fire+0.057
 distance+0.054
|+0.053
 usual+0.051
 hunt+0.051
Neg logit contributions
lee-0.046
ealing-0.045
atted-0.044
ubby-0.044
elling-0.043
Token girl
Token ID2576
Feature activation+3.19e-02

Pos logit contributions
lee+0.099
ubby+0.096
atted+0.095
elling+0.093
ealing+0.091
Neg logit contributions
fire-0.091
|-0.086
 usual-0.083
 distance-0.083
To-0.083
Token named
Token ID3706
Feature activation+3.03e-02

Pos logit contributions
lee+0.085
atted+0.084
elling+0.081
ealing+0.079
els+0.079
Neg logit contributions
fire-0.038
 distance-0.034
 usual-0.033
|-0.031
 hunt-0.030
Token Ci
Token ID37685
Feature activation+1.48e-01

Pos logit contributions
atted+0.069
lee+0.069
ealing+0.067
elling+0.067
ishment+0.067
Neg logit contributions
fire-0.046
 distance-0.043
|-0.042
 usual-0.042
 ride-0.040
Tokenara
Token ID3301
Feature activation+6.38e-02

Pos logit contributions
atted+0.205
lee+0.205
elling+0.194
ealing+0.192
ishment+0.190
Neg logit contributions
fire-0.350
 distance-0.341
|-0.333
 usual-0.332
 ride-0.329
Token stumbled
Token ID24241
Feature activation+3.56e-02

Pos logit contributions
lee+0.152
atted+0.148
elling+0.143
ealing+0.142
ished+0.140
Neg logit contributions
fire-0.096
 distance-0.088
 usual-0.085
|-0.080
 hunt-0.077
Token upon
Token ID2402
Feature activation+1.10e-02

Pos logit contributions
lee+0.084
atted+0.083
ished+0.081
elling+0.079
ubby+0.078
Neg logit contributions
fire-0.055
|-0.045
 hunt-0.045
 distance-0.043
 saf-0.042
Token a
Token ID257
Feature activation-6.29e-02

Pos logit contributions
lee+0.020
elling+0.020
 opinions+0.020
 mistakes+0.020
 requests+0.020
Neg logit contributions
fire-0.021
field-0.017
 distance-0.017
 hunt-0.016
|-0.016
Token lovely
Token ID14081
Feature activation+3.14e-02

Pos logit contributions
fire+0.115
|+0.112
 distance+0.095
whe+0.094
To+0.094
Neg logit contributions
lee-0.117
ubby-0.116
ealing-0.114
elling-0.112
ishment-0.109
Token disappear
Token ID10921
Feature activation+1.76e-02

Pos logit contributions
fire+0.059
 distance+0.052
 usual+0.052
|+0.046
 seek+0.045
Neg logit contributions
atted-0.080
ealing-0.080
lee-0.080
ubby-0.078
ished-0.077
Token?"
Token ID1701
Feature activation-1.05e-02

Pos logit contributions
lee+0.035
elling+0.035
atted+0.035
ealing+0.034
ishment+0.034
Neg logit contributions
fire-0.031
 distance-0.029
|-0.028
 usual-0.027
 ride-0.027
Token His
Token ID2399
Feature activation+7.89e-02

Pos logit contributions
fire+0.021
 distance+0.019
|+0.019
 usual+0.019
 hunt+0.018
Neg logit contributions
lee-0.020
atted-0.020
elling-0.019
ealing-0.018
ishment-0.018
Token mother
Token ID2802
Feature activation+2.13e-02

Pos logit contributions
elling+0.189
atted+0.184
ealing+0.183
lee+0.182
ishment+0.181
Neg logit contributions
 usual-0.110
 ride-0.107
 distance-0.104
fire-0.104
 he-0.096
Token smiled
Token ID13541
Feature activation+1.00e-01

Pos logit contributions
elling+0.045
ishment+0.045
ealing+0.044
lee+0.043
ubby+0.042
Neg logit contributions
fire-0.033
 he-0.032
 ride-0.032
 gall-0.030
 distance-0.030
Token knowingly
Token ID26526
Feature activation+1.48e-01

Pos logit contributions
elling+0.244
atted+0.240
lee+0.238
els+0.235
ealing+0.233
Neg logit contributions
 distance-0.129
 he-0.127
fire-0.123
 ride-0.119
 far-0.116
Token and
Token ID290
Feature activation-1.52e-02

Pos logit contributions
elling+0.360
lee+0.359
atted+0.355
ished+0.349
ealing+0.349
Neg logit contributions
fire-0.186
 distance-0.184
 he-0.173
 far-0.161
 ride-0.156
Token said
Token ID531
Feature activation+6.92e-02

Pos logit contributions
fire+0.024
 he+0.023
 ride+0.022
 distance+0.022
 usual+0.022
Neg logit contributions
elling-0.033
lee-0.032
ealing-0.032
atted-0.032
ubby-0.032
Token,
Token ID11
Feature activation+2.13e-02

Pos logit contributions
lee+0.159
elling+0.157
atted+0.157
ishment+0.156
ealing+0.154
Neg logit contributions
fire-0.105
 distance-0.093
 usual-0.092
 he-0.089
 ride-0.089
Token "
Token ID366
Feature activation+3.51e-02

Pos logit contributions
lee+0.043
atted+0.043
elling+0.042
ealing+0.042
ishment+0.041
Neg logit contributions
fire-0.038
 distance-0.036
 usual-0.034
|-0.034
 ride-0.033
TokenWell
Token ID5779
Feature activation+5.92e-02

Pos logit contributions
elling+0.073
lee+0.071
atted+0.070
ishment+0.069
 opinions+0.069
Neg logit contributions
fire-0.058
 hunt-0.053
|-0.051
 distance-0.051
 usual-0.050
Token are
Token ID389
Feature activation+5.08e-03

Pos logit contributions
lee+0.018
atted+0.018
elling+0.017
ealing+0.017
ished+0.017
Neg logit contributions
fire-0.014
 distance-0.013
 usual-0.012
|-0.012
 ride-0.012
Token you
Token ID345
Feature activation+1.45e-02

Pos logit contributions
lee+0.010
atted+0.010
elling+0.010
ealing+0.010
ishment+0.009
Neg logit contributions
fire-0.010
 distance-0.009
|-0.008
 hunt-0.008
 usual-0.008
Token?"
Token ID1701
Feature activation-2.81e-02

Pos logit contributions
lee+0.034
atted+0.034
elling+0.033
ealing+0.033
ishment+0.032
Neg logit contributions
fire-0.021
 distance-0.020
 usual-0.019
 ride-0.019
 hunt-0.019
Token she
Token ID673
Feature activation+2.18e-03

Pos logit contributions
fire+0.035
|+0.031
 usual+0.029
 distance+0.029
 hunt+0.026
Neg logit contributions
lee-0.074
atted-0.072
ishment-0.069
elling-0.069
ealing-0.068
Token shouted
Token ID17293
Feature activation+8.06e-02

Pos logit contributions
elling+0.004
atted+0.004
lee+0.004
ealing+0.004
ishment+0.004
Neg logit contributions
fire-0.004
 distance-0.004
 usual-0.003
|-0.003
 ride-0.003
Token anx
Token ID7296
Feature activation+1.47e-01

Pos logit contributions
lee+0.204
atted+0.202
ealing+0.198
elling+0.197
ishment+0.194
Neg logit contributions
fire-0.105
 distance-0.096
 usual-0.091
|-0.091
 hunt-0.086
Tokeniously
Token ID6819
Feature activation+6.18e-02

Pos logit contributions
elling+0.187
ealing+0.184
oking+0.176
ubby+0.164
atted+0.163
Neg logit contributions
fire-0.360
 hunt-0.336
 ride-0.328
 gall-0.325
 he-0.322
Token.
Token ID13
Feature activation-2.09e-02

Pos logit contributions
lee+0.174
atted+0.173
elling+0.172
ealing+0.169
ishment+0.168
Neg logit contributions
fire-0.062
 distance-0.055
 usual-0.051
|-0.049
 ride-0.048
Token
Token ID220
Feature activation+2.76e-02

Pos logit contributions
fire+0.035
 distance+0.032
|+0.031
 usual+0.031
 hunt+0.029
Neg logit contributions
lee-0.046
atted-0.045
ealing-0.043
elling-0.043
ishment-0.043
Token
Token ID198
Feature activation-5.80e-02

Pos logit contributions
lee+0.077
atted+0.077
ealing+0.076
elling+0.075
oking+0.073
Neg logit contributions
fire-0.028
 distance-0.025
 usual-0.022
|-0.022
 hunt-0.022
Token
Token ID198
Feature activation-4.62e-02

Pos logit contributions
fire+0.073
 distance+0.065
 usual+0.062
 hunt+0.057
|+0.055
Neg logit contributions
lee-0.148
atted-0.142
ealing-0.139
oking-0.135
elling-0.135
Token
Token ID198
Feature activation-4.35e-02

Pos logit contributions
fire+0.078
 hunt+0.074
 usual+0.071
 distance+0.071
tr+0.066
Neg logit contributions
lee-0.123
ealing-0.120
atted-0.117
oking-0.113
elling-0.109
TokenThe
Token ID464
Feature activation+1.82e-02

Pos logit contributions
fire+0.074
 hunt+0.073
 usual+0.071
 distance+0.068
|+0.067
Neg logit contributions
ealing-0.083
oking-0.080
lee-0.079
inding-0.077
elling-0.077
Token old
Token ID1468
Feature activation+5.30e-02

Pos logit contributions
lee+0.029
atted+0.028
elling+0.028
ealing+0.028
ishment+0.027
Neg logit contributions
fire-0.041
|-0.038
 distance-0.038
 usual-0.038
 hunt-0.037
Token man
Token ID582
Feature activation+1.37e-02

Pos logit contributions
lee+0.120
atted+0.118
elling+0.116
ealing+0.116
ishment+0.114
Neg logit contributions
fire-0.083
 distance-0.076
|-0.075
 usual-0.074
 hunt-0.070
Token smiled
Token ID13541
Feature activation+9.85e-02

Pos logit contributions
atted+0.030
lee+0.030
ealing+0.028
elling+0.028
ished+0.027
Neg logit contributions
fire-0.023
 distance-0.021
 usual-0.021
|-0.021
 hunt-0.020
Token knowingly
Token ID26526
Feature activation+1.47e-01

Pos logit contributions
lee+0.215
atted+0.213
elling+0.211
ealing+0.208
ishment+0.205
Neg logit contributions
fire-0.160
 distance-0.149
 usual-0.142
|-0.138
 ride-0.138
Token and
Token ID290
Feature activation-1.79e-02

Pos logit contributions
lee+0.353
elling+0.351
atted+0.349
ealing+0.341
ished+0.337
Neg logit contributions
fire-0.208
 distance-0.192
 he-0.176
 usual-0.170
 ride-0.166
Token said
Token ID531
Feature activation+6.44e-02

Pos logit contributions
fire+0.037
|+0.034
 distance+0.034
 usual+0.034
 hunt+0.032
Neg logit contributions
lee-0.033
atted-0.032
ealing-0.030
elling-0.030
ishment-0.029
Token,
Token ID11
Feature activation+2.00e-02

Pos logit contributions
lee+0.137
atted+0.136
ealing+0.133
elling+0.131
ishment+0.127
Neg logit contributions
fire-0.109
 distance-0.104
|-0.100
 usual-0.098
 hunt-0.095
Token "
Token ID366
Feature activation+4.09e-02

Pos logit contributions
lee+0.036
atted+0.035
ealing+0.034
elling+0.033
ishment+0.032
Neg logit contributions
fire-0.040
 distance-0.038
|-0.038
 usual-0.037
 hunt-0.036
TokenI
Token ID40
Feature activation-1.21e-02

Pos logit contributions
lee+0.086
elling+0.086
atted+0.085
ealing+0.084
ishment+0.083
Neg logit contributions
fire-0.068
 distance-0.062
 usual-0.061
|-0.060
 hunt-0.060
Token all
Token ID477
Feature activation-5.95e-02

Pos logit contributions
fire+0.010
|+0.010
 distance+0.009
 hunt+0.009
 ride+0.008
Neg logit contributions
lee-0.018
atted-0.018
ishment-0.018
els-0.017
elling-0.017
Token the
Token ID262
Feature activation-3.13e-02

Pos logit contributions
fire+0.062
 distance+0.057
 usual+0.053
|+0.052
 ride+0.050
Neg logit contributions
lee-0.164
atted-0.163
ealing-0.163
elling-0.161
ishment-0.159
Token books
Token ID3835
Feature activation-1.28e-01

Pos logit contributions
fire+0.050
 distance+0.049
 usual+0.047
|+0.047
 hunt+0.047
Neg logit contributions
atted-0.070
lee-0.070
ealing-0.069
elling-0.068
ishment-0.066
Token on
Token ID319
Feature activation-5.32e-02

Pos logit contributions
fire+0.181
 distance+0.175
|+0.173
 saf+0.162
 gall+0.159
Neg logit contributions
ished-0.279
atted-0.279
lee-0.278
elling-0.277
ishment-0.276
Token the
Token ID262
Feature activation-4.23e-02

Pos logit contributions
fire+0.073
|+0.066
 distance+0.065
 hunt+0.059
 ride+0.058
Neg logit contributions
lee-0.128
els-0.126
ishment-0.125
elling-0.121
atted-0.121
Token books
Token ID3835
Feature activation-1.44e-01

Pos logit contributions
fire+0.077
 distance+0.069
|+0.068
 usual+0.065
To+0.063
Neg logit contributions
lee-0.083
elling-0.081
ishment-0.079
atted-0.078
ealing-0.078
Tokenhe
Token ID258
Feature activation-3.36e-02

Pos logit contributions
fire+0.163
|+0.151
 distance+0.150
 saf+0.144
 gall+0.143
Neg logit contributions
elling-0.344
ishment-0.344
ished-0.341
lee-0.339
oking-0.333
Tokenlf
Token ID1652
Feature activation-9.05e-02

Pos logit contributions
fire+0.050
 distance+0.048
|+0.046
 usual+0.043
 ride+0.042
Neg logit contributions
lee-0.075
els-0.073
atted-0.073
 opinions-0.070
ishment-0.070
Token.
Token ID13
Feature activation-8.26e-03

Pos logit contributions
fire+0.118
 distance+0.116
|+0.115
To+0.103
 ride+0.099
Neg logit contributions
lee-0.215
atted-0.210
ishment-0.209
ished-0.208
els-0.205
Token
Token ID198
Feature activation-5.26e-02

Pos logit contributions
fire+0.014
 distance+0.013
 usual+0.012
 ride+0.012
|+0.012
Neg logit contributions
lee-0.018
atted-0.018
elling-0.017
ealing-0.017
ishment-0.017
Token
Token ID198
Feature activation-4.24e-02

Pos logit contributions
fire+0.059
 distance+0.052
 usual+0.050
 hunt+0.047
|+0.046
Neg logit contributions
lee-0.139
atted-0.136
ealing-0.135
elling-0.132
oking-0.130
Token hidden
Token ID7104
Feature activation+3.38e-02

Pos logit contributions
lee+0.028
atted+0.027
 opinions+0.026
ished+0.026
ealing+0.026
Neg logit contributions
fire-0.016
 distance-0.013
|-0.013
 ride-0.012
 usual-0.012
Token behind
Token ID2157
Feature activation-3.28e-02

Pos logit contributions
ealing+0.076
lee+0.074
elling+0.074
atted+0.073
oking+0.072
Neg logit contributions
fire-0.050
 usual-0.049
 distance-0.049
 he-0.045
|-0.045
Token the
Token ID262
Feature activation-3.54e-02

Pos logit contributions
fire+0.045
 distance+0.038
|+0.037
To+0.034
 ride+0.034
Neg logit contributions
lee-0.083
atted-0.081
elling-0.079
els-0.078
ishment-0.078
Token old
Token ID1468
Feature activation+8.30e-03

Pos logit contributions
fire+0.061
|+0.060
To+0.054
 usual+0.053
 gall+0.053
Neg logit contributions
lee-0.071
elling-0.069
ealing-0.062
els-0.062
ubby-0.061
Token broken
Token ID5445
Feature activation-1.26e-03

Pos logit contributions
lee+0.018
elling+0.017
atted+0.017
ishment+0.017
ealing+0.016
Neg logit contributions
fire-0.013
|-0.013
 usual-0.012
 distance-0.012
To-0.011
Token books
Token ID3835
Feature activation-1.43e-01

Pos logit contributions
fire+0.002
|+0.002
 usual+0.002
 distance+0.002
 ride+0.002
Neg logit contributions
lee-0.003
elling-0.003
atted-0.003
ishment-0.002
ealing-0.002
Tokenhe
Token ID258
Feature activation-3.01e-02

Pos logit contributions
fire+0.233
|+0.220
 distance+0.206
 usual+0.202
 saf+0.202
Neg logit contributions
elling-0.288
lee-0.287
atted-0.279
els-0.278
ishment-0.277
Tokenlf
Token ID1652
Feature activation-8.73e-02

Pos logit contributions
fire+0.040
 distance+0.037
|+0.036
 usual+0.034
 ride+0.032
Neg logit contributions
lee-0.075
atted-0.072
els-0.071
elling-0.071
ishment-0.070
Token.
Token ID13
Feature activation-3.63e-02

Pos logit contributions
fire+0.131
|+0.127
 distance+0.124
To+0.114
 usual+0.113
Neg logit contributions
lee-0.195
atted-0.190
ishment-0.185
els-0.183
ished-0.183
Token
Token ID198
Feature activation-6.37e-02

Pos logit contributions
 he+0.027
+0.026
 usual+0.024
 distance+0.024
To+0.024
Neg logit contributions
els-0.108
elling-0.107
atted-0.106
ishment-0.103
ener-0.103
Token
Token ID198
Feature activation-4.99e-02

Pos logit contributions
fire+0.047
 distance+0.039
 usual+0.035
|+0.035
 ride+0.032
Neg logit contributions
lee-0.196
atted-0.195
elling-0.193
ealing-0.191
ishment-0.189
Token the
Token ID262
Feature activation-1.79e-02

Pos logit contributions
fire+0.013
 distance+0.011
|+0.011
 usual+0.011
 hunt+0.010
Neg logit contributions
lee-0.024
atted-0.024
elling-0.024
ealing-0.024
ishment-0.024
Token books
Token ID3835
Feature activation-1.31e-01

Pos logit contributions
fire+0.027
|+0.023
 distance+0.022
field+0.020
 usual+0.020
Neg logit contributions
elling-0.041
lee-0.040
ealing-0.040
els-0.040
atted-0.039
Token back
Token ID736
Feature activation-1.79e-02

Pos logit contributions
fire+0.193
|+0.185
Ro+0.181
 hunt+0.175
 saf+0.173
Neg logit contributions
ished-0.268
elling-0.265
atted-0.264
lee-0.261
ealing-0.260
Token on
Token ID319
Feature activation-3.80e-02

Pos logit contributions
fire+0.025
 distance+0.023
|+0.022
 hunt+0.020
To+0.020
Neg logit contributions
atted-0.044
lee-0.043
ished-0.042
elling-0.042
els-0.042
Token the
Token ID262
Feature activation-2.92e-02

Pos logit contributions
fire+0.057
|+0.055
 hunt+0.048
 distance+0.048
tr+0.047
Neg logit contributions
lee-0.084
ishment-0.083
els-0.082
atted-0.079
ening-0.079
Token books
Token ID3835
Feature activation-1.42e-01

Pos logit contributions
fire+0.050
|+0.044
 distance+0.042
To+0.041
field+0.039
Neg logit contributions
lee-0.060
elling-0.059
ishment-0.056
 opinions-0.055
atted-0.055
Tokenhe
Token ID258
Feature activation-3.73e-02

Pos logit contributions
fire+0.186
|+0.178
 usual+0.163
Ro+0.161
 distance+0.158
Neg logit contributions
lee-0.315
ement-0.313
elling-0.305
ishment-0.305
ished-0.304
Tokenlf
Token ID1652
Feature activation-9.14e-02

Pos logit contributions
fire+0.057
 distance+0.053
|+0.052
 usual+0.049
To+0.046
Neg logit contributions
lee-0.084
atted-0.080
els-0.080
ishment-0.077
 opinions-0.076
Token,
Token ID11
Feature activation-1.25e-03

Pos logit contributions
|+0.135
fire+0.133
 distance+0.128
To+0.119
 usual+0.116
Neg logit contributions
lee-0.206
atted-0.199
ished-0.191
ishment-0.190
els-0.187
Token smiling
Token ID16755
Feature activation+8.03e-02

Pos logit contributions
fire+0.001
|+0.001
 distance+0.001
 usual+0.001
 hunt+0.001
Neg logit contributions
lee-0.003
atted-0.003
ishment-0.003
ealing-0.003
 opinions-0.003
Token proudly
Token ID23017
Feature activation+3.17e-02

Pos logit contributions
lee+0.180
atted+0.177
ealing+0.174
elling+0.172
ishment+0.170
Neg logit contributions
fire-0.128
 distance-0.120
|-0.117
 usual-0.113
 hunt-0.110
Token build
Token ID1382
Feature activation-3.55e-02

Pos logit contributions
fire+0.087
 distance+0.079
|+0.079
 usual+0.078
ding+0.069
Neg logit contributions
lee-0.128
 opinions-0.126
elling-0.122
els-0.120
 requests-0.118
Token a
Token ID257
Feature activation-8.90e-02

Pos logit contributions
fire+0.046
 distance+0.043
 usual+0.040
|+0.040
 ride+0.037
Neg logit contributions
lee-0.090
atted-0.088
elling-0.088
ealing-0.088
ishment-0.087
Token nest
Token ID16343
Feature activation-1.03e-01

Pos logit contributions
|+0.165
fire+0.161
 usual+0.144
 distance+0.141
 he+0.136
Neg logit contributions
 mistakes-0.161
lee-0.160
elling-0.158
ishment-0.157
ealing-0.155
Token on
Token ID319
Feature activation-6.21e-02

Pos logit contributions
fire+0.175
|+0.175
 usual+0.174
 distance+0.169
To+0.164
Neg logit contributions
elling-0.192
atted-0.180
ubby-0.180
lee-0.178
ished-0.176
Token this
Token ID428
Feature activation-4.91e-02

Pos logit contributions
fire+0.104
|+0.096
 distance+0.095
 usual+0.091
 ride+0.087
Neg logit contributions
elling-0.129
els-0.127
lee-0.124
ishment-0.122
 mistakes-0.120
Token roof
Token ID9753
Feature activation-1.40e-01

Pos logit contributions
|+0.078
fire+0.074
To+0.064
 he+0.061
 distance+0.061
Neg logit contributions
elling-0.103
lee-0.097
ishment-0.096
els-0.094
ubby-0.094
Token,
Token ID11
Feature activation+7.14e-03

Pos logit contributions
fire+0.242
|+0.219
 usual+0.219
 distance+0.216
To+0.211
Neg logit contributions
elling-0.285
atted-0.274
lee-0.270
ishment-0.261
 opinions-0.260
Token but
Token ID475
Feature activation+2.21e-03

Pos logit contributions
lee+0.016
atted+0.016
elling+0.016
ealing+0.015
ishment+0.015
Neg logit contributions
fire-0.011
 distance-0.010
 usual-0.010
|-0.010
 ride-0.010
Token I
Token ID314
Feature activation-2.68e-02

Pos logit contributions
lee+0.005
atted+0.005
elling+0.005
ealing+0.005
ishment+0.005
Neg logit contributions
fire-0.003
 distance-0.003
|-0.003
 usual-0.003
 ride-0.003
Token can
Token ID460
Feature activation-2.69e-02

Pos logit contributions
fire+0.046
 distance+0.043
 usual+0.041
|+0.040
 ride+0.038
Neg logit contributions
lee-0.056
atted-0.056
ealing-0.055
elling-0.054
ished-0.053
Token't
Token ID470
Feature activation-1.53e-02

Pos logit contributions
fire+0.035
 distance+0.034
|+0.032
 usual+0.031
 saf+0.027
Neg logit contributions
lee-0.066
elling-0.064
atted-0.063
els-0.063
ealing-0.062
Token knocked
Token ID13642
Feature activation+2.90e-02

Pos logit contributions
fire+0.057
|+0.054
 distance+0.050
 hunt+0.049
 saf+0.046
Neg logit contributions
atted-0.073
lee-0.072
elling-0.069
ishment-0.068
ealing-0.068
Token some
Token ID617
Feature activation-4.94e-02

Pos logit contributions
lee+0.063
elling+0.063
atted+0.063
els+0.062
ished+0.060
Neg logit contributions
fire-0.050
 hunt-0.044
 distance-0.041
|-0.041
 saf-0.040
Token books
Token ID3835
Feature activation-1.16e-01

Pos logit contributions
fire+0.094
|+0.091
field+0.081
 distance+0.079
 hunt+0.078
Neg logit contributions
elling-0.098
els-0.095
lee-0.093
 mistakes-0.091
atted-0.091
Token off
Token ID572
Feature activation-4.77e-02

Pos logit contributions
fire+0.196
|+0.177
 saf+0.171
 seek+0.171
 usual+0.170
Neg logit contributions
elling-0.233
ares-0.232
atted-0.229
els-0.226
ished-0.225
Token the
Token ID262
Feature activation-2.78e-02

Pos logit contributions
fire+0.080
|+0.077
 hunt+0.073
 seek+0.072
 distance+0.069
Neg logit contributions
lee-0.100
els-0.098
elling-0.097
atted-0.092
ubby-0.089
Token books
Token ID3835
Feature activation-1.39e-01

Pos logit contributions
fire+0.049
|+0.044
 distance+0.040
To+0.039
field+0.038
Neg logit contributions
lee-0.056
elling-0.055
 mistakes-0.052
ishment-0.051
 requests-0.051
Tokenhe
Token ID258
Feature activation-3.08e-02

Pos logit contributions
fire+0.200
 seek+0.179
|+0.175
 usual+0.173
 saf+0.167
Neg logit contributions
elling-0.309
lee-0.304
ares-0.300
els-0.295
atted-0.293
Tokenlf
Token ID1652
Feature activation-8.36e-02

Pos logit contributions
fire+0.049
 distance+0.046
|+0.045
 usual+0.041
 ride+0.040
Neg logit contributions
lee-0.069
els-0.066
atted-0.065
elling-0.062
ished-0.061
Token.
Token ID13
Feature activation-9.50e-03

Pos logit contributions
fire+0.125
|+0.121
 distance+0.115
 seek+0.108
To+0.105
Neg logit contributions
lee-0.188
atted-0.184
ished-0.176
els-0.176
elling-0.170
Token Mom
Token ID11254
Feature activation+5.29e-02

Pos logit contributions
fire+0.016
 distance+0.016
|+0.015
 usual+0.015
 hunt+0.014
Neg logit contributions
lee-0.020
atted-0.019
elling-0.019
ealing-0.018
 opinions-0.018
Tokenmy
Token ID1820
Feature activation+5.79e-02

Pos logit contributions
lee+0.105
atted+0.104
elling+0.102
ealing+0.102
ishment+0.100
Neg logit contributions
fire-0.096
 distance-0.091
 usual-0.087
|-0.087
 ride-0.085
TokenJane
Token ID41083
Feature activation+6.91e-02

Pos logit contributions
fire+0.082
 distance+0.077
|+0.073
 usual+0.072
 ride+0.070
Neg logit contributions
atted-0.113
lee-0.112
elling-0.110
ealing-0.109
ishment-0.109
Token wanted
Token ID2227
Feature activation+4.19e-02

Pos logit contributions
elling+0.168
ealing+0.166
lee+0.166
atted+0.165
ishment+0.164
Neg logit contributions
fire-0.094
 distance-0.084
 ride-0.081
 usual-0.081
 hunt-0.080
Token to
Token ID284
Feature activation-6.23e-02

Pos logit contributions
lee+0.106
elling+0.102
atted+0.102
 opinions+0.101
els+0.100
Neg logit contributions
fire-0.058
|-0.051
 distance-0.048
 ride-0.042
 gall-0.041
Token fill
Token ID6070
Feature activation-1.91e-02

Pos logit contributions
fire+0.104
|+0.091
 distance+0.091
 usual+0.085
ding+0.080
Neg logit contributions
lee-0.136
 opinions-0.130
elling-0.128
els-0.125
 requests-0.125
Token the
Token ID262
Feature activation-4.60e-02

Pos logit contributions
fire+0.025
 distance+0.022
 usual+0.021
|+0.021
 ride+0.020
Neg logit contributions
lee-0.048
atted-0.048
elling-0.047
ealing-0.047
ishment-0.046
Token pages
Token ID5468
Feature activation-1.39e-01

Pos logit contributions
fire+0.076
 distance+0.073
 usual+0.069
|+0.067
 ride+0.067
Neg logit contributions
atted-0.100
lee-0.099
ealing-0.096
elling-0.096
ishment-0.094
Token with
Token ID351
Feature activation-6.12e-02

Pos logit contributions
fire+0.165
 distance+0.146
|+0.146
 usual+0.138
 hunt+0.131
Neg logit contributions
lee-0.358
atted-0.354
elling-0.350
ealing-0.348
ishment-0.347
Token her
Token ID607
Feature activation+2.65e-03

Pos logit contributions
fire+0.097
 distance+0.086
|+0.085
 usual+0.080
 ride+0.079
Neg logit contributions
lee-0.141
atted-0.137
elling-0.136
ishment-0.134
ealing-0.132
Token amazing
Token ID4998
Feature activation+3.40e-02

Pos logit contributions
lee+0.005
atted+0.005
elling+0.005
ealing+0.005
ishment+0.005
Neg logit contributions
fire-0.005
 distance-0.005
|-0.004
 usual-0.004
 ride-0.004
Token journey
Token ID7002
Feature activation-3.03e-02

Pos logit contributions
atted+0.071
ealing+0.070
elling+0.069
lee+0.069
ished+0.067
Neg logit contributions
fire-0.055
 distance-0.055
 ride-0.055
 hunt-0.054
 usual-0.054
Token.
Token ID13
Feature activation-2.10e-02

Pos logit contributions
fire+0.040
|+0.038
 distance+0.037
 usual+0.033
To+0.033
Neg logit contributions
lee-0.073
atted-0.071
ishment-0.070
elling-0.069
ealing-0.069
Token saw
Token ID2497
Feature activation+3.23e-02

Pos logit contributions
fire+0.015
 distance+0.013
|+0.013
To+0.012
 usual+0.012
Neg logit contributions
lee-0.016
atted-0.016
ished-0.015
ealing-0.014
ishment-0.014
Token many
Token ID867
Feature activation-4.33e-02

Pos logit contributions
atted+0.077
lee+0.077
elling+0.074
 opinions+0.074
els+0.073
Neg logit contributions
fire-0.050
|-0.046
 distance-0.041
field-0.039
 hunt-0.038
Token books
Token ID3835
Feature activation-1.15e-01

Pos logit contributions
fire+0.050
 distance+0.046
 usual+0.043
|+0.042
 ride+0.041
Neg logit contributions
lee-0.115
atted-0.114
elling-0.112
ealing-0.112
ishment-0.110
Token on
Token ID319
Feature activation-5.99e-02

Pos logit contributions
fire+0.166
|+0.149
 distance+0.147
 usual+0.143
 gall+0.132
Neg logit contributions
atted-0.260
elling-0.259
lee-0.257
ealing-0.253
els-0.252
Token the
Token ID262
Feature activation-4.25e-02

Pos logit contributions
fire+0.098
|+0.089
 distance+0.086
To+0.076
tr+0.076
Neg logit contributions
els-0.130
lee-0.128
 opinions-0.123
ishment-0.123
elling-0.120
Token books
Token ID3835
Feature activation-1.39e-01

Pos logit contributions
fire+0.081
|+0.071
field+0.067
To+0.066
 distance+0.066
Neg logit contributions
lee-0.075
elling-0.075
 opinions-0.070
ishment-0.070
 requests-0.069
Tokenhe
Token ID258
Feature activation-3.23e-02

Pos logit contributions
fire+0.174
|+0.150
 distance+0.142
 usual+0.141
 gall+0.139
Neg logit contributions
elling-0.331
lee-0.326
els-0.324
oking-0.319
atted-0.317
Tokenlf
Token ID1652
Feature activation-8.32e-02

Pos logit contributions
fire+0.053
 distance+0.051
|+0.048
 usual+0.044
To+0.042
Neg logit contributions
lee-0.068
els-0.068
atted-0.066
 opinions-0.064
elling-0.062
Token.
Token ID13
Feature activation-2.14e-02

Pos logit contributions
fire+0.112
 distance+0.108
|+0.108
To+0.098
Ro+0.091
Neg logit contributions
atted-0.198
lee-0.197
els-0.193
ished-0.189
ishment-0.189
Token He
Token ID679
Feature activation-8.26e-03

Pos logit contributions
fire+0.038
 distance+0.034
|+0.033
 usual+0.032
 hunt+0.031
Neg logit contributions
lee-0.044
atted-0.044
ealing-0.043
elling-0.043
ishment-0.042
Token wanted
Token ID2227
Feature activation+4.08e-02

Pos logit contributions
fire+0.016
 distance+0.015
|+0.014
 usual+0.014
 hunt+0.013
Neg logit contributions
lee-0.016
atted-0.016
ealing-0.015
elling-0.015
ishment-0.015
Token big
Token ID1263
Feature activation+1.02e-02

Pos logit contributions
|+0.113
fire+0.110
 distance+0.096
 usual+0.096
field+0.092
Neg logit contributions
ealing-0.093
lee-0.091
ubby-0.091
atted-0.085
 requests-0.085
Token red
Token ID2266
Feature activation+2.54e-02

Pos logit contributions
lee+0.023
atted+0.023
elling+0.023
ealing+0.023
ishment+0.022
Neg logit contributions
fire-0.015
 distance-0.014
 usual-0.014
|-0.014
 ride-0.013
Token ball
Token ID2613
Feature activation-5.45e-02

Pos logit contributions
elling+0.049
els+0.047
ener+0.046
ubby+0.046
lee+0.045
Neg logit contributions
fire-0.048
|-0.044
 distance-0.040
 usual-0.040
To-0.036
Token behind
Token ID2157
Feature activation-5.55e-02

Pos logit contributions
fire+0.075
 usual+0.069
 distance+0.068
|+0.068
field+0.057
Neg logit contributions
atted-0.127
elling-0.127
els-0.126
lee-0.124
ished-0.124
Token the
Token ID262
Feature activation-3.01e-02

Pos logit contributions
fire+0.103
 seek+0.087
|+0.085
field+0.085
 distance+0.084
Neg logit contributions
els-0.115
lee-0.110
ishment-0.106
elling-0.106
 opinions-0.103
Token books
Token ID3835
Feature activation-1.39e-01

Pos logit contributions
fire+0.054
|+0.046
 usual+0.043
field+0.043
To+0.042
Neg logit contributions
elling-0.058
lee-0.057
els-0.054
ishment-0.051
ubby-0.051
Tokenhe
Token ID258
Feature activation-3.31e-02

Pos logit contributions
fire+0.203
|+0.177
 seek+0.165
 usual+0.164
 distance+0.158
Neg logit contributions
els-0.307
elling-0.307
lee-0.305
ares-0.295
atted-0.295
Tokenlf
Token ID1652
Feature activation-8.41e-02

Pos logit contributions
fire+0.054
 distance+0.050
|+0.049
 usual+0.047
 ride+0.044
Neg logit contributions
lee-0.070
els-0.069
atted-0.066
elling-0.063
ishment-0.063
Token.
Token ID13
Feature activation-6.48e-03

Pos logit contributions
fire+0.122
|+0.118
 distance+0.114
To+0.106
 usual+0.099
Neg logit contributions
lee-0.193
atted-0.193
els-0.191
ished-0.184
ishment-0.183
Token Tom
Token ID4186
Feature activation+7.18e-02

Pos logit contributions
fire+0.009
 distance+0.008
 usual+0.008
|+0.008
 ride+0.008
Neg logit contributions
lee-0.016
atted-0.015
elling-0.015
ealing-0.015
ishment-0.015
Token tried
Token ID3088
Feature activation+3.05e-02

Pos logit contributions
elling+0.179
ishment+0.173
lee+0.172
atted+0.170
ealing+0.169
Neg logit contributions
fire-0.094
 ride-0.090
 he-0.089
 hunt-0.087
 usual-0.085
Token put
Token ID1234
Feature activation-9.55e-03

Pos logit contributions
lee+0.038
ealing+0.037
ishment+0.037
elling+0.037
 opinions+0.037
Neg logit contributions
fire-0.023
 distance-0.022
|-0.020
 usual-0.019
 he-0.018
Token it
Token ID340
Feature activation-4.42e-02

Pos logit contributions
fire+0.014
 distance+0.013
 usual+0.012
|+0.012
 hunt+0.012
Neg logit contributions
lee-0.022
atted-0.022
elling-0.022
ealing-0.022
ishment-0.022
Token back
Token ID736
Feature activation-1.28e-02

Pos logit contributions
fire+0.063
 distance+0.059
|+0.053
 hunt+0.052
 usual+0.051
Neg logit contributions
ished-0.101
atted-0.101
elling-0.100
lee-0.098
els-0.096
Token on
Token ID319
Feature activation-3.90e-02

Pos logit contributions
fire+0.020
 distance+0.019
|+0.017
 hunt+0.017
 ride+0.016
Neg logit contributions
atted-0.028
elling-0.028
lee-0.027
ished-0.027
els-0.027
Token the
Token ID262
Feature activation-3.95e-02

Pos logit contributions
fire+0.061
 distance+0.055
|+0.054
 hunt+0.052
 usual+0.051
Neg logit contributions
lee-0.086
elling-0.085
ishment-0.085
els-0.083
atted-0.082
Token books
Token ID3835
Feature activation-1.39e-01

Pos logit contributions
fire+0.063
|+0.054
 distance+0.051
To+0.048
field+0.048
Neg logit contributions
lee-0.087
elling-0.086
ishment-0.082
 opinions-0.081
atted-0.081
Tokenhe
Token ID258
Feature activation-4.06e-02

Pos logit contributions
fire+0.178
 distance+0.160
|+0.153
 usual+0.153
 saf+0.149
Neg logit contributions
elling-0.336
lee-0.326
ishment-0.323
atted-0.322
ished-0.319
Tokenlf
Token ID1652
Feature activation-8.84e-02

Pos logit contributions
fire+0.062
 distance+0.056
|+0.054
 usual+0.051
 ride+0.048
Neg logit contributions
lee-0.092
els-0.089
atted-0.089
elling-0.088
ishment-0.086
Token so
Token ID523
Feature activation+2.96e-03

Pos logit contributions
fire+0.143
 distance+0.141
|+0.134
To+0.127
 usual+0.126
Neg logit contributions
lee-0.182
atted-0.180
elling-0.177
ishment-0.176
ished-0.175
Token we
Token ID356
Feature activation-3.50e-02

Pos logit contributions
lee+0.006
atted+0.006
elling+0.006
ealing+0.006
ishment+0.005
Neg logit contributions
 distance-0.005
fire-0.005
|-0.005
 hunt-0.005
 saf-0.004
Token can
Token ID460
Feature activation-3.56e-02

Pos logit contributions
fire+0.070
 distance+0.068
 usual+0.064
|+0.063
 saf+0.060
Neg logit contributions
lee-0.062
elling-0.061
ealing-0.060
atted-0.060
ished-0.059
Token be
Token ID307
Feature activation+1.71e-02

Pos logit contributions
fire+0.101
|+0.090
 distance+0.089
To+0.085
 usual+0.082
Neg logit contributions
lee-0.119
 opinions-0.116
els-0.114
ishment-0.110
atted-0.110
Token up
Token ID510
Feature activation-5.56e-03

Pos logit contributions
lee+0.038
atted+0.037
ealing+0.036
elling+0.036
ished+0.036
Neg logit contributions
fire-0.028
|-0.025
 distance-0.025
 usual-0.024
 hunt-0.024
Token high
Token ID1029
Feature activation+4.61e-02

Pos logit contributions
fire+0.008
|+0.007
 distance+0.007
 hunt+0.007
 usual+0.007
Neg logit contributions
lee-0.013
atted-0.013
ishment-0.012
ealing-0.012
elling-0.012
Token on
Token ID319
Feature activation-4.36e-02

Pos logit contributions
lee+0.104
atted+0.103
elling+0.102
ishment+0.100
ealing+0.100
Neg logit contributions
fire-0.071
 distance-0.066
|-0.063
 usual-0.062
 ride-0.060
Token the
Token ID262
Feature activation-3.64e-02

Pos logit contributions
fire+0.062
|+0.059
 distance+0.058
 hunt+0.052
 ride+0.052
Neg logit contributions
lee-0.103
els-0.100
ishment-0.099
atted-0.096
elling-0.096
Token roof
Token ID9753
Feature activation-1.38e-01

Pos logit contributions
fire+0.051
|+0.049
To+0.046
 hunt+0.044
 he+0.042
Neg logit contributions
lee-0.084
elling-0.080
 opinions-0.077
atted-0.077
ishment-0.077
Token,
Token ID11
Feature activation-5.54e-03

Pos logit contributions
|+0.212
fire+0.210
To+0.183
 hunt+0.183
 usual+0.183
Neg logit contributions
lee-0.297
atted-0.287
elling-0.287
ishment-0.284
els-0.278
Token but
Token ID475
Feature activation+3.43e-03

Pos logit contributions
fire+0.008
 distance+0.008
 usual+0.007
 ride+0.007
 he+0.007
Neg logit contributions
lee-0.013
atted-0.013
elling-0.013
ealing-0.013
ished-0.012
Token they
Token ID484
Feature activation-2.43e-03

Pos logit contributions
lee+0.009
atted+0.009
elling+0.009
ealing+0.009
ishment+0.009
Neg logit contributions
fire-0.004
 distance-0.004
|-0.004
 usual-0.004
 ride-0.003
Token both
Token ID1111
Feature activation+1.27e-02

Pos logit contributions
fire+0.004
 distance+0.004
|+0.004
 hunt+0.004
 usual+0.004
Neg logit contributions
lee-0.005
atted-0.005
ished-0.005
ealing-0.005
 opinions-0.005
Token enjoyed
Token ID8359
Feature activation+2.18e-02

Pos logit contributions
lee+0.024
atted+0.024
ealing+0.022
ished+0.022
 opinions+0.021
Neg logit contributions
fire-0.025
 distance-0.023
|-0.023
 hunt-0.022
 usual-0.022
Token he
Token ID339
Feature activation+3.88e-03

Pos logit contributions
lee+0.014
atted+0.014
ealing+0.013
ishment+0.013
elling+0.013
Neg logit contributions
fire-0.006
|-0.006
 distance-0.006
 hunt-0.005
 usual-0.005
Token was
Token ID373
Feature activation+4.81e-02

Pos logit contributions
atted+0.008
lee+0.008
ealing+0.007
ished+0.007
elling+0.007
Neg logit contributions
fire-0.007
 distance-0.007
|-0.007
 usual-0.006
 hunt-0.006
Token sorting
Token ID29407
Feature activation-7.54e-03

Pos logit contributions
lee+0.092
atted+0.092
ealing+0.089
elling+0.089
ishment+0.087
Neg logit contributions
fire-0.091
 distance-0.085
|-0.082
 usual-0.082
 hunt-0.080
Token out
Token ID503
Feature activation-6.55e-02

Pos logit contributions
fire+0.009
 distance+0.008
 usual+0.008
|+0.008
 ride+0.007
Neg logit contributions
lee-0.020
atted-0.020
elling-0.019
ealing-0.019
ishment-0.019
Token the
Token ID262
Feature activation-3.27e-02

Pos logit contributions
fire+0.093
 distance+0.084
|+0.083
 usual+0.078
 hunt+0.078
Neg logit contributions
lee-0.159
atted-0.158
elling-0.152
ealing-0.152
els-0.151
Token books
Token ID3835
Feature activation-1.38e-01

Pos logit contributions
fire+0.069
 distance+0.066
 usual+0.063
 ride+0.063
|+0.062
Neg logit contributions
lee-0.057
atted-0.056
elling-0.054
ealing-0.054
ishment-0.053
Token they
Token ID484
Feature activation-4.89e-02

Pos logit contributions
fire+0.197
|+0.191
 distance+0.173
 saf+0.171
Ro+0.164
Neg logit contributions
ishment-0.297
ished-0.295
elling-0.294
atted-0.294
els-0.293
Token were
Token ID547
Feature activation+1.10e-02

Pos logit contributions
fire+0.077
 distance+0.070
|+0.068
 usual+0.065
To+0.062
Neg logit contributions
lee-0.108
atted-0.103
ealing-0.103
elling-0.102
ishment-0.101
Token holding
Token ID4769
Feature activation+1.88e-02

Pos logit contributions
lee+0.024
atted+0.024
ealing+0.023
elling+0.023
ishment+0.023
Neg logit contributions
fire-0.018
 distance-0.017
|-0.016
 usual-0.016
 ride-0.015
Token.
Token ID13
Feature activation-1.17e-02

Pos logit contributions
lee+0.044
atted+0.043
ealing+0.043
elling+0.043
ishment+0.042
Neg logit contributions
fire-0.028
 distance-0.026
|-0.025
 usual-0.025
 hunt-0.024
Token "
Token ID366
Feature activation+3.02e-02

Pos logit contributions
fire+0.013
 distance+0.012
 he+0.012
 usual+0.012
To+0.011
Neg logit contributions
elling-0.031
atted-0.030
lee-0.030
els-0.030
ishment-0.029
Token and
Token ID290
Feature activation-3.43e-02

Pos logit contributions
fire+0.163
|+0.163
 Zig+0.152
 distance+0.150
 usual+0.149
Neg logit contributions
lee-0.130
atted-0.130
ishment-0.124
elling-0.118
ished-0.116
Token accidentally
Token ID14716
Feature activation+2.46e-02

Pos logit contributions
fire+0.063
 distance+0.058
|+0.055
 usual+0.054
 hunt+0.051
Neg logit contributions
lee-0.070
atted-0.069
ealing-0.065
elling-0.065
ishment-0.064
Token knocked
Token ID13642
Feature activation+1.81e-02

Pos logit contributions
lee+0.047
atted+0.047
ished+0.045
els+0.045
ealing+0.041
Neg logit contributions
fire-0.048
 distance-0.046
|-0.044
field-0.040
 usual-0.040
Token over
Token ID625
Feature activation-1.28e-04

Pos logit contributions
elling+0.039
atted+0.038
lee+0.038
els+0.037
ished+0.037
Neg logit contributions
fire-0.034
 hunt-0.028
 distance-0.028
|-0.027
 usual-0.027
Token the
Token ID262
Feature activation-2.40e-02

Pos logit contributions
fire+0.000
 distance+0.000
|+0.000
field+0.000
 hunt+0.000
Neg logit contributions
lee-0.000
atted-0.000
 opinions-0.000
els-0.000
elling-0.000
Token books
Token ID3835
Feature activation-1.38e-01

Pos logit contributions
fire+0.040
|+0.034
field+0.033
 distance+0.033
 usual+0.031
Neg logit contributions
elling-0.052
lee-0.050
els-0.049
 requests-0.048
 mistakes-0.048
Tokenhe
Token ID258
Feature activation-3.44e-02

Pos logit contributions
fire+0.177
 usual+0.149
|+0.147
 distance+0.144
 seek+0.143
Neg logit contributions
elling-0.339
ares-0.329
lee-0.328
atted-0.326
els-0.325
Tokenlf
Token ID1652
Feature activation-8.45e-02

Pos logit contributions
fire+0.057
 distance+0.055
|+0.052
 usual+0.050
 gall+0.047
Neg logit contributions
lee-0.070
els-0.068
atted-0.066
elling-0.064
 opinions-0.064
Token.
Token ID13
Feature activation-1.95e-02

Pos logit contributions
fire+0.117
 distance+0.106
|+0.105
 usual+0.098
To+0.095
Neg logit contributions
lee-0.199
atted-0.197
elling-0.192
ished-0.192
els-0.191
Token
Token ID220
Feature activation+2.29e-02

Pos logit contributions
fire+0.028
 distance+0.026
 usual+0.024
|+0.024
 hunt+0.023
Neg logit contributions
lee-0.047
atted-0.046
elling-0.045
ealing-0.045
ishment-0.044
Token
Token ID198
Feature activation-5.54e-02

Pos logit contributions
lee+0.064
atted+0.064
ealing+0.063
elling+0.063
ished+0.061
Neg logit contributions
fire-0.023
 distance-0.020
 usual-0.018
 hunt-0.018
|-0.017
Token cat
Token ID3797
Feature activation-6.95e-03

Pos logit contributions
elling+0.030
 opinions+0.030
ealing+0.029
ubby+0.029
lee+0.029
Neg logit contributions
fire-0.032
|-0.031
 distance-0.028
 usual-0.028
 gall-0.027
Token is
Token ID318
Feature activation+2.41e-02

Pos logit contributions
fire+0.014
 distance+0.013
 hunt+0.013
 ride+0.013
|+0.013
Neg logit contributions
lee-0.012
elling-0.012
atted-0.012
ealing-0.012
ishment-0.012
Token hiding
Token ID11816
Feature activation+2.28e-02

Pos logit contributions
lee+0.049
atted+0.048
elling+0.048
ealing+0.047
ishment+0.046
Neg logit contributions
fire-0.043
 distance-0.040
 usual-0.039
|-0.038
 ride-0.038
Token behind
Token ID2157
Feature activation-3.35e-02

Pos logit contributions
lee+0.055
atted+0.054
ishment+0.053
elling+0.052
ealing+0.052
Neg logit contributions
fire-0.031
 distance-0.029
|-0.029
 ride-0.028
 hunt-0.027
Token the
Token ID262
Feature activation-2.57e-02

Pos logit contributions
fire+0.051
 distance+0.044
|+0.043
 usual+0.042
 seek+0.041
Neg logit contributions
lee-0.080
elling-0.076
atted-0.076
els-0.076
ishment-0.074
Token books
Token ID3835
Feature activation-1.37e-01

Pos logit contributions
fire+0.048
|+0.046
 usual+0.042
field+0.040
 gall+0.040
Neg logit contributions
lee-0.047
elling-0.046
 mistakes-0.045
els-0.043
 opinions-0.042
Tokenhe
Token ID258
Feature activation-2.58e-02

Pos logit contributions
fire+0.210
|+0.198
 usual+0.193
 seek+0.181
 distance+0.177
Neg logit contributions
els-0.284
elling-0.281
ares-0.271
lee-0.270
oking-0.268
Tokenlf
Token ID1652
Feature activation-8.28e-02

Pos logit contributions
fire+0.035
 distance+0.033
|+0.032
 usual+0.031
 ride+0.029
Neg logit contributions
lee-0.062
els-0.061
atted-0.060
elling-0.059
ishment-0.058
Token.
Token ID13
Feature activation-2.19e-03

Pos logit contributions
fire+0.109
|+0.109
 distance+0.105
 usual+0.098
To+0.096
Neg logit contributions
lee-0.195
atted-0.194
els-0.193
ishment-0.188
ished-0.184
Token The
Token ID383
Feature activation+1.58e-02

Pos logit contributions
fire+0.004
 distance+0.003
 usual+0.003
 hunt+0.003
|+0.003
Neg logit contributions
lee-0.005
atted-0.005
elling-0.005
ealing-0.005
ishment-0.005
Token cat
Token ID3797
Feature activation-6.43e-03

Pos logit contributions
elling+0.027
 opinions+0.027
 mistakes+0.027
lee+0.027
ubby+0.026
Neg logit contributions
fire-0.030
|-0.030
 distance-0.027
 usual-0.027
To-0.026
Token cozy
Token ID37438
Feature activation+8.57e-03

Pos logit contributions
|+0.150
fire+0.128
seek+0.121
 usual+0.118
whe+0.117
Neg logit contributions
 mistakes-0.157
lee-0.155
ishment-0.145
ealing-0.145
ubby-0.139
Token nest
Token ID16343
Feature activation-8.80e-02

Pos logit contributions
lee+0.019
ishment+0.018
atted+0.018
ubby+0.017
elling+0.017
Neg logit contributions
fire-0.013
|-0.012
 usual-0.012
To-0.012
whe-0.012
Token on
Token ID319
Feature activation-5.28e-02

Pos logit contributions
|+0.121
 distance+0.113
fire+0.111
Ro+0.109
 saf+0.108
Neg logit contributions
atted-0.196
lee-0.196
elling-0.191
ished-0.189
els-0.188
Token the
Token ID262
Feature activation-4.24e-02

Pos logit contributions
fire+0.079
|+0.077
 distance+0.073
 hunt+0.070
 ride+0.069
Neg logit contributions
lee-0.113
els-0.112
ishment-0.111
 opinions-0.107
elling-0.107
Token green
Token ID4077
Feature activation+2.92e-03

Pos logit contributions
|+0.070
fire+0.064
To+0.062
 he+0.059
 seek+0.056
Neg logit contributions
lee-0.084
elling-0.083
ishment-0.080
 mistakes-0.080
ubby-0.077
Token roof
Token ID9753
Feature activation-1.36e-01

Pos logit contributions
lee+0.006
elling+0.006
atted+0.006
els+0.006
ener+0.006
Neg logit contributions
|-0.005
fire-0.004
 usual-0.004
To-0.004
whe-0.004
Token.
Token ID13
Feature activation-1.51e-02

Pos logit contributions
|+0.192
fire+0.185
 hunt+0.174
Ro+0.171
 usual+0.171
Neg logit contributions
lee-0.306
atted-0.303
elling-0.300
els-0.290
ished-0.288
Token From
Token ID3574
Feature activation+3.73e-02

Pos logit contributions
fire+0.026
 distance+0.024
 usual+0.023
|+0.022
 hunt+0.022
Neg logit contributions
lee-0.033
atted-0.032
elling-0.031
ealing-0.031
ishment-0.031
Token that
Token ID326
Feature activation-1.24e-03

Pos logit contributions
lee+0.094
atted+0.094
ealing+0.093
elling+0.092
ishment+0.091
Neg logit contributions
fire-0.048
 distance-0.044
 usual-0.041
 ride-0.040
|-0.039
Token day
Token ID1110
Feature activation+2.78e-02

Pos logit contributions
fire+0.001
 distance+0.001
|+0.001
 usual+0.001
 hunt+0.001
Neg logit contributions
lee-0.003
atted-0.003
elling-0.003
ubby-0.003
ealing-0.003
Token on
Token ID319
Feature activation+1.67e-02

Pos logit contributions
atted+0.079
lee+0.079
ealing+0.078
elling+0.077
ubby+0.075
Neg logit contributions
fire-0.028
 distance-0.024
 usual-0.023
|-0.022
 hunt-0.022
Token behind
Token ID2157
Feature activation-5.83e-02

Pos logit contributions
atted+0.042
lee+0.042
elling+0.042
els+0.041
ished+0.041
Neg logit contributions
fire-0.022
|-0.020
 distance-0.019
 ride-0.018
 usual-0.018
Token the
Token ID262
Feature activation-3.33e-02

Pos logit contributions
fire+0.096
|+0.088
 seek+0.083
 distance+0.081
tr+0.081
Neg logit contributions
lee-0.127
els-0.126
ishment-0.119
atted-0.117
elling-0.117
Token tall
Token ID7331
Feature activation-7.45e-03

Pos logit contributions
fire+0.058
|+0.052
 gall+0.050
 usual+0.050
 ride+0.048
Neg logit contributions
lee-0.070
elling-0.068
atted-0.063
ealing-0.063
els-0.062
Token pile
Token ID14540
Feature activation-8.46e-02

Pos logit contributions
fire+0.012
|+0.011
 usual+0.011
 distance+0.011
To+0.010
Neg logit contributions
lee-0.016
elling-0.015
atted-0.015
els-0.015
ishment-0.015
Token of
Token ID286
Feature activation-7.06e-02

Pos logit contributions
|+0.124
fire+0.124
 distance+0.111
 usual+0.110
 hunt+0.108
Neg logit contributions
lee-0.192
atted-0.188
els-0.183
ished-0.181
elling-0.180
Token books
Token ID3835
Feature activation-1.36e-01

Pos logit contributions
fire+0.123
 distance+0.115
 usual+0.111
|+0.109
 ride+0.107
Neg logit contributions
lee-0.145
atted-0.145
elling-0.142
ealing-0.142
ishment-0.139
Token in
Token ID287
Feature activation-7.49e-02

Pos logit contributions
fire+0.167
|+0.166
 saf+0.139
 usual+0.139
 seek+0.138
Neg logit contributions
els-0.322
lee-0.317
elling-0.316
ement-0.312
ished-0.310
Token the
Token ID262
Feature activation-4.58e-02

Pos logit contributions
fire+0.114
|+0.100
tr+0.095
 usual+0.092
 hunt+0.092
Neg logit contributions
lee-0.170
els-0.168
ishment-0.165
 opinions-0.161
elling-0.160
Token corner
Token ID5228
Feature activation-8.96e-02

Pos logit contributions
fire+0.072
 gall+0.063
|+0.061
 Zig+0.061
 usual+0.060
Neg logit contributions
lee-0.102
elling-0.100
 opinions-0.094
ishment-0.094
ealing-0.093
Token.
Token ID13
Feature activation-1.61e-02

Pos logit contributions
fire+0.128
|+0.128
 ride+0.113
 hunt+0.113
 usual+0.112
Neg logit contributions
lee-0.213
atted-0.209
els-0.202
elling-0.199
ished-0.195
Token Quick
Token ID12029
Feature activation+7.91e-02

Pos logit contributions
fire+0.024
 distance+0.023
 usual+0.021
|+0.021
 hunt+0.021
Neg logit contributions
lee-0.038
atted-0.037
elling-0.036
ealing-0.036
ishment-0.035
Token take
Token ID1011
Feature activation+1.51e-02

Pos logit contributions
fire+0.065
 distance+0.058
|+0.057
 usual+0.056
To+0.053
Neg logit contributions
lee-0.093
ishment-0.091
 opinions-0.091
ealing-0.089
els-0.089
Token a
Token ID257
Feature activation-6.98e-02

Pos logit contributions
lee+0.037
atted+0.037
ealing+0.036
elling+0.036
ished+0.035
Neg logit contributions
fire-0.021
 distance-0.019
|-0.018
 usual-0.017
 hunt-0.017
Token march
Token ID9960
Feature activation-1.37e-02

Pos logit contributions
fire+0.116
field+0.093
|+0.092
 seek+0.085
 Zig+0.082
Neg logit contributions
ealing-0.148
ares-0.145
atted-0.144
lee-0.142
ubby-0.141
Token around
Token ID1088
Feature activation-2.50e-02

Pos logit contributions
fire+0.021
|+0.019
 usual+0.018
 distance+0.018
field+0.017
Neg logit contributions
lee-0.029
atted-0.029
ealing-0.029
ishment-0.029
ished-0.028
Token the
Token ID262
Feature activation-3.20e-02

Pos logit contributions
fire+0.036
|+0.032
 distance+0.032
 usual+0.031
 hunt+0.030
Neg logit contributions
lee-0.058
atted-0.056
ishment-0.055
els-0.055
ening-0.054
Token books
Token ID3835
Feature activation-1.36e-01

Pos logit contributions
fire+0.061
|+0.054
 distance+0.053
 usual+0.052
 he+0.050
Neg logit contributions
lee-0.061
elling-0.059
ealing-0.058
ishment-0.058
ished-0.057
Tokenhe
Token ID258
Feature activation-3.16e-02

Pos logit contributions
fire+0.174
|+0.165
 saf+0.148
To+0.147
 distance+0.146
Neg logit contributions
ement-0.301
ishment-0.297
ished-0.297
lee-0.295
ares-0.292
Tokenlf
Token ID1652
Feature activation-7.93e-02

Pos logit contributions
fire+0.045
 distance+0.043
|+0.041
 usual+0.040
 ride+0.038
Neg logit contributions
lee-0.073
atted-0.070
els-0.070
ishment-0.068
elling-0.067
Token like
Token ID588
Feature activation-8.62e-03

Pos logit contributions
|+0.111
fire+0.110
 distance+0.104
To+0.103
 usual+0.091
Neg logit contributions
lee-0.177
atted-0.175
ishment-0.171
ished-0.170
els-0.165
Token he
Token ID339
Feature activation-1.36e-02

Pos logit contributions
fire+0.012
|+0.011
 distance+0.011
 hunt+0.010
field+0.009
Neg logit contributions
lee-0.021
atted-0.021
elling-0.020
 opinions-0.019
els-0.019
Token had
Token ID550
Feature activation+4.34e-03

Pos logit contributions
fire+0.021
 distance+0.020
|+0.018
 usual+0.017
To+0.017
Neg logit contributions
lee-0.030
atted-0.030
ished-0.029
ealing-0.029
els-0.028
Token something
Token ID1223
Feature activation-5.38e-02

Pos logit contributions
lee+0.102
atted+0.101
elling+0.096
els+0.093
ished+0.092
Neg logit contributions
|-0.078
fire-0.078
 distance-0.068
 hunt-0.063
 seek-0.062
Token interesting
Token ID3499
Feature activation+5.12e-02

Pos logit contributions
fire+0.097
|+0.096
 distance+0.091
 usual+0.083
 ride+0.083
Neg logit contributions
lee-0.105
atted-0.102
ealing-0.100
elling-0.099
els-0.096
Token happening
Token ID5836
Feature activation+3.00e-02

Pos logit contributions
atted+0.127
lee+0.126
elling+0.122
els+0.121
ished+0.119
Neg logit contributions
fire-0.066
|-0.063
 distance-0.062
 usual-0.055
 ride-0.054
Token on
Token ID319
Feature activation-3.59e-02

Pos logit contributions
lee+0.070
atted+0.069
elling+0.068
els+0.067
ishment+0.067
Neg logit contributions
fire-0.042
|-0.038
 ride-0.035
 distance-0.034
 usual-0.034
Token the
Token ID262
Feature activation-4.01e-02

Pos logit contributions
fire+0.058
|+0.054
tr+0.049
 distance+0.048
 seek+0.047
Neg logit contributions
lee-0.079
els-0.079
ishment-0.076
elling-0.075
ubby-0.073
Token books
Token ID3835
Feature activation-1.36e-01

Pos logit contributions
|+0.066
fire+0.063
To+0.062
tr+0.058
 ride+0.056
Neg logit contributions
lee-0.080
elling-0.078
ubby-0.073
ealing-0.070
ishment-0.070
Tokenhe
Token ID258
Feature activation-3.41e-02

Pos logit contributions
fire+0.180
|+0.166
 seek+0.157
Ar+0.150
 Zig+0.147
Neg logit contributions
elling-0.304
ement-0.303
els-0.300
ares-0.299
lee-0.292
Tokenlf
Token ID1652
Feature activation-7.93e-02

Pos logit contributions
|+0.052
fire+0.051
 distance+0.050
tr+0.046
 usual+0.045
Neg logit contributions
lee-0.073
els-0.072
atted-0.070
ubby-0.066
 opinions-0.065
Token near
Token ID1474
Feature activation-3.44e-02

Pos logit contributions
|+0.116
fire+0.113
 distance+0.106
To+0.101
Ar+0.101
Neg logit contributions
lee-0.176
atted-0.174
els-0.171
ished-0.167
ishment-0.163
Token the
Token ID262
Feature activation-3.48e-02

Pos logit contributions
fire+0.048
|+0.043
 distance+0.039
tr+0.039
 seek+0.038
Neg logit contributions
lee-0.085
ishment-0.082
els-0.082
atted-0.081
elling-0.080
Token river
Token ID7850
Feature activation-8.30e-02

Pos logit contributions
|+0.058
To+0.054
fire+0.053
bel+0.049
 usual+0.049
Neg logit contributions
elling-0.063
lee-0.063
 heads-0.058
els-0.057
ener-0.057
Token all
Token ID477
Feature activation-3.77e-02

Pos logit contributions
lee+0.092
atted+0.092
elling+0.090
ealing+0.090
ishment+0.088
Neg logit contributions
fire-0.053
 distance-0.049
 usual-0.047
|-0.047
 ride-0.045
Token the
Token ID262
Feature activation-2.67e-02

Pos logit contributions
 distance+0.032
fire+0.031
|+0.029
 usual+0.028
 ride+0.027
Neg logit contributions
ealing-0.111
ishment-0.109
lee-0.108
atted-0.106
elling-0.105
Token books
Token ID3835
Feature activation-1.19e-01

Pos logit contributions
fire+0.028
 distance+0.026
 usual+0.025
|+0.024
 ride+0.023
Neg logit contributions
lee-0.073
atted-0.073
ealing-0.072
elling-0.072
ishment-0.071
Token off
Token ID572
Feature activation-5.32e-02

Pos logit contributions
fire+0.191
 distance+0.175
|+0.173
 usual+0.171
 hunt+0.167
Neg logit contributions
atted-0.252
lee-0.250
elling-0.246
ished-0.241
ishment-0.239
Token the
Token ID262
Feature activation-3.15e-02

Pos logit contributions
fire+0.093
 hunt+0.085
|+0.085
 distance+0.083
 ride+0.080
Neg logit contributions
lee-0.113
atted-0.108
els-0.102
elling-0.101
ishment-0.100
Token books
Token ID3835
Feature activation-1.36e-01

Pos logit contributions
fire+0.050
|+0.041
 distance+0.041
field+0.039
 usual+0.038
Neg logit contributions
lee-0.069
elling-0.069
ishment-0.065
 opinions-0.065
 requests-0.065
Tokenhe
Token ID258
Feature activation-2.99e-02

Pos logit contributions
fire+0.173
|+0.154
 distance+0.152
 usual+0.151
 gall+0.142
Neg logit contributions
lee-0.324
elling-0.318
atted-0.318
ishment-0.313
els-0.308
Tokenlf
Token ID1652
Feature activation-8.33e-02

Pos logit contributions
fire+0.043
 distance+0.040
|+0.037
 usual+0.036
 ride+0.034
Neg logit contributions
lee-0.071
atted-0.070
els-0.068
elling-0.067
ishment-0.066
Token and
Token ID290
Feature activation-3.69e-02

Pos logit contributions
fire+0.134
 distance+0.129
|+0.126
To+0.119
 usual+0.118
Neg logit contributions
lee-0.181
atted-0.179
ishment-0.168
ished-0.167
els-0.167
Token started
Token ID2067
Feature activation+8.75e-03

Pos logit contributions
fire+0.057
 distance+0.052
|+0.050
 usual+0.048
 hunt+0.047
Neg logit contributions
lee-0.085
atted-0.084
elling-0.081
ishment-0.081
ealing-0.081
Token to
Token ID284
Feature activation-5.17e-02

Pos logit contributions
lee+0.023
atted+0.023
ealing+0.021
ished+0.021
els+0.021
Neg logit contributions
fire-0.011
 distance-0.010
|-0.010
 hunt-0.009
 usual-0.009
Token match
Token ID2872
Feature activation-4.08e-02

Pos logit contributions
fire+0.100
 distance+0.092
|+0.086
 usual+0.084
field+0.078
Neg logit contributions
lee-0.137
 opinions-0.136
els-0.133
elling-0.133
 requests-0.130
Token the
Token ID262
Feature activation-3.05e-02

Pos logit contributions
fire+0.054
 distance+0.049
|+0.047
 usual+0.044
 ride+0.043
Neg logit contributions
lee-0.102
atted-0.101
elling-0.100
ealing-0.099
ishment-0.099
Token pictures
Token ID5986
Feature activation-1.09e-01

Pos logit contributions
fire+0.061
|+0.055
 distance+0.055
 usual+0.052
 ride+0.051
Neg logit contributions
elling-0.055
lee-0.054
ealing-0.052
 opinions-0.052
els-0.052
Token on
Token ID319
Feature activation-5.49e-02

Pos logit contributions
fire+0.164
|+0.155
 distance+0.152
 usual+0.140
 hunt+0.130
Neg logit contributions
atted-0.241
elling-0.241
lee-0.240
els-0.236
ished-0.236
Token the
Token ID262
Feature activation-3.83e-02

Pos logit contributions
fire+0.096
|+0.090
 distance+0.087
 seek+0.075
tr+0.075
Neg logit contributions
els-0.119
lee-0.112
ishment-0.109
 opinions-0.107
elling-0.106
Token pages
Token ID5468
Feature activation-1.36e-01

Pos logit contributions
fire+0.072
|+0.067
 distance+0.064
 he+0.061
 usual+0.059
Neg logit contributions
lee-0.072
elling-0.072
els-0.068
ishment-0.068
 opinions-0.066
Token.
Token ID13
Feature activation-3.50e-02

Pos logit contributions
fire+0.197
|+0.192
 distance+0.179
 usual+0.163
 saf+0.161
Neg logit contributions
lee-0.302
elling-0.298
ished-0.295
ishment-0.295
oking-0.294
Token
Token ID198
Feature activation-6.60e-02

Pos logit contributions
fire+0.034
 distance+0.028
 usual+0.026
|+0.026
 hunt+0.024
Neg logit contributions
lee-0.100
atted-0.100
ealing-0.099
elling-0.098
ishment-0.097
Token
Token ID198
Feature activation-5.16e-02

Pos logit contributions
fire+0.092
 distance+0.077
 usual+0.073
 hunt+0.071
 seek+0.069
Neg logit contributions
atted-0.156
lee-0.156
ealing-0.153
oking-0.149
elling-0.148
TokenTom
Token ID13787
Feature activation+7.18e-02

Pos logit contributions
fire+0.093
 usual+0.083
 distance+0.082
|+0.082
field+0.077
Neg logit contributions
ealing-0.104
atted-0.100
lee-0.098
oking-0.098
elling-0.098
Token saw
Token ID2497
Feature activation+3.61e-02

Pos logit contributions
elling+0.190
ealing+0.188
lee+0.186
atted+0.185
ishment+0.184
Neg logit contributions
fire-0.082
 hunt-0.077
 ride-0.076
 he-0.072
 distance-0.072
Token to
Token ID284
Feature activation-4.20e-02

Pos logit contributions
|+0.021
fire+0.020
 distance+0.018
tr+0.017
To+0.016
Neg logit contributions
lee-0.029
 opinions-0.028
atted-0.028
 requests-0.027
ishment-0.027
Token start
Token ID923
Feature activation-3.68e-03

Pos logit contributions
fire+0.068
|+0.061
 distance+0.059
 usual+0.058
To+0.056
Neg logit contributions
lee-0.089
ishment-0.087
 opinions-0.086
ealing-0.085
atted-0.084
Token un
Token ID555
Feature activation+2.79e-02

Pos logit contributions
fire+0.006
|+0.006
 distance+0.006
To+0.005
 usual+0.005
Neg logit contributions
lee-0.008
atted-0.008
els-0.007
ealing-0.007
ishment-0.007
Tokenloading
Token ID25138
Feature activation-3.08e-02

Pos logit contributions
lee+0.058
atted+0.058
elling+0.057
ealing+0.057
ishment+0.056
Neg logit contributions
fire-0.048
 distance-0.045
 usual-0.043
|-0.043
 ride-0.041
Token the
Token ID262
Feature activation-3.23e-02

Pos logit contributions
fire+0.047
 distance+0.043
|+0.042
 usual+0.041
 ride+0.039
Neg logit contributions
lee-0.070
atted-0.070
ealing-0.068
elling-0.068
ishment-0.067
Token books
Token ID3835
Feature activation-1.36e-01

Pos logit contributions
fire+0.045
|+0.039
 distance+0.039
 usual+0.037
field+0.034
Neg logit contributions
elling-0.078
lee-0.077
ealing-0.076
atted-0.076
ishment-0.074
Token he
Token ID339
Feature activation-2.92e-02

Pos logit contributions
fire+0.239
|+0.235
 distance+0.217
Ro+0.214
 usual+0.212
Neg logit contributions
elling-0.239
lee-0.234
ement-0.234
ishment-0.233
ished-0.232
Token noticed
Token ID6810
Feature activation+3.66e-02

Pos logit contributions
fire+0.044
 distance+0.042
|+0.041
 usual+0.036
To+0.035
Neg logit contributions
lee-0.067
atted-0.065
ealing-0.063
ished-0.062
elling-0.062
Token a
Token ID257
Feature activation-5.62e-02

Pos logit contributions
lee+0.082
atted+0.082
elling+0.078
ealing+0.078
els+0.077
Neg logit contributions
fire-0.055
|-0.055
 distance-0.052
 usual-0.047
 hunt-0.047
Token little
Token ID1310
Feature activation+3.90e-02

Pos logit contributions
|+0.102
fire+0.091
 Zig+0.083
 usual+0.082
To+0.079
Neg logit contributions
lee-0.101
ealing-0.101
 mistakes-0.094
ubby-0.094
 darling-0.093
Token girl
Token ID2576
Feature activation-5.99e-03

Pos logit contributions
lee+0.074
atted+0.073
ishment+0.073
elling+0.073
ealing+0.072
Neg logit contributions
fire-0.070
|-0.068
 usual-0.066
To-0.063
 distance-0.063
Token,
Token ID11
Feature activation+1.78e-02

Pos logit contributions
lee+0.123
atted+0.121
elling+0.121
ealing+0.119
ishment+0.118
Neg logit contributions
fire-0.086
 distance-0.079
 usual-0.077
|-0.075
 ride-0.074
Token
Token ID198
Feature activation-4.58e-02

Pos logit contributions
lee+0.033
atted+0.033
ealing+0.032
elling+0.031
ishment+0.030
Neg logit contributions
fire-0.035
 distance-0.033
|-0.033
 usual-0.032
 hunt-0.031
Token"
Token ID1
Feature activation+2.58e-02

Pos logit contributions
 usual+0.051
fire+0.051
 distance+0.049
 hunt+0.045
|+0.042
Neg logit contributions
lee-0.121
atted-0.118
ealing-0.117
elling-0.113
oking-0.111
TokenDon
Token ID3987
Feature activation+7.91e-02

Pos logit contributions
atted+0.057
lee+0.057
elling+0.056
ealing+0.055
ishment+0.055
Neg logit contributions
fire-0.041
 distance-0.038
|-0.037
 usual-0.036
 hunt-0.036
Token't
Token ID470
Feature activation+3.91e-03

Pos logit contributions
lee+0.218
elling+0.216
atted+0.215
ealing+0.213
ishment+0.206
Neg logit contributions
fire-0.082
 distance-0.075
 usual-0.070
|-0.067
 hunt-0.064
Token worry
Token ID5490
Feature activation+1.07e-01

Pos logit contributions
lee+0.006
atted+0.006
ealing+0.006
ishment+0.006
elling+0.006
Neg logit contributions
fire-0.009
 distance-0.008
 usual-0.008
|-0.008
To-0.008
Token,
Token ID11
Feature activation+2.81e-02

Pos logit contributions
lee+0.237
atted+0.234
ealing+0.230
elling+0.230
ishment+0.225
Neg logit contributions
fire-0.172
 distance-0.161
|-0.153
 usual-0.152
 hunt-0.147
Token I
Token ID314
Feature activation-2.83e-02

Pos logit contributions
lee+0.063
atted+0.062
elling+0.061
ealing+0.061
ishment+0.059
Neg logit contributions
fire-0.044
 distance-0.041
|-0.040
 usual-0.039
 hunt-0.037
Token found
Token ID1043
Feature activation+2.25e-02

Pos logit contributions
fire+0.046
 distance+0.043
 usual+0.041
|+0.040
 ride+0.039
Neg logit contributions
lee-0.062
atted-0.061
ealing-0.061
elling-0.060
ished-0.058
Token a
Token ID257
Feature activation-6.16e-02

Pos logit contributions
elling+0.049
lee+0.049
atted+0.049
ealing+0.047
ished+0.047
Neg logit contributions
fire-0.037
 distance-0.035
|-0.034
 usual-0.032
 ride-0.031
Token lot
Token ID1256
Feature activation-4.21e-02

Pos logit contributions
fire+0.118
|+0.115
 distance+0.099
To+0.098
 usual+0.098
Neg logit contributions
ealing-0.103
lee-0.101
 mistakes-0.100
ubby-0.100
ishment-0.098
Token safe
Token ID3338
Feature activation+3.56e-02

Pos logit contributions
lee+0.043
atted+0.043
ealing+0.042
elling+0.042
ishment+0.041
Neg logit contributions
fire-0.026
 distance-0.025
 usual-0.025
|-0.024
 hunt-0.023
Token and
Token ID290
Feature activation-5.64e-02

Pos logit contributions
lee+0.064
atted+0.064
ishment+0.060
ealing+0.060
elling+0.060
Neg logit contributions
fire-0.069
|-0.066
 distance-0.066
 usual-0.065
 ride-0.063
Token sound
Token ID2128
Feature activation+2.12e-02

Pos logit contributions
fire+0.095
|+0.094
 distance+0.091
 usual+0.089
To+0.084
Neg logit contributions
 opinions-0.107
atted-0.107
ealing-0.105
lee-0.104
elling-0.102
Token.
Token ID13
Feature activation-1.10e-02

Pos logit contributions
atted+0.049
ealing+0.049
lee+0.048
elling+0.047
ubby+0.046
Neg logit contributions
|-0.031
 distance-0.030
 hunt-0.029
fire-0.029
 seek-0.028
Token<|endoftext|>
Token ID50256
Feature activation+3.14e-02

Pos logit contributions
fire+0.012
 distance+0.010
 usual+0.010
 hunt+0.009
|+0.009
Neg logit contributions
lee-0.031
atted-0.030
elling-0.030
ealing-0.030
ishment-0.029
TokenL
Token ID43
Feature activation+9.79e-02

Pos logit contributions
ubby+0.055
elling+0.054
els+0.054
atted+0.052
lee+0.052
Neg logit contributions
fire-0.058
 usual-0.055
 distance-0.053
 hunt-0.051
 drive-0.050
Tokenily
Token ID813
Feature activation+8.47e-02

Pos logit contributions
ishment+0.173
ealing+0.172
atted+0.171
elling+0.169
lee+0.168
Neg logit contributions
fire-0.188
|-0.186
 usual-0.184
 distance-0.184
 ride-0.176
Token and
Token ID290
Feature activation-7.78e-03

Pos logit contributions
lee+0.207
elling+0.207
atted+0.206
ealing+0.205
ishment+0.203
Neg logit contributions
fire-0.111
 distance-0.104
 ride-0.100
|-0.099
 hunt-0.097
Token Ben
Token ID3932
Feature activation+9.95e-02

Pos logit contributions
fire+0.013
 distance+0.012
 usual+0.011
|+0.011
 ride+0.011
Neg logit contributions
lee-0.017
atted-0.017
elling-0.017
ealing-0.017
ishment-0.016
Token were
Token ID547
Feature activation+2.93e-02

Pos logit contributions
lee+0.267
atted+0.265
elling+0.263
ealing+0.258
ishment+0.258
Neg logit contributions
fire-0.111
 distance-0.102
|-0.096
 usual-0.094
 ride-0.094
Token going
Token ID1016
Feature activation+2.36e-02

Pos logit contributions
elling+0.061
lee+0.061
ishment+0.060
atted+0.059
ealing+0.058
Neg logit contributions
 usual-0.046
fire-0.046
 distance-0.044
 ride-0.044
|-0.044
Token lot
Token ID1256
Feature activation-3.47e-02

Pos logit contributions
fire+0.109
|+0.097
 distance+0.096
 usual+0.089
To+0.087
Neg logit contributions
ealing-0.104
ubby-0.103
lee-0.103
atted-0.101
 opinions-0.100
Token of
Token ID286
Feature activation-5.36e-02

Pos logit contributions
field+0.057
fire+0.057
 usual+0.056
 distance+0.055
 hunt+0.054
Neg logit contributions
elling-0.073
atted-0.069
 opinions-0.066
lee-0.066
ished-0.065
Token fun
Token ID1257
Feature activation+2.30e-02

Pos logit contributions
fire+0.083
 distance+0.076
 usual+0.072
|+0.071
 hunt+0.068
Neg logit contributions
lee-0.121
atted-0.121
elling-0.119
ealing-0.118
ishment-0.116
Token.
Token ID13
Feature activation-4.00e-03

Pos logit contributions
lee+0.054
atted+0.054
els+0.053
 opinions+0.053
ishment+0.052
Neg logit contributions
 distance-0.032
fire-0.031
|-0.030
 usual-0.029
To-0.025
Token<|endoftext|>
Token ID50256
Feature activation+3.25e-02

Pos logit contributions
fire+0.006
 distance+0.006
 hunt+0.005
 usual+0.005
 gall+0.005
Neg logit contributions
lee-0.009
ealing-0.009
elling-0.009
atted-0.009
 opinions-0.009
TokenTim
Token ID14967
Feature activation+9.10e-02

Pos logit contributions
elling+0.069
ubby+0.068
els+0.067
atted+0.067
lee+0.067
Neg logit contributions
fire-0.052
 usual-0.048
 distance-0.048
 hunt-0.043
 ride-0.043
Token and
Token ID290
Feature activation-1.88e-02

Pos logit contributions
lee+0.195
ealing+0.193
ishment+0.191
atted+0.191
elling+0.191
Neg logit contributions
 ride-0.139
 distance-0.137
fire-0.137
|-0.137
 hunt-0.133
Token Tom
Token ID4186
Feature activation+8.15e-02

Pos logit contributions
fire+0.026
 distance+0.022
 usual+0.021
|+0.020
 hunt+0.019
Neg logit contributions
lee-0.047
atted-0.046
elling-0.045
ealing-0.044
 opinions-0.044
Token were
Token ID547
Feature activation+2.39e-02

Pos logit contributions
lee+0.206
atted+0.205
elling+0.201
ealing+0.199
ishment+0.198
Neg logit contributions
fire-0.104
 distance-0.096
|-0.091
 usual-0.090
 ride-0.089
Token twins
Token ID20345
Feature activation-2.06e-02

Pos logit contributions
elling+0.050
lee+0.050
ishment+0.049
atted+0.049
ealing+0.048
Neg logit contributions
 usual-0.037
fire-0.037
 ride-0.036
 distance-0.036
|-0.035
Token who
Token ID508
Feature activation-9.69e-03

Pos logit contributions
fire+0.033
|+0.032
 distance+0.030
To+0.029
 seek+0.028
Neg logit contributions
lee-0.045
atted-0.044
ealing-0.042
ishment-0.042
els-0.039
Token worry
Token ID5490
Feature activation+1.02e-01

Pos logit contributions
elling+0.118
atted+0.112
ished+0.112
ishment+0.111
ealing+0.110
Neg logit contributions
fire-0.104
 usual-0.102
 ride-0.100
 distance-0.099
 rest-0.097
Token disappeared
Token ID12120
Feature activation+3.76e-02

Pos logit contributions
lee+0.233
atted+0.230
elling+0.223
ealing+0.223
ishment+0.217
Neg logit contributions
fire-0.159
 distance-0.147
|-0.142
 usual-0.139
 hunt-0.135
Token,
Token ID11
Feature activation-6.59e-03

Pos logit contributions
lee+0.086
atted+0.084
elling+0.081
ishment+0.081
ealing+0.081
Neg logit contributions
fire-0.058
|-0.054
 distance-0.052
 ride-0.050
 usual-0.049
Token and
Token ID290
Feature activation-3.13e-02

Pos logit contributions
fire+0.012
|+0.011
 distance+0.011
 usual+0.011
 hunt+0.010
Neg logit contributions
lee-0.013
atted-0.013
elling-0.012
ishment-0.012
ealing-0.012
Token she
Token ID673
Feature activation-9.62e-04

Pos logit contributions
fire+0.052
 distance+0.047
|+0.046
 usual+0.045
 hunt+0.044
Neg logit contributions
lee-0.069
atted-0.068
ishment-0.064
elling-0.063
ished-0.062
Token smiled
Token ID13541
Feature activation+9.60e-02

Pos logit contributions
fire+0.002
 distance+0.001
 usual+0.001
|+0.001
 ride+0.001
Neg logit contributions
lee-0.002
atted-0.002
ealing-0.002
elling-0.002
ishment-0.002
Token.
Token ID13
Feature activation-1.95e-02

Pos logit contributions
lee+0.230
atted+0.228
elling+0.223
ealing+0.222
ishment+0.219
Neg logit contributions
fire-0.138
 distance-0.127
|-0.121
 usual-0.120
 ride-0.116
Token
Token ID198
Feature activation-4.66e-02

Pos logit contributions
fire+0.022
 distance+0.020
 usual+0.018
|+0.018
 ride+0.017
Neg logit contributions
lee-0.053
atted-0.052
elling-0.052
ealing-0.051
ishment-0.051
Token
Token ID198
Feature activation-3.42e-02

Pos logit contributions
fire+0.048
 distance+0.042
 usual+0.040
 hunt+0.038
 ride+0.036
Neg logit contributions
lee-0.133
atted-0.128
ealing-0.126
elling-0.124
oking-0.121
TokenDa
Token ID26531
Feature activation+7.39e-02

Pos logit contributions
fire+0.050
 distance+0.046
 usual+0.046
 hunt+0.043
|+0.043
Neg logit contributions
lee-0.080
atted-0.079
ealing-0.078
elling-0.077
ishment-0.075
Tokenisy
Token ID13560
Feature activation+3.76e-02

Pos logit contributions
lee+0.128
atted+0.126
elling+0.124
ealing+0.122
ishment+0.119
Neg logit contributions
fire-0.154
 distance-0.146
 usual-0.141
|-0.140
 ride-0.137
Token Mom
Token ID11254
Feature activation+4.05e-02

Pos logit contributions
lee+0.001
atted+0.001
ealing+0.001
 opinions+0.001
elling+0.001
Neg logit contributions
fire-0.001
|-0.001
 hunt-0.001
 distance-0.001
field-0.001
Token said
Token ID531
Feature activation+6.74e-02

Pos logit contributions
elling+0.087
lee+0.086
ealing+0.086
ishment+0.086
atted+0.085
Neg logit contributions
fire-0.066
 distance-0.060
 he-0.060
 ride-0.059
|-0.057
Token.
Token ID13
Feature activation+1.58e-02

Pos logit contributions
lee+0.201
elling+0.200
atted+0.199
ealing+0.197
ishment+0.196
Neg logit contributions
fire-0.056
 distance-0.046
 usual-0.043
 ride-0.041
|-0.040
Token She
Token ID1375
Feature activation+6.58e-04

Pos logit contributions
elling+0.039
atted+0.039
lee+0.038
ealing+0.038
ished+0.038
Neg logit contributions
 distance-0.021
fire-0.021
 he-0.020
To-0.019
 ride-0.019
Token was
Token ID373
Feature activation+4.81e-02

Pos logit contributions
elling+0.002
ealing+0.002
ishment+0.002
lee+0.002
atted+0.002
Neg logit contributions
fire-0.001
 ride-0.001
 distance-0.001
 he-0.001
 usual-0.001
Token shocked
Token ID11472
Feature activation+9.04e-02

Pos logit contributions
lee+0.104
atted+0.103
elling+0.103
ealing+0.101
ishment+0.100
Neg logit contributions
fire-0.079
 distance-0.075
 usual-0.071
|-0.069
 ride-0.069
Token and
Token ID290
Feature activation-2.70e-02

Pos logit contributions
lee+0.235
ealing+0.234
elling+0.233
atted+0.232
ishment+0.227
Neg logit contributions
fire-0.100
 he-0.100
 distance-0.100
 ride-0.089
 usual-0.088
Token worried
Token ID7960
Feature activation+9.07e-02

Pos logit contributions
fire+0.058
 distance+0.056
 usual+0.054
 he+0.053
 ride+0.053
Neg logit contributions
lee-0.044
elling-0.043
atted-0.043
ishment-0.042
ealing-0.042
Token.
Token ID13
Feature activation+8.04e-03

Pos logit contributions
atted+0.231
lee+0.231
ealing+0.225
elling+0.223
ishment+0.218
Neg logit contributions
fire-0.116
|-0.105
 distance-0.105
 usual-0.096
 hunt-0.096
Token She
Token ID1375
Feature activation-5.67e-03

Pos logit contributions
atted+0.021
lee+0.021
elling+0.021
ishment+0.021
ished+0.021
Neg logit contributions
fire-0.009
 distance-0.009
 he-0.008
To-0.008
 usual-0.008
Token took
Token ID1718
Feature activation+4.11e-02

Pos logit contributions
fire+0.007
 distance+0.007
 ride+0.006
 he+0.006
 usual+0.006
Neg logit contributions
elling-0.014
ishment-0.014
ealing-0.014
atted-0.014
lee-0.014
Token's
Token ID338
Feature activation+3.99e-02

Pos logit contributions
lee+0.174
atted+0.169
ealing+0.165
elling+0.162
ished+0.161
Neg logit contributions
fire-0.120
 distance-0.113
|-0.108
 usual-0.107
 ride-0.099
Token mom
Token ID1995
Feature activation+2.26e-02

Pos logit contributions
lee+0.074
atted+0.073
elling+0.072
ealing+0.071
ishment+0.070
Neg logit contributions
fire-0.078
 distance-0.074
 usual-0.071
|-0.070
 ride-0.069
Token brought
Token ID3181
Feature activation+3.22e-02

Pos logit contributions
lee+0.050
atted+0.049
elling+0.049
ealing+0.048
ishment+0.048
Neg logit contributions
fire-0.036
 distance-0.034
 usual-0.032
|-0.032
 ride-0.032
Token it
Token ID340
Feature activation-3.79e-02

Pos logit contributions
atted+0.082
lee+0.082
elling+0.081
ealing+0.080
ishment+0.079
Neg logit contributions
fire-0.040
 distance-0.037
 usual-0.035
 ride-0.034
|-0.034
Token to
Token ID284
Feature activation-4.98e-02

Pos logit contributions
fire+0.060
|+0.058
 distance+0.056
Ro+0.051
 gall+0.050
Neg logit contributions
ished-0.080
atted-0.078
lee-0.078
elling-0.076
ealing-0.073
Token her
Token ID607
Feature activation+2.04e-02

Pos logit contributions
fire+0.087
|+0.079
 distance+0.078
tr+0.074
 usual+0.071
Neg logit contributions
lee-0.105
ishment-0.097
 opinions-0.093
 requests-0.092
elling-0.090
Token and
Token ID290
Feature activation-2.93e-02

Pos logit contributions
lee+0.042
atted+0.042
ishment+0.041
 opinions+0.040
ealing+0.039
Neg logit contributions
fire-0.033
|-0.032
 distance-0.031
To-0.029
 usual-0.029
Token asked
Token ID1965
Feature activation+4.17e-02

Pos logit contributions
fire+0.050
 distance+0.047
 usual+0.045
|+0.044
 ride+0.043
Neg logit contributions
lee-0.062
atted-0.062
elling-0.060
ealing-0.060
ishment-0.059
Token,
Token ID11
Feature activation+1.56e-02

Pos logit contributions
elling+0.099
atted+0.097
ealing+0.096
ishment+0.095
lee+0.094
Neg logit contributions
fire-0.056
 usual-0.055
 he-0.054
 distance-0.052
 ride-0.051
Token "
Token ID366
Feature activation+1.94e-02

Pos logit contributions
lee+0.031
atted+0.031
elling+0.030
ealing+0.030
ishment+0.029
Neg logit contributions
fire-0.029
 distance-0.027
|-0.026
 usual-0.026
 hunt-0.025
TokenHere
Token ID4342
Feature activation+2.99e-02

Pos logit contributions
elling+0.042
lee+0.041
atted+0.041
ishment+0.041
ealing+0.041
Neg logit contributions
fire-0.031
 usual-0.028
 distance-0.028
|-0.027
 hunt-0.026
Token box
Token ID3091
Feature activation-7.22e-02

Pos logit contributions
lee+0.095
atted+0.094
elling+0.093
ealing+0.092
ishment+0.090
Neg logit contributions
fire-0.062
 distance-0.057
 usual-0.054
 ride-0.054
 hunt-0.053
Token,
Token ID11
Feature activation-8.61e-03

Pos logit contributions
fire+0.102
|+0.093
 distance+0.092
 usual+0.088
 hunt+0.084
Neg logit contributions
lee-0.173
atted-0.172
elling-0.167
ealing-0.164
ishment-0.162
Token and
Token ID290
Feature activation-3.63e-02

Pos logit contributions
fire+0.013
 distance+0.012
 usual+0.011
 ride+0.011
 hunt+0.011
Neg logit contributions
lee-0.020
atted-0.020
elling-0.020
ealing-0.020
ishment-0.019
Token told
Token ID1297
Feature activation+4.08e-02

Pos logit contributions
fire+0.052
|+0.050
 distance+0.050
 usual+0.046
 hunt+0.044
Neg logit contributions
lee-0.086
atted-0.085
elling-0.081
ished-0.081
ishment-0.080
Token her
Token ID607
Feature activation+5.01e-02

Pos logit contributions
lee+0.091
atted+0.089
elling+0.088
ealing+0.086
ishment+0.085
Neg logit contributions
fire-0.067
 distance-0.062
|-0.060
 usual-0.059
 hunt-0.057
Token the
Token ID262
Feature activation+1.34e-02

Pos logit contributions
lee+0.119
atted+0.113
ealing+0.109
elling+0.105
 opinions+0.105
Neg logit contributions
fire-0.079
 distance-0.077
|-0.074
 hunt-0.065
 usual-0.065
Token box
Token ID3091
Feature activation-4.68e-02

Pos logit contributions
atted+0.028
lee+0.028
elling+0.027
ealing+0.027
ishment+0.026
Neg logit contributions
fire-0.024
 distance-0.022
 usual-0.022
 ride-0.021
 hunt-0.021
Token contained
Token ID7763
Feature activation+1.17e-02

Pos logit contributions
fire+0.055
 distance+0.050
 usual+0.047
|+0.046
 ride+0.045
Neg logit contributions
lee-0.123
atted-0.122
elling-0.120
ealing-0.120
ishment-0.118
Token her
Token ID607
Feature activation+1.20e-02

Pos logit contributions
lee+0.030
atted+0.029
elling+0.029
ealing+0.029
ishment+0.028
Neg logit contributions
fire-0.015
 distance-0.014
|-0.013
 usual-0.013
 ride-0.012
Token treasure
Token ID14068
Feature activation-5.53e-02

Pos logit contributions
lee+0.026
atted+0.026
elling+0.025
ishment+0.025
ealing+0.025
Neg logit contributions
fire-0.019
 distance-0.018
|-0.018
 usual-0.017
 ride-0.016
Token.
Token ID13
Feature activation-2.14e-02

Pos logit contributions
fire+0.067
|+0.064
 distance+0.063
 usual+0.058
 ride+0.054
Neg logit contributions
lee-0.142
atted-0.139
elling-0.136
ealing-0.135
ishment-0.134
Token
Token ID198
Feature activation-5.34e-02

Pos logit contributions
fire+0.091
 distance+0.075
 usual+0.074
 hunt+0.073
field+0.067
Neg logit contributions
lee-0.163
atted-0.160
ealing-0.154
elling-0.153
oking-0.149
TokenL
Token ID43
Feature activation+8.32e-02

Pos logit contributions
fire+0.064
 distance+0.058
 usual+0.054
|+0.054
 ride+0.052
Neg logit contributions
lee-0.141
atted-0.140
elling-0.138
ealing-0.137
ishment-0.136
Tokenily
Token ID813
Feature activation+5.78e-02

Pos logit contributions
ishment+0.172
lee+0.164
atted+0.163
ealing+0.161
ening+0.151
Neg logit contributions
|-0.145
 rest-0.131
fire-0.131
To-0.130
he-0.125
Token started
Token ID2067
Feature activation+2.07e-02

Pos logit contributions
lee+0.130
atted+0.129
elling+0.128
ealing+0.127
ishment+0.125
Neg logit contributions
fire-0.090
 distance-0.083
 usual-0.080
|-0.079
 ride-0.077
Token to
Token ID284
Feature activation-5.33e-02

Pos logit contributions
atted+0.055
lee+0.055
ealing+0.051
elling+0.051
ished+0.050
Neg logit contributions
fire-0.027
 distance-0.021
|-0.021
 usual-0.020
 hunt-0.020
Token cry
Token ID3960
Feature activation+6.64e-02

Pos logit contributions
fire+0.130
 distance+0.114
 usual+0.114
To+0.111
|+0.110
Neg logit contributions
lee-0.076
atted-0.074
ealing-0.072
 requests-0.070
ishment-0.070
Token,
Token ID11
Feature activation-4.74e-04

Pos logit contributions
lee+0.147
ealing+0.144
atted+0.143
elling+0.141
ishment+0.139
Neg logit contributions
fire-0.110
 distance-0.102
|-0.101
 usual-0.097
 hunt-0.093
Token but
Token ID475
Feature activation+7.36e-03

Pos logit contributions
fire+0.001
 distance+0.001
|+0.001
 usual+0.001
 hunt+0.001
Neg logit contributions
lee-0.001
atted-0.001
ealing-0.001
elling-0.001
ishment-0.001
Token a
Token ID257
Feature activation-3.51e-02

Pos logit contributions
lee+0.015
atted+0.015
elling+0.015
ealing+0.015
ishment+0.014
Neg logit contributions
fire-0.013
 distance-0.012
|-0.012
 hunt-0.011
 usual-0.011
Token kind
Token ID1611
Feature activation+5.79e-02

Pos logit contributions
|+0.080
fire+0.078
field+0.073
 usual+0.073
 hunt+0.072
Neg logit contributions
ealing-0.047
ubby-0.045
lee-0.044
atted-0.043
 requests-0.042
Token lady
Token ID10846
Feature activation-6.84e-03

Pos logit contributions
lee+0.116
elling+0.114
atted+0.114
ealing+0.111
ubby+0.108
Neg logit contributions
fire-0.104
 usual-0.095
|-0.095
 distance-0.095
 hunt-0.091
Token noise
Token ID7838
Feature activation-1.23e-02

Pos logit contributions
fire+0.109
 distance+0.100
|+0.098
 usual+0.098
 hunt+0.094
Neg logit contributions
lee-0.097
atted-0.097
ealing-0.095
elling-0.095
 opinions-0.091
Token.
Token ID13
Feature activation-2.75e-02

Pos logit contributions
|+0.021
 distance+0.019
fire+0.019
field+0.019
 usual+0.018
Neg logit contributions
lee-0.025
atted-0.025
ealing-0.025
oking-0.025
ening-0.024
Token
Token ID198
Feature activation-5.99e-02

Pos logit contributions
fire+0.047
 distance+0.044
 usual+0.041
 hunt+0.039
|+0.037
Neg logit contributions
lee-0.060
atted-0.059
elling-0.057
ealing-0.057
ishment-0.055
Token
Token ID198
Feature activation-4.68e-02

Pos logit contributions
fire+0.085
 distance+0.077
 usual+0.071
 hunt+0.070
field+0.065
Neg logit contributions
lee-0.140
atted-0.136
ealing-0.133
oking-0.131
elling-0.128
TokenTim
Token ID14967
Feature activation+7.35e-02

Pos logit contributions
fire+0.075
 distance+0.072
 usual+0.071
 hunt+0.067
|+0.067
Neg logit contributions
ealing-0.096
lee-0.093
oking-0.093
atted-0.091
elling-0.091
Token said
Token ID531
Feature activation+7.08e-02

Pos logit contributions
lee+0.163
atted+0.162
ealing+0.156
elling+0.156
ished+0.154
Neg logit contributions
fire-0.120
 distance-0.109
 usual-0.105
|-0.104
 hunt-0.099
Token,
Token ID11
Feature activation+2.84e-02

Pos logit contributions
atted+0.158
lee+0.156
ealing+0.153
elling+0.150
ished+0.145
Neg logit contributions
fire-0.116
 distance-0.110
|-0.106
 usual-0.101
 hunt-0.100
Token "
Token ID366
Feature activation+3.73e-02

Pos logit contributions
lee+0.044
atted+0.042
ealing+0.041
elling+0.039
oking+0.038
Neg logit contributions
 distance-0.064
fire-0.063
|-0.063
 usual-0.059
 hunt-0.058
TokenMax
Token ID11518
Feature activation+8.84e-02

Pos logit contributions
elling+0.082
ealing+0.081
ishment+0.080
lee+0.080
atted+0.079
Neg logit contributions
fire-0.058
 usual-0.053
 hunt-0.051
 distance-0.050
|-0.047
Token,
Token ID11
Feature activation+4.73e-02

Pos logit contributions
lee+0.219
elling+0.218
atted+0.215
ishment+0.214
ealing+0.208
Neg logit contributions
fire-0.118
 he-0.104
 ride-0.103
 distance-0.102
 hunt-0.102
Token please
Token ID3387
Feature activation+2.27e-02

Pos logit contributions
lee+0.098
atted+0.097
elling+0.096
ealing+0.096
oking+0.093
Neg logit contributions
fire-0.080
 distance-0.076
|-0.072
 usual-0.070
To-0.066
Token.
Token ID13
Feature activation-8.89e-03

Pos logit contributions
atted+0.205
lee+0.201
elling+0.201
ished+0.197
ealing+0.196
Neg logit contributions
fire-0.118
 distance-0.113
|-0.113
 usual-0.104
 hunt-0.098
Token
Token ID198
Feature activation-3.63e-02

Pos logit contributions
fire+0.010
 distance+0.009
 usual+0.009
|+0.009
 ride+0.008
Neg logit contributions
lee-0.024
atted-0.024
elling-0.023
ealing-0.023
ishment-0.023
Token
Token ID198
Feature activation-2.36e-02

Pos logit contributions
fire+0.043
 distance+0.039
 usual+0.039
 hunt+0.036
|+0.035
Neg logit contributions
lee-0.095
atted-0.093
ealing-0.091
elling-0.089
oking-0.087
Token"
Token ID1
Feature activation+3.47e-02

Pos logit contributions
 usual+0.041
fire+0.041
 distance+0.039
|+0.039
 hunt+0.038
Neg logit contributions
ealing-0.045
atted-0.045
lee-0.045
elling-0.044
ubby-0.043
TokenJu
Token ID33018
Feature activation+3.89e-02

Pos logit contributions
lee+0.080
atted+0.079
elling+0.078
ealing+0.078
ishment+0.076
Neg logit contributions
fire-0.052
 distance-0.048
 usual-0.046
|-0.046
 ride-0.044
Tokenice
Token ID501
Feature activation+1.53e-02

Pos logit contributions
lee+0.038
atted+0.037
ishment+0.037
ealing+0.036
elling+0.035
Neg logit contributions
fire-0.106
 hunt-0.099
|-0.099
 usual-0.099
 distance-0.098
Token is
Token ID318
Feature activation+2.54e-02

Pos logit contributions
lee+0.036
atted+0.035
elling+0.035
ealing+0.035
ishment+0.034
Neg logit contributions
fire-0.023
 distance-0.021
 usual-0.020
|-0.020
 ride-0.019
Token y
Token ID331
Feature activation+6.96e-02

Pos logit contributions
lee+0.056
atted+0.055
elling+0.054
ealing+0.054
ishment+0.053
Neg logit contributions
fire-0.042
 distance-0.038
 usual-0.037
|-0.036
 ride-0.036
Tokenummy
Token ID13513
Feature activation+4.77e-02

Pos logit contributions
lee+0.088
atted+0.086
ishment+0.082
ealing+0.080
 opinions+0.077
Neg logit contributions
fire-0.168
 distance-0.162
|-0.160
 usual-0.156
 hunt-0.156
Token,"
Token ID553
Feature activation+5.65e-03

Pos logit contributions
lee+0.118
atted+0.117
elling+0.115
ealing+0.114
ishment+0.113
Neg logit contributions
fire-0.064
 distance-0.058
 usual-0.054
|-0.053
 ride-0.053
Token Anna
Token ID11735
Feature activation+6.96e-02

Pos logit contributions
lee+0.012
atted+0.012
elling+0.012
ealing+0.012
ishment+0.011
Neg logit contributions
fire-0.010
 distance-0.009
 usual-0.009
|-0.009
 ride-0.008
Token that
Token ID326
Feature activation-2.07e-02

Pos logit contributions
lee+0.098
atted+0.098
elling+0.095
ealing+0.094
ishment+0.092
Neg logit contributions
fire-0.053
|-0.048
 distance-0.047
 usual-0.045
 hunt-0.044
Token it
Token ID340
Feature activation-1.46e-02

Pos logit contributions
fire+0.030
|+0.029
 distance+0.025
To+0.024
 hunt+0.024
Neg logit contributions
lee-0.048
atted-0.047
 opinions-0.046
ealing-0.046
ishment-0.046
Token was
Token ID373
Feature activation+2.10e-02

Pos logit contributions
fire+0.021
 distance+0.019
|+0.018
 usual+0.018
 ride+0.017
Neg logit contributions
lee-0.035
atted-0.035
elling-0.033
ealing-0.033
ished-0.033
Token no
Token ID645
Feature activation+1.14e-02

Pos logit contributions
lee+0.048
atted+0.047
ealing+0.046
elling+0.046
ishment+0.045
Neg logit contributions
fire-0.033
|-0.030
 distance-0.029
 hunt-0.028
 usual-0.027
Token longer
Token ID2392
Feature activation-7.76e-03

Pos logit contributions
atted+0.026
lee+0.026
elling+0.026
ealing+0.026
ishment+0.025
Neg logit contributions
fire-0.017
 distance-0.016
|-0.015
 usual-0.015
 hunt-0.014
Token making
Token ID1642
Feature activation+3.63e-03

Pos logit contributions
fire+0.014
|+0.013
 distance+0.013
 usual+0.012
 hunt+0.012
Neg logit contributions
lee-0.016
atted-0.015
ealing-0.015
elling-0.015
ishment-0.015
Token noise
Token ID7838
Feature activation-1.62e-02

Pos logit contributions
lee+0.008
atted+0.008
ealing+0.008
elling+0.008
ishment+0.007
Neg logit contributions
fire-0.006
 distance-0.006
|-0.005
 usual-0.005
 hunt-0.005
Token.
Token ID13
Feature activation-7.73e-03

Pos logit contributions
fire+0.021
|+0.021
 distance+0.020
 usual+0.019
 hunt+0.019
Neg logit contributions
lee-0.039
atted-0.039
ealing-0.039
elling-0.038
ishment-0.037
Token
Token ID198
Feature activation-4.92e-02

Pos logit contributions
fire+0.009
 distance+0.008
 usual+0.008
|+0.008
 he+0.008
Neg logit contributions
lee-0.020
atted-0.020
elling-0.020
ealing-0.020
ishment-0.020
Token
Token ID198
Feature activation-3.83e-02

Pos logit contributions
fire+0.047
 distance+0.041
 usual+0.039
|+0.036
 hunt+0.035
Neg logit contributions
lee-0.143
atted-0.141
ealing-0.138
elling-0.136
ished-0.132
Token"
Token ID1
Feature activation+3.29e-02

Pos logit contributions
fire+0.064
 distance+0.058
 ride+0.055
To+0.054
|+0.054
Neg logit contributions
atted-0.084
lee-0.084
elling-0.083
ishment-0.082
ealing-0.080
Token She
Token ID1375
Feature activation-1.77e-02

Pos logit contributions
fire+0.044
 distance+0.041
 usual+0.039
|+0.039
 hunt+0.038
Neg logit contributions
lee-0.050
atted-0.049
elling-0.048
ealing-0.048
ishment-0.047
Token hugged
Token ID41130
Feature activation+4.70e-02

Pos logit contributions
fire+0.030
 distance+0.028
 usual+0.027
|+0.027
 hunt+0.026
Neg logit contributions
lee-0.038
atted-0.037
elling-0.036
ealing-0.036
ishment-0.035
Token her
Token ID607
Feature activation+3.74e-02

Pos logit contributions
lee+0.104
atted+0.104
elling+0.102
ealing+0.101
ishment+0.099
Neg logit contributions
fire-0.076
 distance-0.070
 usual-0.067
|-0.066
 hunt-0.064
Token t
Token ID256
Feature activation+2.60e-02

Pos logit contributions
atted+0.065
lee+0.065
ealing+0.063
elling+0.062
ishment+0.062
Neg logit contributions
fire-0.078
|-0.073
 distance-0.072
To-0.069
 hunt-0.069
Tokeneddy
Token ID21874
Feature activation-3.82e-04

Pos logit contributions
lee+0.044
atted+0.044
elling+0.043
ealing+0.042
ishment+0.041
Neg logit contributions
fire-0.055
 distance-0.052
 usual-0.051
|-0.050
 ride-0.049
Token bear
Token ID6842
Feature activation-2.74e-02

Pos logit contributions
fire+0.001
 distance+0.001
 usual+0.000
|+0.000
 hunt+0.000
Neg logit contributions
lee-0.001
atted-0.001
elling-0.001
ealing-0.001
ishment-0.001
Token and
Token ID290
Feature activation-2.65e-02

Pos logit contributions
fire+0.060
|+0.056
 distance+0.056
To+0.055
 usual+0.055
Neg logit contributions
atted-0.044
lee-0.043
ealing-0.040
elling-0.039
ished-0.039
Token went
Token ID1816
Feature activation+2.22e-02

Pos logit contributions
fire+0.053
|+0.049
 distance+0.048
 usual+0.047
To+0.045
Neg logit contributions
lee-0.050
atted-0.049
ishment-0.045
 opinions-0.044
ished-0.044
Token to
Token ID284
Feature activation-4.87e-02

Pos logit contributions
lee+0.041
atted+0.041
ished+0.037
ealing+0.036
elling+0.036
Neg logit contributions
fire-0.043
 usual-0.040
 distance-0.038
|-0.038
field-0.037
Token sleep
Token ID3993
Feature activation-1.72e-02

Pos logit contributions
fire+0.078
 distance+0.066
boat+0.063
To+0.062
t+0.062
Neg logit contributions
lee-0.110
 requests-0.108
 mistakes-0.103
atted-0.101
 opinions-0.101
Token in
Token ID287
Feature activation-4.21e-02

Pos logit contributions
|+0.028
 distance+0.027
fire+0.025
 usual+0.025
tr+0.025
Neg logit contributions
lee-0.035
atted-0.033
ealing-0.031
ishment-0.031
 mistakes-0.031
Token a
Token ID257
Feature activation-5.83e-02

Pos logit contributions
lee+0.042
atted+0.041
elling+0.041
ealing+0.040
ishment+0.040
Neg logit contributions
fire-0.018
 distance-0.016
|-0.015
 usual-0.014
 ride-0.014
Token broken
Token ID5445
Feature activation+1.24e-02

Pos logit contributions
|+0.103
fire+0.097
 usual+0.087
 distance+0.087
To+0.083
Neg logit contributions
ubby-0.109
lee-0.109
ealing-0.108
elling-0.103
atted-0.102
Token button
Token ID4936
Feature activation-3.53e-02

Pos logit contributions
elling+0.023
lee+0.023
ubby+0.023
atted+0.022
ishment+0.022
Neg logit contributions
|-0.023
fire-0.023
 distance-0.021
 usual-0.020
To-0.020
Token!
Token ID0
Feature activation-2.79e-02

Pos logit contributions
fire+0.051
|+0.048
 distance+0.047
 usual+0.044
To+0.041
Neg logit contributions
lee-0.083
atted-0.083
elling-0.081
els-0.080
ished-0.079
Token
Token ID198
Feature activation-6.29e-02

Pos logit contributions
fire+0.040
 usual+0.036
 distance+0.036
|+0.033
 hunt+0.033
Neg logit contributions
lee-0.069
atted-0.068
ealing-0.065
elling-0.064
ished-0.063
Token
Token ID198
Feature activation-4.90e-02

Pos logit contributions
fire+0.089
 usual+0.079
 distance+0.077
 hunt+0.073
tr+0.073
Neg logit contributions
lee-0.145
atted-0.142
ealing-0.140
oking-0.135
 requests-0.133
TokenThe
Token ID464
Feature activation+1.67e-02

Pos logit contributions
 usual+0.064
|+0.061
fire+0.061
 distance+0.056
 hunt+0.054
Neg logit contributions
ealing-0.114
lee-0.112
oking-0.107
ubby-0.106
atted-0.106
Token little
Token ID1310
Feature activation+7.85e-02

Pos logit contributions
lee+0.038
atted+0.038
elling+0.038
ealing+0.037
ishment+0.036
Neg logit contributions
fire-0.025
 distance-0.023
 usual-0.022
|-0.022
 ride-0.021
Token frog
Token ID21264
Feature activation+4.65e-03

Pos logit contributions
lee+0.197
atted+0.196
elling+0.193
ealing+0.192
ishment+0.189
Neg logit contributions
fire-0.103
 distance-0.094
 usual-0.089
|-0.088
 ride-0.085
Token looked
Token ID3114
Feature activation+4.62e-02

Pos logit contributions
lee+0.010
atted+0.010
ealing+0.010
elling+0.009
ished+0.009
Neg logit contributions
fire-0.008
 distance-0.007
 usual-0.007
|-0.007
 ride-0.007
Token up
Token ID510
Feature activation+2.52e-02

Pos logit contributions
elling+0.106
lee+0.105
atted+0.103
 opinions+0.103
els+0.102
Neg logit contributions
 distance-0.065
 he-0.064
 far-0.064
fire-0.061
 ride-0.060
Token Ben
Token ID3932
Feature activation+6.58e-02

Pos logit contributions
fire+0.031
 usual+0.028
 distance+0.028
 he+0.026
 ride+0.026
Neg logit contributions
lee-0.066
atted-0.066
elling-0.065
ealing-0.065
ishment-0.064
Token had
Token ID550
Feature activation-3.80e-03

Pos logit contributions
lee+0.166
atted+0.165
ealing+0.161
elling+0.160
ishment+0.157
Neg logit contributions
fire-0.085
 distance-0.077
|-0.073
 usual-0.072
 ride-0.067
Token a
Token ID257
Feature activation-6.53e-02

Pos logit contributions
fire+0.006
|+0.005
 distance+0.005
he+0.004
field+0.004
Neg logit contributions
lee-0.009
 opinions-0.009
atted-0.009
els-0.008
 requests-0.008
Token shield
Token ID7614
Feature activation-8.05e-02

Pos logit contributions
fire+0.108
|+0.105
 distance+0.093
 usual+0.092
To+0.090
Neg logit contributions
lee-0.132
ealing-0.131
atted-0.129
ubby-0.128
ishment-0.125
Token.
Token ID13
Feature activation-3.05e-02

Pos logit contributions
|+0.131
fire+0.120
 distance+0.113
 usual+0.112
To+0.106
Neg logit contributions
atted-0.163
lee-0.162
ener-0.159
ishment-0.157
ished-0.157
Token
Token ID198
Feature activation-5.69e-02

Pos logit contributions
fire+0.016
 distance+0.015
 he+0.015
|+0.014
 usual+0.013
Neg logit contributions
elling-0.096
atted-0.096
lee-0.094
ishment-0.094
ealing-0.094
Token
Token ID198
Feature activation-4.26e-02

Pos logit contributions
fire+0.070
 distance+0.060
 hunt+0.057
 usual+0.056
 ride+0.053
Neg logit contributions
lee-0.144
atted-0.142
ealing-0.141
elling-0.138
oking-0.137
Token"
Token ID1
Feature activation+2.85e-02

Pos logit contributions
fire+0.046
 distance+0.041
 usual+0.038
|+0.038
 ride+0.036
Neg logit contributions
lee-0.118
atted-0.117
elling-0.116
ealing-0.115
ishment-0.114
TokenLook
Token ID8567
Feature activation+9.78e-02

Pos logit contributions
elling+0.048
ishment+0.045
 opinions+0.045
oking+0.045
ealing+0.044
Neg logit contributions
fire-0.054
 usual-0.052
 hunt-0.051
 distance-0.048
|-0.047
Token,
Token ID11
Feature activation+3.92e-02

Pos logit contributions
elling+0.232
atted+0.227
lee+0.223
 opinions+0.223
ishment+0.222
Neg logit contributions
 he-0.121
fire-0.119
 ride-0.118
 usual-0.117
 rest-0.114
Token there
Token ID612
Feature activation-2.42e-02

Pos logit contributions
atted+0.084
lee+0.083
elling+0.082
ealing+0.081
ishment+0.080
Neg logit contributions
fire-0.066
 distance-0.060
 usual-0.060
 he-0.059
 ride-0.059
Token,
Token ID11
Feature activation-2.23e-03

Pos logit contributions
fire+0.048
 distance+0.046
|+0.045
 usual+0.043
 ride+0.041
Neg logit contributions
lee-0.081
atted-0.081
ealing-0.080
elling-0.079
ishment-0.078
Token even
Token ID772
Feature activation+5.59e-04

Pos logit contributions
fire+0.004
 distance+0.003
|+0.003
 usual+0.003
 ride+0.003
Neg logit contributions
lee-0.005
atted-0.005
elling-0.005
ealing-0.005
ishment-0.005
Token if
Token ID611
Feature activation+3.00e-02

Pos logit contributions
atted+0.001
lee+0.001
ishment+0.001
elling+0.001
 opinions+0.001
Neg logit contributions
fire-0.001
|-0.001
 distance-0.001
 saf-0.001
field-0.001
Token it
Token ID340
Feature activation+6.84e-03

Pos logit contributions
lee+0.065
atted+0.065
elling+0.064
 opinions+0.063
ealing+0.062
Neg logit contributions
fire-0.047
|-0.046
 distance-0.045
 hunt-0.040
 ride-0.040
Token's
Token ID338
Feature activation+2.90e-02

Pos logit contributions
lee+0.014
atted+0.014
elling+0.014
ealing+0.014
ishment+0.014
Neg logit contributions
fire-0.012
 distance-0.011
 usual-0.011
|-0.010
 ride-0.010
Token just
Token ID655
Feature activation+1.01e-02

Pos logit contributions
atted+0.057
lee+0.057
elling+0.056
ealing+0.056
ishment+0.055
Neg logit contributions
fire-0.053
 distance-0.049
|-0.048
 usual-0.047
 hunt-0.046
Token a
Token ID257
Feature activation-3.96e-02

Pos logit contributions
lee+0.022
atted+0.022
elling+0.022
ealing+0.021
ishment+0.021
Neg logit contributions
fire-0.016
 distance-0.015
|-0.015
 usual-0.014
 ride-0.014
Token little
Token ID1310
Feature activation+5.02e-02

Pos logit contributions
|+0.071
fire+0.068
 usual+0.063
 distance+0.061
whe+0.060
Neg logit contributions
ealing-0.072
ubby-0.070
lee-0.070
atted-0.069
 mistakes-0.069
Token bird
Token ID6512
Feature activation-1.65e-02

Pos logit contributions
atted+0.105
elling+0.103
ishment+0.103
lee+0.103
ealing+0.102
Neg logit contributions
fire-0.081
|-0.079
 distance-0.074
 usual-0.073
To-0.071
Token.
Token ID13
Feature activation-2.21e-02

Pos logit contributions
fire+0.023
 distance+0.022
|+0.021
 usual+0.021
 hunt+0.019
Neg logit contributions
atted-0.039
lee-0.038
ealing-0.038
elling-0.037
ished-0.036
Token<|endoftext|>
Token ID50256
Feature activation+1.44e-02

Pos logit contributions
fire+0.030
 distance+0.028
 usual+0.028
 hunt+0.026
 seek+0.025
Neg logit contributions
lee-0.055
atted-0.054
elling-0.054
ealing-0.052
ishment-0.052
Token so
Token ID523
Feature activation+1.51e-02

Pos logit contributions
lee+0.093
atted+0.093
ealing+0.090
elling+0.088
ishment+0.086
Neg logit contributions
fire-0.069
 distance-0.062
|-0.061
 hunt-0.060
 usual-0.059
Token happy
Token ID3772
Feature activation+8.31e-02

Pos logit contributions
atted+0.029
lee+0.029
ealing+0.028
elling+0.028
ubby+0.027
Neg logit contributions
fire-0.029
 distance-0.027
 hunt-0.026
|-0.026
 usual-0.026
Token that
Token ID326
Feature activation-3.76e-02

Pos logit contributions
atted+0.180
lee+0.180
elling+0.168
ishment+0.165
ealing+0.164
Neg logit contributions
fire-0.137
 distance-0.131
|-0.130
 usual-0.123
 ride-0.115
Token he
Token ID339
Feature activation-3.22e-03

Pos logit contributions
fire+0.066
tr+0.058
|+0.058
 hunt+0.055
ding+0.054
Neg logit contributions
lee-0.068
 requests-0.066
 mistakes-0.065
 opinions-0.064
atted-0.063
Token had
Token ID550
Feature activation+5.78e-03

Pos logit contributions
fire+0.005
 distance+0.005
 usual+0.005
|+0.005
 ride+0.004
Neg logit contributions
lee-0.007
atted-0.007
ealing-0.007
elling-0.007
ished-0.007
Token a
Token ID257
Feature activation-6.84e-02

Pos logit contributions
lee+0.013
atted+0.013
 opinions+0.012
ished+0.012
ealing+0.012
Neg logit contributions
fire-0.009
 distance-0.008
|-0.008
 usual-0.007
 ride-0.007
Token friend
Token ID1545
Feature activation-4.19e-02

Pos logit contributions
fire+0.132
|+0.126
 distance+0.112
To+0.110
whe+0.109
Neg logit contributions
lee-0.122
ealing-0.120
atted-0.119
ubby-0.116
 opinions-0.112
Token.
Token ID13
Feature activation-2.10e-02

Pos logit contributions
fire+0.060
|+0.053
 distance+0.052
 usual+0.051
To+0.046
Neg logit contributions
atted-0.100
lee-0.100
ealing-0.095
ished-0.094
elling-0.093
Token He
Token ID679
Feature activation-1.08e-02

Pos logit contributions
fire+0.034
 distance+0.032
 usual+0.030
|+0.029
 hunt+0.029
Neg logit contributions
lee-0.047
atted-0.047
elling-0.046
ealing-0.045
ishment-0.044
Token was
Token ID373
Feature activation+3.63e-02

Pos logit contributions
fire+0.018
 distance+0.017
 usual+0.016
|+0.016
 ride+0.015
Neg logit contributions
lee-0.023
atted-0.023
elling-0.022
ealing-0.022
ished-0.021
Token glad
Token ID9675
Feature activation+8.21e-02

Pos logit contributions
lee+0.079
atted+0.078
ealing+0.075
elling+0.074
ishment+0.072
Neg logit contributions
fire-0.062
 distance-0.055
|-0.055
 hunt-0.053
 usual-0.051
Token startled
Token ID36249
Feature activation+7.12e-02

Pos logit contributions
elling+0.078
atted+0.077
lee+0.077
ealing+0.076
ishment+0.075
Neg logit contributions
fire-0.039
 distance-0.036
 usual-0.034
 ride-0.032
 hunt-0.032
Token and
Token ID290
Feature activation-4.17e-02

Pos logit contributions
elling+0.159
atted+0.159
lee+0.158
ealing+0.158
ishment+0.152
Neg logit contributions
fire-0.108
 distance-0.103
 he-0.099
 usual-0.096
 ride-0.092
Token quickly
Token ID2952
Feature activation+2.71e-02

Pos logit contributions
fire+0.058
|+0.054
 distance+0.052
 usual+0.050
 ride+0.049
Neg logit contributions
lee-0.102
atted-0.100
ealing-0.095
ishment-0.094
ished-0.092
Token curled
Token ID41108
Feature activation+4.25e-02

Pos logit contributions
lee+0.062
atted+0.062
ealing+0.060
elling+0.059
ishment+0.058
Neg logit contributions
fire-0.042
 distance-0.038
|-0.037
 usual-0.037
 ride-0.035
Token into
Token ID656
Feature activation-3.82e-02

Pos logit contributions
lee+0.068
atted+0.067
ishment+0.064
ealing+0.064
elling+0.064
Neg logit contributions
fire-0.094
 distance-0.089
|-0.087
 usual-0.086
 ride-0.085
Token a
Token ID257
Feature activation-8.59e-02

Pos logit contributions
fire+0.056
|+0.052
 distance+0.050
tr+0.049
To+0.047
Neg logit contributions
lee-0.089
atted-0.085
ishment-0.084
els-0.083
 opinions-0.083
Token ball
Token ID2613
Feature activation-6.00e-02

Pos logit contributions
|+0.140
fire+0.135
 usual+0.129
 Zig+0.120
whe+0.119
Neg logit contributions
lee-0.164
ubby-0.161
ealing-0.160
elling-0.159
atted-0.153
Token.
Token ID13
Feature activation-2.10e-02

Pos logit contributions
|+0.089
fire+0.088
 usual+0.084
 distance+0.081
To+0.079
Neg logit contributions
lee-0.125
ished-0.125
atted-0.124
ishment-0.122
elling-0.120
Token The
Token ID383
Feature activation+1.48e-02

Pos logit contributions
fire+0.033
 distance+0.030
 usual+0.029
|+0.028
 ride+0.027
Neg logit contributions
lee-0.048
atted-0.048
elling-0.047
ealing-0.046
ishment-0.046
Token monkey
Token ID21657
Feature activation-1.08e-02

Pos logit contributions
lee+0.036
atted+0.036
elling+0.036
ealing+0.036
ishment+0.035
Neg logit contributions
fire-0.020
 distance-0.018
|-0.018
 usual-0.017
 ride-0.016
Token smiled
Token ID13541
Feature activation+9.53e-02

Pos logit contributions
fire+0.016
 distance+0.016
 usual+0.016
|+0.015
 hunt+0.014
Neg logit contributions
atted-0.025
lee-0.024
ished-0.023
ealing-0.023
elling-0.022
Token a
Token ID257
Feature activation-7.02e-02

Pos logit contributions
fire+0.048
|+0.040
 hunt+0.039
 distance+0.038
To+0.037
Neg logit contributions
lee-0.075
atted-0.073
elling-0.071
ishment-0.069
 opinions-0.069
Token th
Token ID294
Feature activation+2.82e-02

Pos logit contributions
fire+0.125
|+0.121
 hunt+0.104
 usual+0.102
 Zig+0.100
Neg logit contributions
ealing-0.142
lee-0.138
atted-0.136
ubby-0.133
eal-0.126
Tokenud
Token ID463
Feature activation-1.46e-02

Pos logit contributions
lee+0.047
atted+0.047
ealing+0.047
elling+0.046
ishment+0.046
Neg logit contributions
fire-0.059
 distance-0.057
|-0.055
 usual-0.053
 ride-0.052
Token in
Token ID287
Feature activation-6.40e-02

Pos logit contributions
fire+0.021
|+0.020
 distance+0.019
 usual+0.018
 hunt+0.018
Neg logit contributions
atted-0.035
lee-0.035
elling-0.033
ished-0.033
els-0.033
Token the
Token ID262
Feature activation-3.74e-02

Pos logit contributions
fire+0.113
|+0.098
 distance+0.097
 hunt+0.096
 usual+0.096
Neg logit contributions
lee-0.136
atted-0.133
elling-0.131
ishment-0.129
ealing-0.127
Token sand
Token ID6450
Feature activation-8.06e-02

Pos logit contributions
fire+0.065
|+0.055
 hunt+0.053
 Zig+0.049
 usual+0.048
Neg logit contributions
lee-0.081
ealing-0.081
elling-0.080
atted-0.079
 opinions-0.077
Token!
Token ID0
Feature activation-1.91e-02

Pos logit contributions
fire+0.131
|+0.126
 distance+0.113
 usual+0.112
To+0.110
Neg logit contributions
atted-0.179
lee-0.178
ished-0.167
ishment-0.167
els-0.167
Token
Token ID198
Feature activation-5.75e-02

Pos logit contributions
fire+0.031
 distance+0.028
 usual+0.027
|+0.026
 hunt+0.026
Neg logit contributions
lee-0.043
atted-0.042
elling-0.041
ealing-0.041
ishment-0.040
Token
Token ID198
Feature activation-4.58e-02

Pos logit contributions
fire+0.076
 usual+0.068
 distance+0.067
 hunt+0.061
field+0.059
Neg logit contributions
lee-0.143
atted-0.137
ealing-0.136
elling-0.133
oking-0.132
TokenAmy
Token ID40797
Feature activation+5.16e-02

Pos logit contributions
fire+0.065
 distance+0.060
 usual+0.057
|+0.056
 ride+0.055
Neg logit contributions
lee-0.109
atted-0.108
elling-0.107
ealing-0.106
ishment-0.104
Token was
Token ID373
Feature activation+4.44e-02

Pos logit contributions
elling+0.124
lee+0.123
atted+0.122
ealing+0.122
ishment+0.121
Neg logit contributions
fire-0.070
 distance-0.065
 usual-0.062
 ride-0.062
|-0.061
Token a
Token ID257
Feature activation-5.83e-02

Pos logit contributions
lee+0.031
atted+0.030
elling+0.029
 opinions+0.029
ishment+0.029
Neg logit contributions
fire-0.016
|-0.015
 distance-0.014
 hunt-0.013
 ride-0.013
Token big
Token ID1263
Feature activation+1.26e-02

Pos logit contributions
|+0.102
fire+0.096
 usual+0.088
 distance+0.087
To+0.083
Neg logit contributions
lee-0.112
ealing-0.109
ubby-0.104
atted-0.104
elling-0.103
Token pool
Token ID5933
Feature activation-6.02e-02

Pos logit contributions
lee+0.028
atted+0.028
elling+0.028
ealing+0.027
ishment+0.027
Neg logit contributions
fire-0.020
 distance-0.018
 usual-0.017
|-0.017
 ride-0.017
Token of
Token ID286
Feature activation-7.59e-02

Pos logit contributions
|+0.080
fire+0.078
 usual+0.075
 distance+0.072
 gall+0.069
Neg logit contributions
atted-0.136
lee-0.136
elling-0.134
ealing-0.131
ished-0.130
Token water
Token ID1660
Feature activation-6.29e-02

Pos logit contributions
fire+0.135
|+0.123
 hunt+0.118
 distance+0.116
To+0.115
Neg logit contributions
elling-0.152
lee-0.150
atted-0.149
 opinions-0.143
els-0.142
Token to
Token ID284
Feature activation-8.27e-02

Pos logit contributions
fire+0.092
|+0.088
 distance+0.084
 usual+0.083
To+0.079
Neg logit contributions
atted-0.141
lee-0.141
elling-0.138
ealing-0.135
ishment-0.134
Token play
Token ID711
Feature activation-2.15e-02

Pos logit contributions
fire+0.121
tr+0.114
To+0.111
|+0.106
 distance+0.103
Neg logit contributions
 opinions-0.165
elling-0.161
lee-0.161
ishment-0.155
ealing-0.154
Token in
Token ID287
Feature activation-6.85e-02

Pos logit contributions
 distance+0.032
 usual+0.030
fire+0.029
|+0.028
field+0.027
Neg logit contributions
ished-0.043
lee-0.043
 opinions-0.042
ealing-0.042
atted-0.042
Token!
Token ID0
Feature activation-8.87e-03

Pos logit contributions
fire+0.103
tr+0.093
|+0.092
 usual+0.092
 distance+0.089
Neg logit contributions
lee-0.145
ishment-0.143
els-0.143
 opinions-0.141
atted-0.139
Token Everyone
Token ID11075
Feature activation-2.16e-02

Pos logit contributions
fire+0.016
 distance+0.015
 usual+0.015
 hunt+0.015
 ride+0.014
Neg logit contributions
lee-0.018
atted-0.017
elling-0.017
ealing-0.017
ishment-0.017
Token had
Token ID550
Feature activation+1.85e-02

Pos logit contributions
fire+0.027
 distance+0.025
 usual+0.023
|+0.023
 hunt+0.022
Neg logit contributions
lee-0.056
atted-0.055
elling-0.054
ealing-0.054
ishment-0.053
Token bird
Token ID6512
Feature activation-1.41e-02

Pos logit contributions
lee+0.030
elling+0.029
ealing+0.029
atted+0.029
ubby+0.028
Neg logit contributions
fire-0.023
|-0.023
To-0.021
 usual-0.020
 distance-0.020
Token wanted
Token ID2227
Feature activation+4.68e-02

Pos logit contributions
fire+0.024
 distance+0.022
 usual+0.021
|+0.021
 hunt+0.020
Neg logit contributions
lee-0.031
atted-0.031
elling-0.029
ealing-0.029
ished-0.028
Token to
Token ID284
Feature activation-6.02e-02

Pos logit contributions
lee+0.117
atted+0.114
elling+0.112
 opinions+0.110
els+0.107
Neg logit contributions
fire-0.064
|-0.060
 distance-0.054
 ride-0.048
tr-0.048
Token cross
Token ID3272
Feature activation-5.51e-02

Pos logit contributions
fire+0.098
|+0.088
tr+0.084
 distance+0.084
 usual+0.083
Neg logit contributions
lee-0.125
 opinions-0.125
 requests-0.117
elling-0.116
atted-0.115
Token the
Token ID262
Feature activation-2.46e-02

Pos logit contributions
fire+0.090
|+0.085
tr+0.080
 distance+0.079
To+0.077
Neg logit contributions
atted-0.116
lee-0.115
els-0.114
 opinions-0.114
ishment-0.112
Token river
Token ID7850
Feature activation-7.28e-02

Pos logit contributions
|+0.032
fire+0.032
To+0.029
 usual+0.029
tr+0.028
Neg logit contributions
elling-0.056
lee-0.055
ealing-0.054
ubby-0.054
ishment-0.054
Token,
Token ID11
Feature activation-8.89e-03

Pos logit contributions
fire+0.128
|+0.128
To+0.120
 usual+0.118
 distance+0.117
Neg logit contributions
atted-0.135
lee-0.135
ishment-0.134
ealing-0.130
 opinions-0.128
Token but
Token ID475
Feature activation-4.05e-03

Pos logit contributions
fire+0.014
 distance+0.013
 usual+0.013
|+0.013
 ride+0.012
Neg logit contributions
lee-0.020
atted-0.020
elling-0.019
ealing-0.019
ishment-0.019
Token it
Token ID340
Feature activation+1.01e-02

Pos logit contributions
fire+0.007
|+0.007
 distance+0.006
 hunt+0.006
To+0.006
Neg logit contributions
lee-0.008
atted-0.008
ishment-0.008
elling-0.008
ealing-0.008
Token did
Token ID750
Feature activation+3.82e-02

Pos logit contributions
atted+0.024
lee+0.023
ealing+0.022
ished+0.022
elling+0.022
Neg logit contributions
fire-0.015
 distance-0.014
|-0.013
 usual-0.013
 hunt-0.013
Token not
Token ID407
Feature activation-1.15e-02

Pos logit contributions
ealing+0.076
atted+0.075
lee+0.075
 opinions+0.073
ishment+0.071
Neg logit contributions
fire-0.070
 distance-0.064
|-0.064
 usual-0.057
 ride-0.055