TOP ACTIVATIONS
MAX = 15.614

It 's not safe ." Tim heard a cat
Are you cold too ?" The bunny looked at
a terrible mess ." D ucky said ,
want a new phone !" S ally and her
butterflies in the tree ." Tim walked to the
you cold too ?" The bunny looked at Mia
a new phone !" S ally and her mom
berries near the big tree ." So , Quick
, " A penny !" Lu cy picked up
make the table clean ." Tim picked up a
. " What is that ?" Todd asked Ben . Ben
the angel , please ?" Her mom said ,
the spider go away ." Tim called his dad
Can you spin for me ?" Spot spun around and around
want to break it ." Sam played with the
wanted some cream too . K itty went to
's just some dirt !". G igi smiled and
if we go there ." Max and Lily listened
don 't bite it !" But Bob still wanted
" A penny !" Lu cy picked up the

MIN ACTIVATIONS
MIN = -29.193

will put it on my paper when I finish . It
you help me hang my art ?" The cat said ,
bad ! You ruined my desk ! Give me back my
know how to increase your finger !
children put on their bath ro bes and ran around in
Where did I lose my wand ?" Lily asked her friends
better way to repair the TV together ." They
I folded up those paper towels for our nap kins ?
. I ruined the pretty board ." The family
it to decor ate your paper things . And you can
for fighting over the yellow block . They realized they both
for getting the wild new carpet . <|endoftext|> Once there was
do not ruin my ice cream . You can stay ,
is cheaper than sugar and flour . We are not very
the water wash away the shampoo and she smiled . Then
to work gathering up the sand and carrying it to the
those who might invade your space . Being quiet can help
You should never touch the razor . It is very dangerous
help Mom to pour the batter into a pan . Mom
You need to remove the glue and put it away ,

INTERVAL 4.412 - 15.614
CONTAINS 18.637%

show them her fancy dress . So she raised her hand
touching his nose . " Fl uffy , you
she wanted to . Sarah was so excited
it . <|endoftext|> Once upon a time , there was a
friends again . " L ily , please don

INTERVAL -6.789 - 4.412
CONTAINS 70.529%

, " No , Joe . That ornament is special and
Once upon a time , there was a little girl called
discovered something special !" Ivy shouted with excitement .
to do . His mom my suggested that they go up
his large pocket and then followed his mom . He hoped

INTERVAL -17.991 - -6.789
CONTAINS 10.812%

but he didn 't stay long . He saw the security
they decided to have a race to the lovely pond .
brother Tom came into her room . He saw Jazz on
girl found some cr ay ons and hard markers in her
and watch all the other trains slide down the tracks .

INTERVAL -29.193 - -17.991
CONTAINS 0.022%

it look like a real spaceship . One day
" You are ruining our chess game !" The
to put it on our castle ." Ben nods
how to match the different colours and shapes of the gum
pour the mix into the mixer . But be careful ,
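
NOTE ON INTERVALS
The four interval boundaries listed above are consistent with splitting the range from MIN to MAX into four equal-width bins, and the four CONTAINS shares sum to 100%. The dump does not say how the bins were chosen, so the sketch below is an assumption. The helper name equal_width_edges is hypothetical, and the shares are copied from the listing, not recomputed.

```python
def equal_width_edges(lo, hi, n_bins):
    """Return n_bins + 1 evenly spaced edges from lo to hi (assumed binning rule)."""
    width = (hi - lo) / n_bins
    return [lo + i * width for i in range(n_bins + 1)]


# MIN and MAX as reported in the listing above.
edges = equal_width_edges(-29.193, 15.614, 4)

# CONTAINS percentages copied from the listing, ordered lowest bin first.
shares = [0.022, 10.812, 70.529, 18.637]

for (low, high), share in zip(zip(edges, edges[1:]), shares):
    print(f"INTERVAL {low:.3f} - {high:.3f}  CONTAINS {share:.3f}%")

print(f"total share: {sum(shares):.3f}%")
```

Read this way, about 70% of sampled activations fall in the bin that straddles zero, and only 0.022% reach the lowest bin, so the MIN ACTIVATIONS examples are rare outliers.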
Token It
Token ID632
Feature activation+1.68e+00

Pos logit contributions
Don+0.076
 Curious+0.074
 Looking+0.068
Would+0.067
 Fasc+0.065
Neg logit contributions
ies-0.164
ish-0.162
ishing-0.158
ig-0.157
els-0.156
Token's
Token ID338
Feature activation+9.89e-01

Pos logit contributions
 Curious+0.034
Don+0.033
 Looking+0.032
Would+0.032
 Fasc+0.031
Neg logit contributions
ies-0.027
ishing-0.024
ish-0.024
ig-0.023
els-0.023
Token not
Token ID407
Feature activation+1.34e+00

Pos logit contributions
 Curious+0.020
Don+0.020
 Looking+0.019
 Fasc+0.019
 Who+0.018
Neg logit contributions
ies-0.016
ishing-0.015
els-0.014
ig-0.013
ish-0.013
Token safe
Token ID3338
Feature activation-6.50e+00

Pos logit contributions
Don+0.021
 Curious+0.021
 Looking+0.019
 Fasc+0.019
 Are+0.018
Neg logit contributions
ies-0.026
els-0.026
ishing-0.026
ig-0.025
ooming-0.023
Token."
Token ID526
Feature activation+1.41e+01

Pos logit contributions
ig+0.115
els+0.114
ooming+0.109
ishing+0.108
ins+0.099
Neg logit contributions
 shouldn-0.101
 He-0.097
 Who-0.096
-0.096
Don-0.096
Token
Token ID198
Feature activation+1.52e+01

Pos logit contributions
 Curious+0.174
Don+0.173
 Looking+0.156
 Who+0.155
Would+0.155
Neg logit contributions
ies-0.325
ishing-0.293
--0.285
ie-0.282
ig-0.279
Token
Token ID198
Feature activation+1.48e+01

Pos logit contributions
 Curious+0.211
Don+0.204
 Looking+0.196
 Fasc+0.192
 Terr+0.190
Neg logit contributions
ies-0.332
--0.302
ie-0.300
ishing-0.289
 again-0.286
TokenTim
Token ID14967
Feature activation+5.31e+00

Pos logit contributions
 Curious+0.221
 Looking+0.204
 Terr+0.204
 Who+0.203
Don+0.203
Neg logit contributions
ies-0.318
ishing-0.285
--0.284
ie-0.276
els-0.268
Token heard
Token ID2982
Feature activation+4.58e+00

Pos logit contributions
 Curious+0.076
Don+0.074
 Looking+0.071
 Fasc+0.068
 Who+0.066
Neg logit contributions
ies-0.118
ishing-0.110
els-0.104
ig-0.104
ish-0.103
Token a
Token ID257
Feature activation+5.19e+00

Pos logit contributions
 Curious+0.080
Don+0.075
 Looking+0.075
 Fasc+0.071
 Who+0.069
Neg logit contributions
ies-0.090
ishing-0.086
ig-0.081
ish-0.080
els-0.079
Token cat
Token ID3797
Feature activation-9.99e-01

Pos logit contributions
 Curious+0.094
Don+0.093
 Looking+0.090
 Fasc+0.089
Would+0.089
Neg logit contributions
ies-0.092
--0.080
ishing-0.075
ig-0.074
ie-0.074
Token Are
Token ID4231
Feature activation+2.08e+00

Pos logit contributions
Don+0.077
 hadn+0.076
Looks+0.075
 intrigued+0.075
 Curious+0.075
Neg logit contributions
ish-0.111
ig-0.110
ies-0.110
 again-0.109
--0.107
Token you
Token ID345
Feature activation+2.16e+00

Pos logit contributions
 Curious+0.043
Don+0.042
 Looking+0.041
 Fasc+0.040
 Terr+0.039
Neg logit contributions
ies-0.033
ishing-0.030
ig-0.028
els-0.027
ish-0.027
Token cold
Token ID4692
Feature activation-6.57e+00

Pos logit contributions
 Curious+0.035
Don+0.032
 Looking+0.032
 Fasc+0.031
 Int+0.029
Neg logit contributions
ies-0.043
els-0.042
ishing-0.041
ries-0.038
ig-0.038
Token too
Token ID1165
Feature activation-6.58e-01

Pos logit contributions
els+0.128
isher+0.110
ig+0.109
ishing+0.108
ins+0.107
Neg logit contributions
Hi-0.091
 Are-0.090
 Curious-0.090
Don-0.083
Hello-0.077
Token?"
Token ID1701
Feature activation+1.44e+01

Pos logit contributions
ies+0.010
ishing+0.010
els+0.010
ig+0.009
ish+0.009
Neg logit contributions
 Curious-0.013
Don-0.012
 Looking-0.012
 Fasc-0.012
 Are-0.012
Token
Token ID198
Feature activation+1.52e+01

Pos logit contributions
Don+0.188
 hadn+0.185
 Curious+0.182
Looks+0.180
 Looking+0.169
Neg logit contributions
ies-0.297
--0.277
 again-0.271
ie-0.271
.-0.267
Token
Token ID198
Feature activation+1.51e+01

Pos logit contributions
 Curious+0.232
Don+0.224
 hadn+0.218
 Looking+0.218
 Terr+0.215
Neg logit contributions
ies-0.295
--0.277
ie-0.269
 again-0.264
.-0.258
TokenThe
Token ID464
Feature activation+6.99e+00

Pos logit contributions
 Curious+0.204
 hadn+0.202
 Looking+0.200
 Terr+0.196
Don+0.191
Neg logit contributions
ies-0.310
--0.285
ie-0.273
els-0.264
ig-0.257
Token bunny
Token ID44915
Feature activation+2.57e+00

Pos logit contributions
 Curious+0.139
Don+0.136
 Looking+0.133
 Fasc+0.129
 Who+0.126
Neg logit contributions
ies-0.116
ishing-0.104
ig-0.096
els-0.096
ish-0.096
Token looked
Token ID3114
Feature activation+2.54e+00

Pos logit contributions
 Curious+0.034
Don+0.033
 Looking+0.032
 Fasc+0.031
 Who+0.031
Neg logit contributions
ies-0.059
ishing-0.054
ig-0.052
ish-0.052
ie-0.051
Token at
Token ID379
Feature activation+5.66e+00

Pos logit contributions
 Curious+0.061
Don+0.059
 Looking+0.058
 Fasc+0.056
 Terr+0.055
Neg logit contributions
ies-0.032
ishing-0.029
els-0.027
ig-0.026
ish-0.024
Token a
Token ID257
Feature activation-6.59e-02

Pos logit contributions
ies+0.019
ishing+0.018
els+0.017
ig+0.017
ish+0.016
Neg logit contributions
 Curious-0.020
Don-0.020
 Looking-0.019
 Fasc-0.018
 Terr-0.018
Token terrible
Token ID7818
Feature activation-4.42e+00

Pos logit contributions
els+0.001
ies+0.001
ishing+0.001
ries+0.001
ig+0.001
Neg logit contributions
 Curious-0.001
Don-0.001
 Looking-0.001
 Terr-0.001
 Who-0.001
Token mess
Token ID2085
Feature activation-8.41e+00

Pos logit contributions
els+0.101
ries+0.091
ishing+0.090
ishes+0.087
ins+0.086
Neg logit contributions
 Curious-0.057
 Are-0.054
 Looking-0.053
 Who-0.050
 Terr-0.049
Token."
Token ID526
Feature activation+1.46e+01

Pos logit contributions
els+0.143
ishing+0.136
ig+0.133
ishes+0.127
ies+0.127
Neg logit contributions
 Are-0.132
 Who-0.128
-0.126
 I-0.123
 He-0.121
Token
Token ID220
Feature activation+9.40e+00

Pos logit contributions
 Curious+0.192
Don+0.190
 Looking+0.179
 hadn+0.177
 Who+0.176
Neg logit contributions
ies-0.321
ish-0.297
ig-0.292
ishing-0.290
ie-0.283
Token
Token ID198
Feature activation+1.52e+01

Pos logit contributions
 Curious+0.128
Don+0.121
 Looking+0.117
 Fasc+0.115
 Terr+0.107
Neg logit contributions
ies-0.209
ishing-0.185
ins-0.184
els-0.182
--0.181
Token
Token ID198
Feature activation+1.49e+01

Pos logit contributions
 Curious+0.224
 Looking+0.213
Don+0.211
 Terr+0.208
 Fasc+0.204
Neg logit contributions
ies-0.316
ie-0.284
--0.282
ig-0.281
ishing-0.276
TokenD
Token ID35
Feature activation+5.35e+00

Pos logit contributions
 Curious+0.247
 Looking+0.237
 Terr+0.235
 Who+0.233
 Fasc+0.225
Neg logit contributions
ies-0.297
ishing-0.264
ig-0.261
ish-0.260
--0.259
Tokenucky
Token ID5309
Feature activation+3.77e+00

Pos logit contributions
 Curious+0.119
Don+0.117
 Looking+0.114
 Fasc+0.110
 Who+0.108
Neg logit contributions
ies-0.076
ishing-0.068
els-0.063
ig-0.062
ish-0.061
Token said
Token ID531
Feature activation+6.22e+00

Pos logit contributions
 Curious+0.054
Don+0.052
 Looking+0.050
 Fasc+0.048
 Who+0.047
Neg logit contributions
ies-0.084
ishing-0.078
els-0.074
ig-0.074
ish-0.073
Token,
Token ID11
Feature activation+7.28e+00

Pos logit contributions
 Curious+0.154
Don+0.149
 Looking+0.147
 Fasc+0.141
 Who+0.140
Neg logit contributions
ishing-0.071
ies-0.071
els-0.067
ins-0.058
ig-0.058
Token want
Token ID765
Feature activation+8.43e-01

Pos logit contributions
 Curious+0.079
Don+0.075
 Looking+0.075
 Fasc+0.072
 Int+0.071
Neg logit contributions
ies-0.077
ishing-0.069
ish-0.067
ig-0.065
--0.065
Token a
Token ID257
Feature activation+2.46e+00

Pos logit contributions
 Who+0.012
 Curious+0.012
Don+0.012
+0.011
 Looking+0.011
Neg logit contributions
ies-0.018
ig-0.018
ishing-0.018
ish-0.017
els-0.016
Token new
Token ID649
Feature activation-1.62e+00

Pos logit contributions
 Curious+0.047
 Looking+0.045
Don+0.044
 Who+0.044
 Fasc+0.043
Neg logit contributions
ies-0.043
ishing-0.041
els-0.040
ig-0.037
ries-0.037
Token phone
Token ID3072
Feature activation-9.33e+00

Pos logit contributions
ies+0.025
ishing+0.024
els+0.023
ishes+0.023
ig+0.022
Neg logit contributions
 Curious-0.034
 Looking-0.033
 Who-0.031
 Fasc-0.030
Don-0.030
Token!"
Token ID2474
Feature activation+1.42e+01

Pos logit contributions
ishing+0.162
els+0.159
ig+0.149
ies+0.147
ries+0.136
Neg logit contributions
 Who-0.169
 Is-0.157
 Where-0.156
 Curious-0.155
 Thank-0.152
Token
Token ID198
Feature activation+1.52e+01

Pos logit contributions
 Curious+0.135
Don+0.127
 Looking+0.120
 Fasc+0.111
 Who+0.105
Neg logit contributions
ies-0.384
ishing-0.367
els-0.352
ig-0.350
ish-0.346
Token
Token ID198
Feature activation+1.51e+01

Pos logit contributions
 Curious+0.229
Don+0.212
 Looking+0.211
 Fasc+0.207
 hadn+0.207
Neg logit contributions
ies-0.323
--0.290
ie-0.286
ins-0.270
els-0.269
TokenS
Token ID50
Feature activation+7.51e+00

Pos logit contributions
 Curious+0.269
 Looking+0.252
 Who+0.246
Don+0.245
 Terr+0.243
Neg logit contributions
ies-0.293
ishing-0.254
els-0.245
--0.243
ie-0.242
Tokenally
Token ID453
Feature activation+5.15e+00

Pos logit contributions
 Curious+0.116
Don+0.112
 Looking+0.108
 Fasc+0.103
 Who+0.101
Neg logit contributions
ies-0.158
ishing-0.147
els-0.139
ig-0.138
ish-0.137
Token and
Token ID290
Feature activation+9.03e+00

Pos logit contributions
 Curious+0.085
Don+0.081
 Looking+0.079
 Fasc+0.076
 Exc+0.074
Neg logit contributions
ies-0.101
ishing-0.096
els-0.091
ig-0.089
ish-0.087
Token her
Token ID607
Feature activation+4.09e+00

Pos logit contributions
 Curious+0.186
Don+0.181
 Looking+0.177
 Fasc+0.170
 Who+0.168
Neg logit contributions
ies-0.147
ishing-0.133
els-0.124
ig-0.121
ish-0.121
Token butterflies
Token ID44571
Feature activation-1.04e+01

Pos logit contributions
ies+0.059
ishing+0.059
ig+0.054
els+0.054
ish+0.054
Neg logit contributions
 Curious-0.060
Don-0.060
 Looking-0.056
 Fasc-0.054
 Are-0.053
Token in
Token ID287
Feature activation-1.83e+00

Pos logit contributions
els+0.170
ishing+0.163
ies+0.158
isher+0.153
ries+0.150
Neg logit contributions
-0.178
Don-0.167
 Curious-0.162
 shouldn-0.161
've-0.160
Token the
Token ID262
Feature activation+2.70e-01

Pos logit contributions
ies+0.040
ishing+0.037
ish+0.035
ier+0.035
ig+0.035
Neg logit contributions
 Curious-0.026
Don-0.026
 Looking-0.024
 Who-0.023
 Exc-0.023
Token tree
Token ID5509
Feature activation-9.06e+00

Pos logit contributions
 Curious+0.005
Don+0.005
 Looking+0.005
 Fasc+0.005
 Who+0.005
Neg logit contributions
ies-0.005
ishing-0.004
ig-0.004
els-0.004
ish-0.004
Token."
Token ID526
Feature activation+1.41e+01

Pos logit contributions
ishing+0.128
ies+0.127
els+0.120
isher+0.115
ig+0.109
Neg logit contributions
-0.174
 Curious-0.171
Don-0.164
 shouldn-0.159
 hadn-0.158
Token
Token ID198
Feature activation+1.51e+01

Pos logit contributions
Don+0.180
 Curious+0.169
Would+0.169
 hadn+0.160
 Fasc+0.156
Neg logit contributions
ies-0.317
--0.291
ishing-0.285
ish-0.282
ig-0.273
Token
Token ID198
Feature activation+1.49e+01

Pos logit contributions
 Curious+0.213
Don+0.205
 Looking+0.198
 Fasc+0.198
 Terr+0.193
Neg logit contributions
ies-0.325
--0.301
ie-0.294
ig-0.284
 again-0.281
TokenTim
Token ID14967
Feature activation+5.67e+00

Pos logit contributions
 Curious+0.219
 Looking+0.207
 Terr+0.204
 Who+0.203
 Fasc+0.202
Neg logit contributions
ies-0.317
--0.287
ishing-0.276
ie-0.275
ig-0.267
Token walked
Token ID6807
Feature activation+3.79e+00

Pos logit contributions
 Curious+0.074
Don+0.070
 Looking+0.067
 Fasc+0.063
 Exc+0.061
Neg logit contributions
ies-0.133
ishing-0.126
els-0.121
ig-0.119
ish-0.118
Token to
Token ID284
Feature activation+6.02e+00

Pos logit contributions
 Curious+0.085
Don+0.083
 Looking+0.080
 Fasc+0.077
Would+0.076
Neg logit contributions
ies-0.054
ishing-0.050
els-0.048
ig-0.047
ish-0.046
Token the
Token ID262
Feature activation+5.53e+00

Pos logit contributions
 Curious+0.122
Don+0.120
 Looking+0.117
 Fasc+0.112
 hadn+0.112
Neg logit contributions
ies-0.094
ishing-0.084
els-0.079
ie-0.077
ish-0.076
Token you
Token ID345
Feature activation+2.16e+00

Pos logit contributions
 Curious+0.043
Don+0.042
 Looking+0.041
 Fasc+0.040
 Terr+0.039
Neg logit contributions
ies-0.033
ishing-0.030
ig-0.028
els-0.027
ish-0.027
Token cold
Token ID4692
Feature activation-6.57e+00

Pos logit contributions
 Curious+0.035
Don+0.032
 Looking+0.032
 Fasc+0.031
 Int+0.029
Neg logit contributions
ies-0.043
els-0.042
ishing-0.041
ries-0.038
ig-0.038
Token too
Token ID1165
Feature activation-6.58e-01

Pos logit contributions
els+0.128
isher+0.110
ig+0.109
ishing+0.108
ins+0.107
Neg logit contributions
Hi-0.091
 Are-0.090
 Curious-0.090
Don-0.083
Hello-0.077
Token?"
Token ID1701
Feature activation+1.44e+01

Pos logit contributions
ies+0.010
ishing+0.010
els+0.010
ig+0.009
ish+0.009
Neg logit contributions
 Curious-0.013
Don-0.012
 Looking-0.012
 Fasc-0.012
 Are-0.012
Token
Token ID198
Feature activation+1.52e+01

Pos logit contributions
Don+0.188
 hadn+0.185
 Curious+0.182
Looks+0.180
 Looking+0.169
Neg logit contributions
ies-0.297
--0.277
 again-0.271
ie-0.271
.-0.267
Token
Token ID198
Feature activation+1.51e+01

Pos logit contributions
 Curious+0.232
Don+0.224
 hadn+0.218
 Looking+0.218
 Terr+0.215
Neg logit contributions
ies-0.295
--0.277
ie-0.269
 again-0.264
.-0.258
TokenThe
Token ID464
Feature activation+6.99e+00

Pos logit contributions
 Curious+0.204
 hadn+0.202
 Looking+0.200
 Terr+0.196
Don+0.191
Neg logit contributions
ies-0.310
--0.285
ie-0.273
els-0.264
ig-0.257
Token bunny
Token ID44915
Feature activation+2.57e+00

Pos logit contributions
 Curious+0.139
Don+0.136
 Looking+0.133
 Fasc+0.129
 Who+0.126
Neg logit contributions
ies-0.116
ishing-0.104
ig-0.096
els-0.096
ish-0.096
Token looked
Token ID3114
Feature activation+2.54e+00

Pos logit contributions
 Curious+0.034
Don+0.033
 Looking+0.032
 Fasc+0.031
 Who+0.031
Neg logit contributions
ies-0.059
ishing-0.054
ig-0.052
ish-0.052
ie-0.051
Token at
Token ID379
Feature activation+5.66e+00

Pos logit contributions
 Curious+0.061
Don+0.059
 Looking+0.058
 Fasc+0.056
 Terr+0.055
Neg logit contributions
ies-0.032
ishing-0.029
els-0.027
ig-0.026
ish-0.024
Token Mia
Token ID32189
Feature activation-1.57e+00

Pos logit contributions
 Curious+0.086
Don+0.083
 Looking+0.080
 Fasc+0.076
 Who+0.075
Neg logit contributions
ies-0.120
ishing-0.113
els-0.107
ig-0.106
ish-0.105
Token a
Token ID257
Feature activation+2.46e+00

Pos logit contributions
 Who+0.012
 Curious+0.012
Don+0.012
+0.011
 Looking+0.011
Neg logit contributions
ies-0.018
ig-0.018
ishing-0.018
ish-0.017
els-0.016
Token new
Token ID649
Feature activation-1.62e+00

Pos logit contributions
 Curious+0.047
 Looking+0.045
Don+0.044
 Who+0.044
 Fasc+0.043
Neg logit contributions
ies-0.043
ishing-0.041
els-0.040
ig-0.037
ries-0.037
Token phone
Token ID3072
Feature activation-9.33e+00

Pos logit contributions
ies+0.025
ishing+0.024
els+0.023
ishes+0.023
ig+0.022
Neg logit contributions
 Curious-0.034
 Looking-0.033
 Who-0.031
 Fasc-0.030
Don-0.030
Token!"
Token ID2474
Feature activation+1.42e+01

Pos logit contributions
ishing+0.162
els+0.159
ig+0.149
ies+0.147
ries+0.136
Neg logit contributions
 Who-0.169
 Is-0.157
 Where-0.156
 Curious-0.155
 Thank-0.152
Token
Token ID198
Feature activation+1.52e+01

Pos logit contributions
 Curious+0.135
Don+0.127
 Looking+0.120
 Fasc+0.111
 Who+0.105
Neg logit contributions
ies-0.384
ishing-0.367
els-0.352
ig-0.350
ish-0.346
Token
Token ID198
Feature activation+1.51e+01

Pos logit contributions
 Curious+0.229
Don+0.212
 Looking+0.211
 Fasc+0.207
 hadn+0.207
Neg logit contributions
ies-0.323
--0.290
ie-0.286
ins-0.270
els-0.269
TokenS
Token ID50
Feature activation+7.51e+00

Pos logit contributions
 Curious+0.269
 Looking+0.252
 Who+0.246
Don+0.245
 Terr+0.243
Neg logit contributions
ies-0.293
ishing-0.254
els-0.245
--0.243
ie-0.242
Tokenally
Token ID453
Feature activation+5.15e+00

Pos logit contributions
 Curious+0.116
Don+0.112
 Looking+0.108
 Fasc+0.103
 Who+0.101
Neg logit contributions
ies-0.158
ishing-0.147
els-0.139
ig-0.138
ish-0.137
Token and
Token ID290
Feature activation+9.03e+00

Pos logit contributions
 Curious+0.085
Don+0.081
 Looking+0.079
 Fasc+0.076
 Exc+0.074
Neg logit contributions
ies-0.101
ishing-0.096
els-0.091
ig-0.089
ish-0.087
Token her
Token ID607
Feature activation+4.09e+00

Pos logit contributions
 Curious+0.186
Don+0.181
 Looking+0.177
 Fasc+0.170
 Who+0.168
Neg logit contributions
ies-0.147
ishing-0.133
els-0.124
ig-0.121
ish-0.121
Token mom
Token ID1995
Feature activation+2.94e+00

Pos logit contributions
 Curious+0.067
Me+0.061
Don+0.061
 Looking+0.058
 Who+0.058
Neg logit contributions
ishing-0.075
ig-0.071
ies-0.069
ier-0.067
els-0.067
Token berries
Token ID36322
Feature activation-9.18e+00

Pos logit contributions
ooming+0.105
ishing+0.103
ig+0.102
ier+0.099
ist+0.097
Neg logit contributions
 Would-0.100
 Curious-0.099
 Who-0.095
-0.095
Would-0.092
Token near
Token ID1474
Feature activation-2.32e+00

Pos logit contributions
ishing+0.146
els+0.144
ooming+0.135
ies+0.133
ig+0.131
Neg logit contributions
-0.159
Don-0.156
 Would-0.155
 On-0.154
 Inside-0.151
Token the
Token ID262
Feature activation+1.71e+00

Pos logit contributions
ies+0.045
ishing+0.043
ier+0.041
ig+0.040
ins+0.038
Neg logit contributions
 Curious-0.038
Don-0.037
 Looking-0.035
 Who-0.035
 Exc-0.034
Token big
Token ID1263
Feature activation-1.25e+00

Pos logit contributions
 Curious+0.032
Don+0.031
 Looking+0.030
 Fasc+0.029
 Who+0.028
Neg logit contributions
ies-0.031
ishing-0.029
els-0.027
ig-0.026
ish-0.026
Token tree
Token ID5509
Feature activation-8.39e+00

Pos logit contributions
ies+0.025
ishing+0.025
ins+0.024
els+0.023
ier+0.023
Neg logit contributions
 Curious-0.020
Don-0.018
 Looking-0.017
 Are-0.017
 Fasc-0.017
Token."
Token ID526
Feature activation+1.50e+01

Pos logit contributions
ishing+0.108
ies+0.104
els+0.101
ig+0.089
ooming+0.089
Neg logit contributions
 Curious-0.186
Don-0.174
 Who-0.174
 Would-0.171
-0.169
Token
Token ID198
Feature activation+1.48e+01

Pos logit contributions
 Curious+0.184
Don+0.181
 Looking+0.168
 Fasc+0.162
Would+0.161
Neg logit contributions
ies-0.356
ishing-0.327
els-0.319
ish-0.319
ig-0.317
Token
Token ID198
Feature activation+1.46e+01

Pos logit contributions
 Curious+0.195
Don+0.187
 Looking+0.180
 Fasc+0.176
 Terr+0.168
Neg logit contributions
ies-0.334
--0.301
ig-0.298
ie-0.294
ishing-0.294
TokenSo
Token ID2396
Feature activation+6.71e+00

Pos logit contributions
 Curious+0.229
 Looking+0.219
 Fasc+0.210
 Terr+0.205
Don+0.203
Neg logit contributions
ies-0.296
els-0.258
--0.254
ishing-0.253
ig-0.249
Token,
Token ID11
Feature activation+9.15e+00

Pos logit contributions
 Curious+0.180
Don+0.177
 Looking+0.173
 Fasc+0.169
 Who+0.167
Neg logit contributions
ies-0.064
ishing-0.055
els-0.048
ig-0.047
ish-0.046
Token Quick
Token ID12029
Feature activation+4.99e+00

Pos logit contributions
 Curious+0.146
Don+0.140
 Looking+0.137
 Fasc+0.130
Would+0.127
Neg logit contributions
ies-0.188
ishing-0.172
els-0.164
ig-0.163
ish-0.162
Token,
Token ID11
Feature activation+5.27e+00

Pos logit contributions
 Curious+0.129
Don+0.124
 Looking+0.124
 Fasc+0.120
 Who+0.118
Neg logit contributions
ies-0.047
ishing-0.045
els-0.040
ig-0.037
ish-0.036
Token "
Token ID366
Feature activation+8.58e+00

Pos logit contributions
 Curious+0.066
Don+0.064
 Looking+0.060
 Fasc+0.058
Would+0.056
Neg logit contributions
ies-0.123
ishing-0.113
els-0.109
ig-0.107
ish-0.107
TokenA
Token ID32
Feature activation+2.35e+00

Pos logit contributions
 Curious+0.083
 Looking+0.077
 Terr+0.074
 Fasc+0.071
Don+0.068
Neg logit contributions
ies-0.210
ishing-0.200
els-0.193
ish-0.191
--0.191
Token penny
Token ID26369
Feature activation-6.55e+00

Pos logit contributions
 Curious+0.050
Don+0.049
 Looking+0.047
 Fasc+0.046
 Terr+0.045
Neg logit contributions
ies-0.035
ishing-0.032
ig-0.030
ish-0.029
els-0.029
Token!"
Token ID2474
Feature activation+1.33e+01

Pos logit contributions
ies+0.099
ishing+0.098
els+0.096
ins+0.089
ishes+0.084
Neg logit contributions
 Curious-0.113
 Are-0.111
 hadn-0.111
Don-0.109
-0.108
Token
Token ID198
Feature activation+1.50e+01

Pos logit contributions
Don+0.157
 Curious+0.153
Would+0.149
 hadn+0.144
 Who+0.142
Neg logit contributions
ies-0.306
--0.272
ishing-0.262
ries-0.260
els-0.259
Token
Token ID198
Feature activation+1.49e+01

Pos logit contributions
 Curious+0.213
Don+0.208
 Fasc+0.195
 Looking+0.195
 Terr+0.194
Neg logit contributions
ies-0.320
--0.293
ie-0.291
ig-0.277
 again-0.276
TokenLu
Token ID25596
Feature activation+7.36e+00

Pos logit contributions
 Curious+0.201
Don+0.189
 Looking+0.187
 Terr+0.186
 Who+0.180
Neg logit contributions
ies-0.331
--0.296
ishing-0.296
ie-0.285
els-0.281
Tokency
Token ID948
Feature activation+3.78e+00

Pos logit contributions
 Curious+0.143
Don+0.140
 Looking+0.135
 Fasc+0.130
 Who+0.128
Neg logit contributions
ies-0.124
ishing-0.113
els-0.106
ig-0.105
ish-0.102
Token picked
Token ID6497
Feature activation+1.79e+00

Pos logit contributions
 Curious+0.053
Don+0.052
 Looking+0.049
 Fasc+0.047
 Terr+0.047
Neg logit contributions
ies-0.085
ishing-0.078
ig-0.074
els-0.074
ish-0.074
Token up
Token ID510
Feature activation+1.10e+00

Pos logit contributions
 Curious+0.037
Don+0.036
 Looking+0.035
 Fasc+0.034
 Who+0.034
Neg logit contributions
ies-0.028
ishing-0.026
els-0.024
ig-0.024
ish-0.023
Token make
Token ID787
Feature activation-2.63e-01

Pos logit contributions
 Curious+0.039
Don+0.038
 Looking+0.037
 Who+0.035
 Fasc+0.035
Neg logit contributions
ies-0.056
ishing-0.052
els-0.049
ig-0.049
ins-0.049
Token the
Token ID262
Feature activation+2.01e+00

Pos logit contributions
ies+0.006
ishing+0.005
ig+0.005
ier+0.005
els+0.005
Neg logit contributions
 Curious-0.004
 Looking-0.003
 Exc-0.003
Don-0.003
 Int-0.003
Token table
Token ID3084
Feature activation-8.63e+00

Pos logit contributions
 Curious+0.042
Don+0.039
 Who+0.038
 Fasc+0.038
 Looking+0.037
Neg logit contributions
ies-0.031
ishing-0.030
ig-0.030
els-0.028
ie-0.026
Token clean
Token ID3424
Feature activation-6.67e+00

Pos logit contributions
els+0.153
ishing+0.153
ig+0.146
ies+0.137
ish+0.132
Neg logit contributions
 Curious-0.132
-0.129
 relieved-0.127
 Something-0.123
Don-0.121
Token."
Token ID526
Feature activation+1.43e+01

Pos logit contributions
ig+0.108
els+0.106
ishing+0.102
ies+0.101
ooming+0.094
Neg logit contributions
-0.104
 He-0.101
 Curious-0.099
 It-0.099
Don-0.097
Token
Token ID198
Feature activation+1.50e+01

Pos logit contributions
 Curious+0.169
Don+0.167
 Looking+0.151
Would+0.147
 Who+0.147
Neg logit contributions
ies-0.343
ishing-0.314
els-0.302
ish-0.301
--0.300
Token
Token ID198
Feature activation+1.44e+01

Pos logit contributions
 Curious+0.205
Don+0.198
 Looking+0.190
 Fasc+0.184
 Terr+0.184
Neg logit contributions
ies-0.334
--0.300
ie-0.297
ishing-0.292
ig-0.286
TokenTim
Token ID14967
Feature activation+5.45e+00

Pos logit contributions
 Curious+0.206
 Looking+0.191
Don+0.188
 Terr+0.185
 Who+0.183
Neg logit contributions
ies-0.325
ishing-0.295
els-0.281
--0.280
ie-0.279
Token picked
Token ID6497
Feature activation+2.03e+00

Pos logit contributions
 Curious+0.099
Don+0.096
 Looking+0.092
 Fasc+0.089
 Exc+0.087
Neg logit contributions
ies-0.099
ishing-0.092
els-0.087
ig-0.085
ish-0.083
Token up
Token ID510
Feature activation+9.33e-01

Pos logit contributions
 Curious+0.038
 Fasc+0.035
 Looking+0.035
 Exc+0.035
Don+0.034
Neg logit contributions
ies-0.036
ig-0.032
ishing-0.031
ist-0.030
els-0.029
Token a
Token ID257
Feature activation+2.66e+00

Pos logit contributions
 Curious+0.012
Don+0.011
 Fasc+0.011
 Looking+0.010
 Exc+0.010
Neg logit contributions
ies-0.021
ishing-0.021
ig-0.020
ier-0.019
ish-0.019
Token.
Token ID13
Feature activation+1.35e+01

Pos logit contributions
ishing+0.104
ies+0.099
els+0.085
ishes+0.084
isher+0.083
Neg logit contributions
 Curious-0.134
Don-0.133
-0.129
 hadn-0.128
 Looking-0.127
Token "
Token ID366
Feature activation+1.09e+01

Pos logit contributions
Don+0.210
 hadn+0.202
Ell+0.202
Follow+0.200
 Terr+0.194
Neg logit contributions
ies-0.241
 and-0.217
ish-0.208
ishing-0.204
ins-0.202
TokenWhat
Token ID2061
Feature activation+5.17e+00

Pos logit contributions
 Curious+0.115
 Looking+0.106
 Terr+0.101
 Fasc+0.098
Don+0.093
Neg logit contributions
ies-0.259
ishing-0.247
els-0.246
ish-0.236
ins-0.233
Token is
Token ID318
Feature activation+1.79e+00

Pos logit contributions
 Curious+0.088
Don+0.085
 Looking+0.082
 Fasc+0.079
 Who+0.079
Neg logit contributions
ies-0.100
ishing-0.096
els-0.088
ig-0.088
ish-0.086
Token that
Token ID326
Feature activation-3.11e+00

Pos logit contributions
 Curious+0.035
Don+0.034
 Looking+0.033
 Fasc+0.032
 Who+0.032
Neg logit contributions
ies-0.030
ishing-0.028
ig-0.026
els-0.025
ish-0.025
Token?"
Token ID1701
Feature activation+1.50e+01

Pos logit contributions
ies+0.054
ishing+0.052
els+0.048
ig+0.047
ish+0.045
Neg logit contributions
 Curious-0.058
Don-0.056
 Looking-0.054
 Fasc-0.052
 Who-0.052
Token Todd
Token ID14377
Feature activation+4.86e+00

Pos logit contributions
Don+0.231
 Terr+0.228
Follow+0.228
 Curious+0.227
tted+0.224
Neg logit contributions
ies-0.295
els-0.253
ries-0.251
 eggs-0.245
ishing-0.240
Token asked
Token ID1965
Feature activation+4.85e+00

Pos logit contributions
 Curious+0.056
Don+0.053
 Looking+0.051
 Fasc+0.048
 Who+0.046
Neg logit contributions
ies-0.122
ishing-0.114
els-0.110
ig-0.109
ish-0.109
Token Ben
Token ID3932
Feature activation+1.46e+00

Pos logit contributions
 Curious+0.119
 Looking+0.113
Don+0.112
 Exc+0.109
 Fasc+0.109
Neg logit contributions
ishing-0.058
ies-0.055
ig-0.050
els-0.047
ish-0.046
Token.
Token ID13
Feature activation+1.33e+01

Pos logit contributions
 Curious+0.038
Don+0.037
 Looking+0.037
 Who+0.036
 Fasc+0.036
Neg logit contributions
ies-0.014
ishing-0.013
ig-0.012
els-0.011
ier-0.010
Token Ben
Token ID3932
Feature activation+5.04e+00

Pos logit contributions
Don+0.184
 hadn+0.168
Ta+0.168
 Curious+0.166
Follow+0.163
Neg logit contributions
ies-0.271
--0.237
ishing-0.231
 again-0.229
ins-0.229
Token the
Token ID262
Feature activation+1.85e+00

Pos logit contributions
ies+0.054
ishing+0.051
ig+0.048
ish+0.046
ier+0.046
Neg logit contributions
 Curious-0.039
Don-0.038
 Who-0.036
Would-0.036
 Looking-0.036
Token angel
Token ID18304
Feature activation-8.90e+00

Pos logit contributions
 Curious+0.035
Me+0.032
 Looking+0.032
 Who+0.032
Don+0.031
Neg logit contributions
ishing-0.032
ies-0.031
ins-0.028
ig-0.028
els-0.027
Token,
Token ID11
Feature activation+4.05e+00

Pos logit contributions
ishing+0.133
ies+0.123
els+0.123
ins+0.118
isher+0.114
Neg logit contributions
 Curious-0.165
-0.157
 Who-0.155
 Is-0.152
 Are-0.151
Token please
Token ID3387
Feature activation+1.10e+00

Pos logit contributions
 Curious+0.062
Don+0.062
 Looking+0.057
 Who+0.056
Would+0.055
Neg logit contributions
ies-0.084
ishing-0.082
els-0.076
ig-0.076
ish-0.074
Token?"
Token ID1701
Feature activation+1.42e+01

Pos logit contributions
 Curious+0.022
Don+0.021
 Who+0.021
 Looking+0.021
 Are+0.020
Neg logit contributions
ies-0.017
ishing-0.016
ig-0.015
els-0.015
ins-0.015
Token
Token ID198
Feature activation+1.50e+01

Pos logit contributions
 Curious+0.184
Don+0.176
 Looking+0.167
 Terr+0.161
 Fasc+0.159
Neg logit contributions
ies-0.329
ishing-0.297
els-0.290
--0.288
ie-0.287
Token
Token ID198
Feature activation+1.48e+01

Pos logit contributions
 Curious+0.215
Don+0.203
 Looking+0.197
 Fasc+0.191
 Terr+0.191
Neg logit contributions
ies-0.330
--0.301
ie-0.295
ishing-0.287
 again-0.283
TokenHer
Token ID9360
Feature activation+5.54e+00

Pos logit contributions
 Curious+0.231
 Looking+0.214
Don+0.212
 Terr+0.210
 Who+0.204
Neg logit contributions
ies-0.314
ishing-0.281
--0.277
els-0.268
ie-0.266
Token mom
Token ID1995
Feature activation+5.02e+00

Pos logit contributions
 Curious+0.096
 Looking+0.086
Don+0.086
 Who+0.083
 Fasc+0.083
Neg logit contributions
ishing-0.103
ies-0.094
ig-0.093
ier-0.090
els-0.090
Token said
Token ID531
Feature activation+4.79e+00

Pos logit contributions
 Curious+0.062
Don+0.059
 Looking+0.056
 Fasc+0.053
 Who+0.051
Neg logit contributions
ies-0.122
ishing-0.115
els-0.110
ig-0.109
ish-0.108
Token,
Token ID11
Feature activation+5.98e+00

Pos logit contributions
 Curious+0.107
Don+0.102
 Looking+0.100
 Who+0.097
 hadn+0.097
Neg logit contributions
ishing-0.068
ies-0.063
els-0.060
ish-0.055
ishes-0.055
Token the
Token ID262
Feature activation+2.27e+00

Pos logit contributions
ies+0.021
ishing+0.021
ish+0.019
els+0.019
ishes+0.019
Neg logit contributions
 Curious-0.015
 Looking-0.013
 Int-0.013
Don-0.013
 Who-0.012
Token spider
Token ID19230
Feature activation-9.22e+00

Pos logit contributions
 Curious+0.042
Don+0.039
 Looking+0.038
 Who+0.037
 Terr+0.037
Neg logit contributions
ies-0.042
ishing-0.041
els-0.039
ishes-0.036
ins-0.036
Token go
Token ID467
Feature activation-3.41e+00

Pos logit contributions
ishing+0.181
els+0.176
ies+0.168
isher+0.167
ries+0.164
Neg logit contributions
 Curious-0.138
-0.129
 Terr-0.127
 Looking-0.122
Don-0.118
Token away
Token ID1497
Feature activation-6.11e+00

Pos logit contributions
ies+0.054
ishing+0.051
els+0.050
ins+0.049
ig+0.048
Neg logit contributions
 Curious-0.067
Don-0.066
 Looking-0.062
 Fasc-0.060
 Who-0.060
Token."
Token ID526
Feature activation+1.37e+01

Pos logit contributions
ishing+0.107
els+0.103
ig+0.099
ies+0.095
ish+0.093
Neg logit contributions
-0.105
 Curious-0.102
 Who-0.100
 He-0.099
Don-0.098
Token
Token ID198
Feature activation+1.49e+01

Pos logit contributions
Don+0.167
 Curious+0.166
 Looking+0.152
Would+0.148
 hadn+0.148
Neg logit contributions
ies-0.315
ishing-0.291
--0.284
ish-0.279
ig-0.277
Token
Token ID198
Feature activation+1.46e+01

Pos logit contributions
 Curious+0.206
Don+0.197
 Looking+0.192
 Fasc+0.188
 Terr+0.185
Neg logit contributions
ies-0.329
--0.303
ie-0.296
ishing-0.288
ig-0.286
TokenTim
Token ID14967
Feature activation+5.73e+00

Pos logit contributions
 Curious+0.206
 Looking+0.196
Don+0.194
 Terr+0.190
 Who+0.188
Neg logit contributions
ies-0.323
ishing-0.291
--0.290
ie-0.278
ig-0.273
Token called
Token ID1444
Feature activation+3.06e+00

Pos logit contributions
 Curious+0.073
Don+0.070
 Looking+0.067
 Fasc+0.063
 Who+0.061
Neg logit contributions
ies-0.135
ishing-0.128
els-0.122
ig-0.121
ish-0.120
Token his
Token ID465
Feature activation+3.07e+00

Pos logit contributions
 Curious+0.048
Don+0.045
 Looking+0.044
 Who+0.042
 Fasc+0.042
Neg logit contributions
ies-0.065
ishing-0.065
els-0.061
ig-0.061
ins-0.059
Token dad
Token ID9955
Feature activation+1.53e+00

Pos logit contributions
 Curious+0.064
 Looking+0.059
Don+0.059
 Who+0.057
 Fasc+0.056
Neg logit contributions
ishing-0.047
ies-0.047
els-0.043
ries-0.042
ins-0.042
Token Can
Token ID1680
Feature activation+6.19e+00

Pos logit contributions
Don+0.085
 Curious+0.081
 hadn+0.080
Thanks+0.078
 intrigued+0.077
Neg logit contributions
ies-0.125
ish-0.124
 again-0.123
ig-0.122
ing-0.119
Token you
Token ID345
Feature activation+2.50e+00

Pos logit contributions
 Curious+0.124
 Terr+0.121
 Fasc+0.119
 Looking+0.118
Don+0.114
Neg logit contributions
ies-0.089
ie-0.080
ig-0.077
--0.077
 again-0.076
Token spin
Token ID7906
Feature activation-5.24e+00

Pos logit contributions
 Curious+0.040
Don+0.038
 Looking+0.037
 Fasc+0.036
 Who+0.035
Neg logit contributions
ies-0.052
ishing-0.048
els-0.046
ish-0.045
ig-0.044
Token for
Token ID329
Feature activation+1.68e-01

Pos logit contributions
ies+0.090
ishing+0.080
els+0.077
ier+0.077
ins+0.077
Neg logit contributions
 Curious-0.100
Don-0.097
 Looking-0.094
 Fasc-0.092
 Exc-0.091
Token me
Token ID502
Feature activation-3.46e+00

Pos logit contributions
 Curious+0.003
Don+0.002
 Looking+0.002
 Int+0.002
 Fasc+0.002
Neg logit contributions
ies-0.004
ishing-0.003
ish-0.003
els-0.003
ig-0.003
Token?"
Token ID1701
Feature activation+1.49e+01

Pos logit contributions
ies+0.062
els+0.059
ries+0.057
ins+0.056
ishes+0.056
Neg logit contributions
 Curious-0.060
 Looking-0.056
 Int-0.054
-0.054
Don-0.054
Token Spot
Token ID15899
Feature activation+6.69e+00

Pos logit contributions
 hadn+0.221
 Curious+0.218
Don+0.215
 Looking+0.213
 Terr+0.206
Neg logit contributions
ies-0.283
ries-0.264
--0.260
 again-0.253
els-0.252
Token spun
Token ID26843
Feature activation-2.18e-01

Pos logit contributions
Don+0.103
 Curious+0.101
 Looking+0.094
Follow+0.094
 Terr+0.094
Neg logit contributions
ies-0.140
--0.123
ie-0.119
ish-0.119
ishing-0.116
Token around
Token ID1088
Feature activation+2.12e+00

Pos logit contributions
ies+0.003
ishing+0.003
els+0.002
-+0.002
ig+0.002
Neg logit contributions
 Curious-0.005
Don-0.005
 Looking-0.005
 Who-0.005
Would-0.005
Token and
Token ID290
Feature activation+5.56e+00

Pos logit contributions
 Curious+0.052
Don+0.051
 Looking+0.049
 Fasc+0.048
 Who+0.047
Neg logit contributions
ies-0.028
ishing-0.026
els-0.025
ish-0.024
ig-0.024
Token around
Token ID1088
Feature activation+1.42e+00

Pos logit contributions
 Curious+0.092
Don+0.090
 Looking+0.090
 Terr+0.087
Would+0.086
Neg logit contributions
ies-0.101
 again-0.090
ishing-0.090
--0.087
ig-0.086
Token want
Token ID765
Feature activation-1.18e+00

Pos logit contributions
 Curious+0.040
Don+0.039
 Looking+0.037
 Who+0.036
 Fasc+0.034
Neg logit contributions
ies-0.069
ishing-0.066
els-0.064
ig-0.062
ins-0.061
Token to
Token ID284
Feature activation+1.80e+00

Pos logit contributions
ies+0.026
ishing+0.025
ins+0.023
ig+0.023
ish+0.023
Neg logit contributions
 Who-0.017
 Curious-0.016
Don-0.016
 Where-0.016
-0.016
Token break
Token ID2270
Feature activation-7.18e+00

Pos logit contributions
 Curious+0.021
Don+0.021
 Who+0.020
 Are+0.020
 Looking+0.019
Neg logit contributions
ishing-0.041
ies-0.040
ins-0.039
ig-0.038
els-0.038
Token it
Token ID340
Feature activation-8.28e+00

Pos logit contributions
ies+0.167
ig+0.167
ishing+0.166
ish+0.161
pit+0.156
Neg logit contributions
Don-0.057
 Who-0.054
-0.053
 Are-0.052
 It-0.046
Token."
Token ID526
Feature activation+1.46e+01

Pos logit contributions
ishing+0.145
els+0.140
ig+0.139
ies+0.138
isher+0.134
Neg logit contributions
 On-0.127
-0.125
 Curious-0.125
 He-0.125
Don-0.120
Token
Token ID198
Feature activation+1.49e+01

Pos logit contributions
Don+0.216
 Curious+0.210
Would+0.196
 hadn+0.195
 Looking+0.193
Neg logit contributions
ies-0.308
ishing-0.276
ig-0.269
ish-0.266
--0.264
Token
Token ID198
Feature activation+1.45e+01

Pos logit contributions
 Curious+0.222
Don+0.213
 Looking+0.207
 Fasc+0.202
 Terr+0.201
Neg logit contributions
ies-0.310
--0.284
ie-0.278
ig-0.273
ishing-0.268
TokenSam
Token ID16305
Feature activation+5.78e+00

Pos logit contributions
 Curious+0.221
 Looking+0.211
 Terr+0.210
 Who+0.199
 Fasc+0.199
Neg logit contributions
ies-0.304
--0.267
ishing-0.262
ig-0.257
ie-0.256
Token played
Token ID2826
Feature activation+2.35e+00

Pos logit contributions
 Curious+0.094
Don+0.091
 Looking+0.088
 Fasc+0.085
 Who+0.084
Neg logit contributions
ies-0.117
ishing-0.108
els-0.101
ish-0.101
ig-0.101
Token with
Token ID351
Feature activation+1.88e+00

Pos logit contributions
 Curious+0.051
Don+0.049
 Looking+0.048
 Fasc+0.047
Would+0.046
Neg logit contributions
ies-0.037
ishing-0.032
els-0.031
ish-0.031
ig-0.030
Token the
Token ID262
Feature activation+3.47e+00

Pos logit contributions
 Curious+0.027
Don+0.026
 Looking+0.025
 Fasc+0.024
 Who+0.023
Neg logit contributions
ies-0.041
ishing-0.038
ig-0.036
ish-0.036
ier-0.036
Token wanted
Token ID2227
Feature activation+1.34e+00

Pos logit contributions
 Curious+0.077
 Looking+0.072
 Fasc+0.072
Don+0.071
 Terr+0.070
Neg logit contributions
ies-0.102
ie-0.090
--0.090
ish-0.088
ishing-0.088
Token some
Token ID617
Feature activation-3.16e+00

Pos logit contributions
 Curious+0.026
Don+0.025
 Who+0.024
 Looking+0.024
Would+0.024
Neg logit contributions
ies-0.023
ishing-0.023
ig-0.021
ish-0.020
els-0.019
Token cream
Token ID8566
Feature activation-6.99e+00

Pos logit contributions
ies+0.053
ig+0.052
ishing+0.052
els+0.046
ist+0.045
Neg logit contributions
 Curious-0.056
Me-0.054
Don-0.053
 Fasc-0.052
 Looking-0.052
Token too
Token ID1165
Feature activation-2.19e+00

Pos logit contributions
ishing+0.109
ies+0.101
ig+0.099
els+0.095
ooming+0.093
Neg logit contributions
Don-0.127
 Who-0.123
-0.122
 Is-0.120
 Ms-0.116
Token.
Token ID13
Feature activation+1.38e+01

Pos logit contributions
ishing+0.032
ies+0.029
els+0.029
ig+0.029
ishes+0.026
Neg logit contributions
 Curious-0.045
Don-0.043
-0.043
 Who-0.042
 Looking-0.042
Token
Token ID198
Feature activation+1.49e+01

Pos logit contributions
 hadn+0.178
 Curious+0.175
Ta+0.174
Don+0.173
Would+0.172
Neg logit contributions
ies-0.289
--0.266
ig-0.248
ish-0.247
ie-0.246
Token
Token ID198
Feature activation+1.47e+01

Pos logit contributions
 Curious+0.208
Don+0.197
 Looking+0.194
 Fasc+0.191
 Terr+0.189
Neg logit contributions
ies-0.321
--0.294
ie-0.293
ig-0.284
ishing-0.277
TokenK
Token ID42
Feature activation+7.91e+00

Pos logit contributions
 Curious+0.176
 Looking+0.167
 Who+0.158
 Terr+0.158
 Fasc+0.158
Neg logit contributions
ies-0.348
--0.325
ig-0.312
ishing-0.310
ie-0.309
Tokenitty
Token ID9760
Feature activation+4.39e+00

Pos logit contributions
 Curious+0.143
Don+0.138
 Looking+0.134
 Fasc+0.129
 Who+0.126
Neg logit contributions
ies-0.146
ishing-0.135
els-0.127
ig-0.124
ish-0.124
Token went
Token ID1816
Feature activation+4.19e+00

Pos logit contributions
 Curious+0.066
Don+0.064
 Looking+0.062
 Fasc+0.059
 Who+0.058
Neg logit contributions
ies-0.095
ishing-0.086
ig-0.082
ish-0.082
els-0.082
Token to
Token ID284
Feature activation+5.69e+00

Pos logit contributions
 Curious+0.090
Don+0.088
 Looking+0.084
 Fasc+0.082
Would+0.081
Neg logit contributions
ies-0.063
ishing-0.059
els-0.057
ig-0.056
ish-0.055
Token's
Token ID338
Feature activation-3.54e-01

Pos logit contributions
 Curious+0.001
Don+0.001
 Looking+0.001
 Who+0.001
 Fasc+0.001
Neg logit contributions
ies-0.001
ishing-0.001
els-0.001
ig-0.001
ish-0.001
Token just
Token ID655
Feature activation-2.14e+00

Pos logit contributions
ies+0.007
ishing+0.007
ig+0.007
els+0.007
ish+0.006
Neg logit contributions
Don-0.005
 Curious-0.005
 Looking-0.005
 hadn-0.005
 Who-0.005
Token some
Token ID617
Feature activation-4.55e+00

Pos logit contributions
ies+0.036
ishing+0.034
ig+0.032
els+0.031
ish+0.030
Neg logit contributions
 Curious-0.041
Don-0.041
 Looking-0.039
 Who-0.038
 Fasc-0.037
Token dirt
Token ID13647
Feature activation-1.11e+01

Pos logit contributions
ishing+0.079
ies+0.076
ig+0.072
els+0.071
ooming+0.068
Neg logit contributions
Don-0.080
 Curious-0.080
 Who-0.075
 Looking-0.074
 hadn-0.073
Token!".
Token ID48220
Feature activation+1.34e+01

Pos logit contributions
ishing+0.155
els+0.151
ig+0.143
ooming+0.142
ies+0.140
Neg logit contributions
-0.203
Don-0.200
 hadn-0.200
 shouldn-0.199
 Who-0.197
Token
Token ID198
Feature activation+1.49e+01

Pos logit contributions
Don+0.180
 Curious+0.179
 hadn+0.166
 Fasc+0.165
 Looking+0.164
Neg logit contributions
ies-0.293
ish-0.257
--0.255
ie-0.253
ig-0.251
Token
Token ID198
Feature activation+1.45e+01

Pos logit contributions
 Curious+0.234
Don+0.222
 Looking+0.219
 Fasc+0.218
 hadn+0.210
Neg logit contributions
ies-0.287
--0.270
ie-0.261
ig-0.256
 again-0.247
TokenG
Token ID38
Feature activation+7.86e+00

Pos logit contributions
 Curious+0.223
 Looking+0.215
 Fasc+0.205
 Who+0.204
 Are+0.202
Neg logit contributions
ies-0.297
--0.269
ie-0.264
els-0.243
ig-0.242
Tokenigi
Token ID25754
Feature activation+3.49e+00

Pos logit contributions
 Curious+0.134
Don+0.133
 Looking+0.127
 Fasc+0.121
Would+0.120
Neg logit contributions
ies-0.144
ishing-0.137
els-0.127
ish-0.125
ig-0.125
Token smiled
Token ID13541
Feature activation+4.16e-01

Pos logit contributions
 Curious+0.044
Don+0.042
 Looking+0.040
 Fasc+0.038
 Who+0.036
Neg logit contributions
ies-0.082
ishing-0.078
els-0.074
ig-0.073
ish-0.072
Token and
Token ID290
Feature activation+5.67e+00

Pos logit contributions
 Curious+0.010
Don+0.010
 Looking+0.009
 Fasc+0.009
 Exc+0.009
Neg logit contributions
els-0.005
ies-0.005
ries-0.005
ishing-0.005
ig-0.005
Token if
Token ID611
Feature activation-2.54e-01

Pos logit contributions
ishing+0.108
els+0.106
isher+0.102
ries+0.102
ig+0.099
Neg logit contributions
-0.089
 shouldn-0.085
 Curious-0.083
Don-0.083
 hadn-0.078
Token we
Token ID356
Feature activation-8.18e-02

Pos logit contributions
ies+0.005
ishing+0.005
els+0.004
ig+0.004
ish+0.004
Neg logit contributions
 Curious-0.004
Don-0.004
 Looking-0.004
 Fasc-0.004
 Who-0.004
Token go
Token ID467
Feature activation-2.22e+00

Pos logit contributions
ies+0.002
ishing+0.002
els+0.002
ig+0.002
ins+0.001
Neg logit contributions
 Curious-0.001
Don-0.001
 Looking-0.001
 Who-0.001
 Fasc-0.001
Token there
Token ID612
Feature activation-6.01e+00

Pos logit contributions
els+0.034
ies+0.033
ins+0.033
ishing+0.032
ries+0.031
Neg logit contributions
 Curious-0.043
Don-0.042
 Who-0.039
Hi-0.039
Would-0.038
Token."
Token ID526
Feature activation+1.47e+01

Pos logit contributions
ishing+0.105
els+0.103
ies+0.096
ig+0.094
isher+0.092
Neg logit contributions
 Curious-0.102
Don-0.101
 Would-0.099
-0.099
 hadn-0.098
Token
Token ID198
Feature activation+1.49e+01

Pos logit contributions
 Curious+0.230
Don+0.227
 Looking+0.213
 Terr+0.206
 Fasc+0.205
Neg logit contributions
ies-0.294
ishing-0.261
ig-0.254
els-0.252
ie-0.250
Token
Token ID198
Feature activation+1.47e+01

Pos logit contributions
 Curious+0.216
Don+0.208
 Looking+0.199
 Fasc+0.196
 Terr+0.193
Neg logit contributions
ies-0.314
--0.287
ie-0.282
ig-0.278
ishing-0.271
TokenMax
Token ID11518
Feature activation+5.58e+00

Pos logit contributions
 Curious+0.197
 Looking+0.184
 Terr+0.183
 Fasc+0.175
 Who+0.172
Neg logit contributions
ies-0.328
--0.302
ishing-0.293
ie-0.287
ig-0.287
Token and
Token ID290
Feature activation+8.99e+00

Pos logit contributions
 Curious+0.088
Don+0.086
 Looking+0.084
 Who+0.082
Would+0.081
Neg logit contributions
ies-0.114
ishing-0.102
--0.099
ish-0.099
ig-0.096
Token Lily
Token ID20037
Feature activation+5.10e+00

Pos logit contributions
 Curious+0.173
 Looking+0.170
Don+0.165
 Fasc+0.165
 Terr+0.164
Neg logit contributions
ies-0.156
ie-0.137
ishing-0.125
--0.123
 again-0.120
Token listened
Token ID16399
Feature activation+1.38e+00

Pos logit contributions
 Curious+0.073
Don+0.070
 Looking+0.069
 Who+0.067
 Terr+0.066
Neg logit contributions
ies-0.111
ishing-0.101
ish-0.099
els-0.096
ig-0.096
Token don
Token ID836
Feature activation+6.52e-01

Pos logit contributions
Don+0.017
 Curious+0.017
 Looking+0.015
 Who+0.015
 Are+0.015
Neg logit contributions
ies-0.023
ishing-0.022
els-0.021
ig-0.020
ish-0.020
Token't
Token ID470
Feature activation+1.55e+00

Pos logit contributions
 Curious+0.009
Don+0.009
 Looking+0.008
Would+0.008
 Fasc+0.008
Neg logit contributions
ies-0.014
els-0.013
--0.013
ishing-0.013
ig-0.013
Token bite
Token ID13197
Feature activation-6.39e+00

Pos logit contributions
 Curious+0.025
Don+0.025
 Looking+0.024
 Fasc+0.023
 Who+0.022
Neg logit contributions
ies-0.031
ishing-0.029
els-0.028
ig-0.027
ish-0.027
Token it
Token ID340
Feature activation-5.80e+00

Pos logit contributions
ishing+0.112
ies+0.103
ig+0.102
ooming+0.102
els+0.098
Neg logit contributions
-0.105
Don-0.101
 Who-0.100
 Are-0.100
 shouldn-0.100
Token!"
Token ID2474
Feature activation+1.45e+01

Pos logit contributions
ishing+0.080
ies+0.078
els+0.078
ig+0.073
isher+0.071
Neg logit contributions
 Curious-0.118
Don-0.113
 hadn-0.112
-0.111
 shouldn-0.111
Token
Token ID198
Feature activation+1.49e+01

Pos logit contributions
 Curious+0.195
Don+0.195
Would+0.179
 Looking+0.178
 hadn+0.175
Neg logit contributions
ies-0.309
ishing-0.285
els-0.281
ish-0.277
--0.274
Token
Token ID198
Feature activation+1.48e+01

Pos logit contributions
 Curious+0.213
Don+0.204
 hadn+0.199
 Looking+0.197
 Terr+0.193
Neg logit contributions
ies-0.289
--0.278
ig-0.263
els-0.261
 again-0.261
TokenBut
Token ID1537
Feature activation+8.51e+00

Pos logit contributions
 Curious+0.196
 Looking+0.184
 Terr+0.182
 hadn+0.181
Don+0.179
Neg logit contributions
ies-0.309
--0.292
ishing-0.283
els-0.282
ish-0.275
Token Bob
Token ID5811
Feature activation+4.87e+00

Pos logit contributions
 Curious+0.109
Don+0.105
 Looking+0.100
 Fasc+0.095
 Who+0.092
Neg logit contributions
ies-0.202
ishing-0.191
els-0.182
ig-0.180
ish-0.179
Token still
Token ID991
Feature activation+3.33e+00

Pos logit contributions
 Curious+0.063
Don+0.061
 Looking+0.058
 Fasc+0.055
 Who+0.054
Neg logit contributions
ies-0.115
ishing-0.108
els-0.102
ig-0.102
ish-0.102
Token wanted
Token ID2227
Feature activation+2.17e+00

Pos logit contributions
 Curious+0.041
Don+0.039
 Looking+0.037
Would+0.036
 Fasc+0.036
Neg logit contributions
ies-0.081
ishing-0.075
ish-0.072
ig-0.071
els-0.071
Token "
Token ID366
Feature activation+8.58e+00

Pos logit contributions
 Curious+0.066
Don+0.064
 Looking+0.060
 Fasc+0.058
Would+0.056
Neg logit contributions
ies-0.123
ishing-0.113
els-0.109
ig-0.107
ish-0.107
TokenA
Token ID32
Feature activation+2.35e+00

Pos logit contributions
 Curious+0.083
 Looking+0.077
 Terr+0.074
 Fasc+0.071
Don+0.068
Neg logit contributions
ies-0.210
ishing-0.200
els-0.193
ish-0.191
--0.191
Token penny
Token ID26369
Feature activation-6.55e+00

Pos logit contributions
 Curious+0.050
Don+0.049
 Looking+0.047
 Fasc+0.046
 Terr+0.045
Neg logit contributions
ies-0.035
ishing-0.032
ig-0.030
ish-0.029
els-0.029
Token!"
Token ID2474
Feature activation+1.33e+01

Pos logit contributions
ies+0.099
ishing+0.098
els+0.096
ins+0.089
ishes+0.084
Neg logit contributions
 Curious-0.113
 Are-0.111
 hadn-0.111
Don-0.109
-0.108
Token
Token ID198
Feature activation+1.50e+01

Pos logit contributions
Don+0.157
 Curious+0.153
Would+0.149
 hadn+0.144
 Who+0.142
Neg logit contributions
ies-0.306
--0.272
ishing-0.262
ries-0.260
els-0.259
Token
Token ID198
Feature activation+1.49e+01

Pos logit contributions
 Curious+0.213
Don+0.208
 Fasc+0.195
 Looking+0.195
 Terr+0.194
Neg logit contributions
ies-0.320
--0.293
ie-0.291
ig-0.277
 again-0.276
TokenLu
Token ID25596
Feature activation+7.36e+00

Pos logit contributions
 Curious+0.201
Don+0.189
 Looking+0.187
 Terr+0.186
 Who+0.180
Neg logit contributions
ies-0.331
--0.296
ishing-0.296
ie-0.285
els-0.281
Tokency
Token ID948
Feature activation+3.78e+00

Pos logit contributions
 Curious+0.143
Don+0.140
 Looking+0.135
 Fasc+0.130
 Who+0.128
Neg logit contributions
ies-0.124
ishing-0.113
els-0.106
ig-0.105
ish-0.102
Token picked
Token ID6497
Feature activation+1.79e+00

Pos logit contributions
 Curious+0.053
Don+0.052
 Looking+0.049
 Fasc+0.047
 Terr+0.047
Neg logit contributions
ies-0.085
ishing-0.078
ig-0.074
els-0.074
ish-0.074
Token up
Token ID510
Feature activation+1.10e+00

Pos logit contributions
 Curious+0.037
Don+0.036
 Looking+0.035
 Fasc+0.034
 Who+0.034
Neg logit contributions
ies-0.028
ishing-0.026
els-0.024
ig-0.024
ish-0.023
Token the
Token ID262
Feature activation+3.84e+00

Pos logit contributions
 Curious+0.017
Don+0.016
 Looking+0.015
 Fasc+0.015
 Exc+0.014
Neg logit contributions
ies-0.024
ishing-0.023
ig-0.022
ier-0.021
ish-0.021
Token will
Token ID481
Feature activation-8.02e+00

Pos logit contributions
 Curious+0.146
Don+0.145
 hadn+0.143
 Terr+0.141
 Looking+0.140
Neg logit contributions
ies-0.228
els-0.207
 again-0.204
--0.199
ishing-0.199
Token put
Token ID1234
Feature activation-1.24e+01

Pos logit contributions
ishing+0.184
ies+0.181
els+0.174
ig+0.171
ins+0.168
Neg logit contributions
 Curious-0.103
Don-0.100
 Who-0.094
 Looking-0.094
 hadn-0.092
Token it
Token ID340
Feature activation-1.40e+01

Pos logit contributions
ies+0.261
ishing+0.233
ins+0.226
ier+0.223
ooming+0.223
Neg logit contributions
Don-0.166
-0.146
 Are-0.141
 Curious-0.141
 It-0.140
Token on
Token ID319
Feature activation-8.80e+00

Pos logit contributions
els+0.228
ins+0.206
ries+0.203
ies+0.202
ishing+0.194
Neg logit contributions
-0.227
Don-0.224
 hadn-0.220
 Curious-0.210
 had-0.207
Token my
Token ID616
Feature activation-9.59e+00

Pos logit contributions
ies+0.208
ishing+0.199
ins+0.184
ish+0.184
ig+0.182
Neg logit contributions
-0.110
Don-0.108
 Curious-0.104
 Who-0.100
 Exc-0.100
Token paper
Token ID3348
Feature activation-2.07e+01

Pos logit contributions
ies+0.173
ishing+0.165
ins+0.163
ier+0.160
els+0.154
Neg logit contributions
 Curious-0.170
 Exc-0.155
 Looking-0.154
-0.154
 Something-0.152
Token when
Token ID618
Feature activation-7.33e+00

Pos logit contributions
ishing+0.291
ies+0.277
els+0.268
ins+0.267
ries+0.261
Neg logit contributions
-0.389
 Exc-0.364
 Curious-0.349
 Something-0.347
 -0.337
Token I
Token ID314
Feature activation-6.10e+00

Pos logit contributions
ies+0.160
ishing+0.155
els+0.146
ish+0.143
ig+0.143
Neg logit contributions
 Curious-0.103
Don-0.101
 Looking-0.094
 Who-0.093
 hadn-0.091
Token finish
Token ID5461
Feature activation-1.28e+01

Pos logit contributions
ies+0.138
ishing+0.131
els+0.125
ig+0.122
ish+0.121
Neg logit contributions
 Curious-0.083
Don-0.080
 Looking-0.077
 Fasc-0.072
 Who-0.072
Token.
Token ID13
Feature activation+1.62e+00

Pos logit contributions
ies+0.254
ish+0.235
ishing+0.227
ier+0.218
ig+0.217
Neg logit contributions
-0.176
Don-0.165
 hadn-0.164
 Curious-0.163
 Who-0.161
Token It
Token ID632
Feature activation-5.63e+00

Pos logit contributions
 Curious+0.020
Don+0.020
 Fasc+0.018
 hadn+0.018
 Terr+0.018
Neg logit contributions
ies-0.031
ig-0.030
els-0.030
ish-0.029
ishing-0.029
Token you
Token ID345
Feature activation-9.11e+00

Pos logit contributions
 Curious+0.052
Don+0.050
 Looking+0.047
 Fasc+0.045
Would+0.045
Neg logit contributions
ies-0.075
ishing-0.072
els-0.069
ig-0.068
ish-0.067
Token help
Token ID1037
Feature activation-8.39e+00

Pos logit contributions
ies+0.172
ishing+0.167
els+0.158
ig+0.153
ish+0.150
Neg logit contributions
 Curious-0.155
Don-0.150
 Looking-0.144
 Who-0.140
 Fasc-0.137
Token me
Token ID502
Feature activation-1.13e+01

Pos logit contributions
ishing+0.163
ies+0.159
ig+0.158
ish+0.147
els+0.147
Neg logit contributions
Don-0.117
 Who-0.115
 Curious-0.114
-0.114
 Are-0.108
Token hang
Token ID8181
Feature activation-1.55e+01

Pos logit contributions
ies+0.190
els+0.187
ishing+0.185
ries+0.173
ig+0.173
Neg logit contributions
 Curious-0.188
-0.187
 Looking-0.172
Don-0.170
 Who-0.169
Token my
Token ID616
Feature activation-1.01e+01

Pos logit contributions
ies+0.274
ishing+0.271
ish+0.246
ig+0.242
ins+0.242
Neg logit contributions
Don-0.245
-0.242
 Curious-0.237
 Exc-0.226
 Is-0.223
Token art
Token ID1242
Feature activation-2.07e+01

Pos logit contributions
ishing+0.149
ies+0.147
ier+0.145
ins+0.141
els+0.131
Neg logit contributions
 Curious-0.193
 Terr-0.181
 Exc-0.177
 Looking-0.174
Don-0.172
Token?"
Token ID1701
Feature activation+5.65e+00

Pos logit contributions
ies+0.305
ishing+0.295
els+0.270
ish+0.259
ries+0.254
Neg logit contributions
-0.391
 Exc-0.378
 Curious-0.363
 Who-0.351
 Terr-0.343
Token The
Token ID383
Feature activation-6.09e-01

Pos logit contributions
 Curious+0.072
Don+0.068
 Looking+0.066
 Fasc+0.064
 Terr+0.061
Neg logit contributions
ies-0.128
ishing-0.117
els-0.113
ie-0.113
ig-0.112
Token cat
Token ID3797
Feature activation-3.24e+00

Pos logit contributions
ies+0.011
ishing+0.011
els+0.010
ig+0.010
ish+0.010
Neg logit contributions
 Curious-0.011
Don-0.010
 Looking-0.010
 Who-0.010
Would-0.009
Token said
Token ID531
Feature activation-1.89e+00

Pos logit contributions
ies+0.075
ishing+0.070
els+0.067
ig+0.067
ish+0.066
Neg logit contributions
 Curious-0.043
Don-0.041
 Looking-0.039
 Fasc-0.037
 Who-0.036
Token,
Token ID11
Feature activation+1.13e-01

Pos logit contributions
ies+0.027
ishing+0.026
els+0.024
ishes+0.022
ish+0.022
Neg logit contributions
 Curious-0.043
Don-0.041
 Looking-0.041
 Who-0.040
 Fasc-0.039
Token bad
Token ID2089
Feature activation-8.36e+00

Pos logit contributions
ies+0.052
ishing+0.048
els+0.046
ish+0.044
ig+0.043
Neg logit contributions
 Curious-0.066
Don-0.064
 Looking-0.062
 Fasc-0.060
 Exc-0.059
Token!
Token ID0
Feature activation+1.45e+00

Pos logit contributions
ig+0.120
ishing+0.117
els+0.116
ish+0.099
ies+0.095
Neg logit contributions
 Are-0.160
 Curious-0.160
 Who-0.158
Don-0.155
 Would-0.153
Token You
Token ID921
Feature activation-3.07e+00

Pos logit contributions
 Curious+0.018
Don+0.018
 Looking+0.017
 Fasc+0.016
 Exc+0.016
Neg logit contributions
ies-0.033
ishing-0.031
els-0.031
ig-0.030
ish-0.030
Token ruined
Token ID20484
Feature activation-1.08e+01

Pos logit contributions
ies+0.066
ishing+0.061
els+0.059
ish+0.058
ig+0.058
Neg logit contributions
 Curious-0.046
Don-0.044
 Looking-0.043
 Fasc-0.041
Would-0.040
Token my
Token ID616
Feature activation-9.06e+00

Pos logit contributions
ies+0.227
ishing+0.220
ig+0.214
ier+0.207
ish+0.199
Neg logit contributions
Don-0.145
 Curious-0.141
 Who-0.139
 Looking-0.137
 Are-0.133
Token desk
Token ID6915
Feature activation-2.03e+01

Pos logit contributions
ies+0.142
ishing+0.138
ier+0.138
ins+0.132
els+0.129
Neg logit contributions
 Curious-0.166
 Terr-0.155
Don-0.154
 hadn-0.153
 Who-0.153
Token!
Token ID0
Feature activation+1.74e+00

Pos logit contributions
ishing+0.308
ies+0.301
els+0.292
ishes+0.278
ish+0.263
Neg logit contributions
-0.402
 Curious-0.372
 hadn-0.370
 Exc-0.366
 Terr-0.356
Token Give
Token ID13786
Feature activation-6.21e+00

Pos logit contributions
 Curious+0.025
Don+0.024
 Fasc+0.023
 Looking+0.023
Would+0.022
Neg logit contributions
ies-0.033
els-0.032
ishing-0.032
ig-0.031
ish-0.031
Token me
Token ID502
Feature activation-8.94e+00

Pos logit contributions
ies+0.130
ishing+0.120
ig+0.115
els+0.112
ier+0.110
Neg logit contributions
Don-0.094
 Curious-0.092
 Looking-0.087
 Who-0.086
 Exc-0.083
Token back
Token ID736
Feature activation-8.04e+00

Pos logit contributions
ies+0.174
ishing+0.160
els+0.159
ins+0.154
ig+0.147
Neg logit contributions
 Curious-0.133
Don-0.129
-0.124
Thanks-0.120
 Looking-0.119
Token my
Token ID616
Feature activation-7.41e+00

Pos logit contributions
ies+0.168
ig+0.162
ishing+0.161
ier+0.154
ish+0.153
Neg logit contributions
 Curious-0.104
Don-0.103
Hi-0.097
 Who-0.097
-0.092
Token know
Token ID760
Feature activation-9.59e+00

Pos logit contributions
ies+0.172
ishing+0.167
els+0.158
ig+0.153
ish+0.150
Neg logit contributions
 Curious-0.155
Don-0.150
 Looking-0.144
 Who-0.140
 Fasc-0.137
Token how
Token ID703
Feature activation-6.23e+00

Pos logit contributions
ishing+0.175
ies+0.164
ig+0.159
ish+0.156
ooming+0.150
Neg logit contributions
 Who-0.154
-0.153
 Would-0.150
 Curious-0.150
Don-0.148
Token to
Token ID284
Feature activation-4.89e+00

Pos logit contributions
ies+0.092
ishing+0.086
els+0.079
ig+0.077
ish+0.076
Neg logit contributions
 Curious-0.136
Don-0.132
 Looking-0.128
 Who-0.125
 Fasc-0.124
Token increase
Token ID2620
Feature activation-1.46e+01

Pos logit contributions
ies+0.112
ishing+0.110
ins+0.102
ig+0.102
els+0.101
Neg logit contributions
 Curious-0.062
Don-0.058
 Looking-0.057
 Who-0.056
 Something-0.053
Token your
Token ID534
Feature activation-9.96e+00

Pos logit contributions
ishing+0.239
ig+0.237
ies+0.236
ooming+0.231
ish+0.223
Neg logit contributions
-0.239
Don-0.231
 Curious-0.218
 Are-0.217
 hadn-0.212
Token finger
Token ID7660
Feature activation-2.03e+01

Pos logit contributions
ies+0.174
ier+0.168
ishing+0.166
ins+0.162
els+0.155
Neg logit contributions
 Curious-0.169
 Looking-0.155
 Terr-0.155
 hadn-0.151
-0.151
Token!
Token ID0
Feature activation+1.54e+00

Pos logit contributions
ishing+0.289
els+0.279
ies+0.265
ig+0.254
ooming+0.245
Neg logit contributions
-0.403
 Curious-0.358
Don-0.347
 hadn-0.342
 shouldn-0.340
Tokenâ
Token ID22940
Feature activation-4.38e+00

Pos logit contributions
Don+0.017
 Curious+0.016
 Fasc+0.015
 Looking+0.014
 hadn+0.014
Neg logit contributions
ies-0.033
els-0.032
ish-0.032
ig-0.032
ishing-0.031
Token
Token ID26391
Feature activation-2.67e+00

Pos logit contributions
ies+0.109
ishing+0.108
ig+0.102
ish+0.102
els+0.100
Neg logit contributions
 Curious-0.047
Don-0.044
 Looking-0.041
 Who-0.040
 Fasc-0.040
Token
Token ID198
Feature activation+5.00e+00

Pos logit contributions
ies+0.055
ig+0.050
ishing+0.049
-+0.049
ish+0.048
Neg logit contributions
 Curious-0.040
Don-0.039
 Looking-0.038
Would-0.036
 Fasc-0.036
Token
Token ID198
Feature activation+5.94e+00

Pos logit contributions
 Curious+0.073
Don+0.069
 Fasc+0.068
 Looking+0.067
 Terr+0.065
Neg logit contributions
ies-0.092
ig-0.088
--0.087
ins-0.086
 again-0.083
Token children
Token ID1751
Feature activation-1.06e+01

Pos logit contributions
 Curious+0.061
Don+0.059
 Looking+0.057
 Fasc+0.055
 Terr+0.054
Neg logit contributions
ies-0.096
ishing-0.086
--0.086
ig-0.084
ish-0.082
Token put
Token ID1234
Feature activation-1.13e+01

Pos logit contributions
ishing+0.183
ies+0.180
els+0.168
ig+0.167
ish+0.159
Neg logit contributions
 Curious-0.189
Don-0.182
 Looking-0.176
 hadn-0.175
 Who-0.173
Token on
Token ID319
Feature activation-6.67e+00

Pos logit contributions
ies+0.176
ishing+0.169
ig+0.160
ooming+0.151
els+0.149
Neg logit contributions
Don-0.203
 Curious-0.203
 Looking-0.190
 Exc-0.186
 Terr-0.186
Token their
Token ID511
Feature activation-8.22e+00

Pos logit contributions
ies+0.129
ishing+0.126
ig+0.119
ier+0.117
ish+0.117
Neg logit contributions
 Curious-0.111
 Looking-0.104
Don-0.104
 Fasc-0.101
 Exc-0.100
Token bath
Token ID7837
Feature activation-1.50e+01

Pos logit contributions
ies+0.122
ishing+0.115
ier+0.112
ins+0.106
els+0.104
Neg logit contributions
 Curious-0.171
Don-0.160
 Looking-0.160
 Fasc-0.156
 Who-0.154
Tokenro
Token ID305
Feature activation-2.02e+01

Pos logit contributions
els+0.278
ries+0.262
ishing+0.248
ies+0.240
ish+0.239
Neg logit contributions
 Curious-0.216
 Looking-0.202
Don-0.200
 Exc-0.194
 Something-0.193
Tokenbes
Token ID12636
Feature activation-1.61e+01

Pos logit contributions
ies+0.378
ishing+0.362
els+0.356
ish+0.353
ries+0.342
Neg logit contributions
 Curious-0.321
 Looking-0.307
Don-0.299
 Terr-0.289
 Exc-0.289
Token and
Token ID290
Feature activation-2.60e+00

Pos logit contributions
els+0.225
ies+0.223
ishing+0.217
ig+0.205
ries+0.191
Neg logit contributions
 Curious-0.338
-0.331
Don-0.329
 Exc-0.318
 On-0.312
Token ran
Token ID4966
Feature activation-6.70e+00

Pos logit contributions
ies+0.055
ishing+0.053
els+0.051
ig+0.050
ish+0.049
Neg logit contributions
 Curious-0.037
Don-0.037
 Looking-0.034
 Who-0.033
 Fasc-0.033
Token around
Token ID1088
Feature activation-6.21e+00

Pos logit contributions
ies+0.111
els+0.109
ishing+0.103
ig+0.101
ries+0.101
Neg logit contributions
 Curious-0.132
 Looking-0.127
Don-0.126
 Who-0.119
 Fasc-0.118
Token in
Token ID287
Feature activation-4.53e+00

Pos logit contributions
ies+0.117
ishing+0.111
ish+0.108
els+0.107
ig+0.105
Neg logit contributions
 Curious-0.107
Don-0.106
 Who-0.102
 Looking-0.102
Would-0.101
TokenWhere
Token ID8496
Feature activation-8.15e+00

Pos logit contributions
ishing+0.040
ig+0.038
els+0.038
ier+0.038
ies+0.037
Neg logit contributions
 Curious-0.047
Don-0.045
 Fasc-0.043
 Looking-0.042
Would-0.041
Token did
Token ID750
Feature activation-8.64e+00

Pos logit contributions
ies+0.165
ishing+0.154
els+0.146
ig+0.144
ish+0.143
Neg logit contributions
 Curious-0.131
Don-0.127
 Looking-0.122
 Fasc-0.117
 Who-0.114
Token I
Token ID314
Feature activation-6.19e+00

Pos logit contributions
ies+0.151
ishing+0.143
els+0.132
ig+0.131
ish+0.130
Neg logit contributions
 Curious-0.162
Don-0.158
 Looking-0.154
 Fasc-0.147
 Who-0.147
Token lose
Token ID4425
Feature activation-1.35e+01

Pos logit contributions
ies+0.137
ishing+0.130
els+0.123
ig+0.121
ish+0.120
Neg logit contributions
 Curious-0.087
Don-0.084
 Looking-0.081
 Fasc-0.076
 Who-0.076
Token my
Token ID616
Feature activation-9.38e+00

Pos logit contributions
ies+0.248
ishing+0.233
ins+0.225
els+0.222
ig+0.220
Neg logit contributions
 Curious-0.218
Don-0.218
 Looking-0.212
 Who-0.208
 Are-0.208
Token wand
Token ID11569
Feature activation-2.00e+01

Pos logit contributions
ies+0.147
ishing+0.144
ins+0.137
els+0.136
ier+0.135
Neg logit contributions
 Curious-0.177
 hadn-0.165
Don-0.162
 Looking-0.161
 Terr-0.160
Token?"
Token ID1701
Feature activation+5.89e+00

Pos logit contributions
els+0.288
ies+0.288
ishing+0.279
ins+0.263
ries+0.257
Neg logit contributions
 Curious-0.379
-0.376
 hadn-0.374
Don-0.371
 shouldn-0.354
Token Lily
Token ID20037
Feature activation+2.06e-01

Pos logit contributions
 Curious+0.078
Don+0.074
 Looking+0.071
 Fasc+0.070
Follow+0.068
Neg logit contributions
ies-0.126
ishing-0.113
--0.112
ie-0.112
 again-0.112
Token asked
Token ID1965
Feature activation-7.20e-01

Pos logit contributions
 Curious+0.003
Don+0.003
 Looking+0.003
 Fasc+0.002
Would+0.002
Neg logit contributions
ies-0.005
ishing-0.004
els-0.004
ish-0.004
ig-0.004
Token her
Token ID607
Feature activation-2.46e+00

Pos logit contributions
ishing+0.011
ies+0.011
ig+0.010
ish+0.010
ishes+0.010
Neg logit contributions
 Curious-0.014
 Looking-0.014
Don-0.014
 Who-0.014
 -0.013
Token friends
Token ID2460
Feature activation-5.05e+00

Pos logit contributions
ishing+0.044
ies+0.040
ig+0.039
els+0.039
ries+0.038
Neg logit contributions
 Curious-0.043
 Who-0.041
 Looking-0.039
Don-0.038
Would-0.038
Token better
Token ID1365
Feature activation-1.39e+01

Pos logit contributions
ies+0.044
ishing+0.044
ishes+0.039
ins+0.039
ish+0.038
Neg logit contributions
 Curious-0.084
Don-0.082
 Looking-0.081
 hadn-0.080
 Is-0.078
Token way
Token ID835
Feature activation-1.28e+01

Pos logit contributions
els+0.187
ishing+0.170
ig+0.163
ries+0.155
ies+0.154
Neg logit contributions
Don-0.306
 Curious-0.288
 Are-0.280
 hadn-0.275
 Who-0.274
Token to
Token ID284
Feature activation-6.05e+00

Pos logit contributions
ies+0.185
ishing+0.182
els+0.181
ish+0.165
ig+0.164
Neg logit contributions
Don-0.257
 Curious-0.254
 hadn-0.244
 Looking-0.240
-0.240
Token repair
Token ID9185
Feature activation-1.49e+01

Pos logit contributions
ies+0.131
ishing+0.124
els+0.117
ig+0.115
ish+0.114
Neg logit contributions
 Curious-0.088
Don-0.086
 Looking-0.082
 Who-0.078
 Fasc-0.077
Token the
Token ID262
Feature activation-8.13e+00

Pos logit contributions
ies+0.265
ishing+0.249
ig+0.234
ish+0.226
ooming+0.221
Neg logit contributions
Don-0.250
-0.234
 Curious-0.232
 Who-0.227
 hadn-0.225
Token TV
Token ID3195
Feature activation-1.99e+01

Pos logit contributions
ies+0.147
ishing+0.140
els+0.130
ins+0.128
ig+0.124
Neg logit contributions
 Curious-0.148
Don-0.145
 hadn-0.135
 Terr-0.135
 Who-0.134
Token together
Token ID1978
Feature activation-1.18e+01

Pos logit contributions
ies+0.264
ishing+0.255
els+0.229
ins+0.216
ig+0.215
Neg logit contributions
-0.442
 Curious-0.433
Don-0.429
 Who-0.427
 Exc-0.418
Token."
Token ID526
Feature activation+5.81e+00

Pos logit contributions
ies+0.144
ishing+0.137
els+0.130
ig+0.122
ish+0.120
Neg logit contributions
Don-0.271
 hadn-0.268
 Curious-0.262
-0.261
 Who-0.253
Token
Token ID220
Feature activation+1.31e+00

Pos logit contributions
 Curious+0.066
Don+0.063
 Looking+0.060
 Fasc+0.058
 Terr+0.054
Neg logit contributions
ies-0.139
ishing-0.127
ig-0.123
ish-0.123
els-0.123
Token
Token ID198
Feature activation+6.02e+00

Pos logit contributions
 Curious+0.017
Don+0.016
 Looking+0.015
 Fasc+0.015
Would+0.014
Neg logit contributions
ies-0.029
ishing-0.026
ins-0.026
els-0.026
--0.026
TokenThey
Token ID2990
Feature activation-1.65e+00

Pos logit contributions
 Curious+0.083
 Looking+0.076
 Fasc+0.075
Don+0.074
 hadn+0.071
Neg logit contributions
ies-0.124
ie-0.113
ins-0.113
--0.112
 again-0.110
Token I
Token ID314
Feature activation-6.37e+00

Pos logit contributions
els+0.016
ins+0.016
ishing+0.015
ies+0.015
ries+0.015
Neg logit contributions
 Curious-0.019
 Looking-0.018
 Exc-0.018
Don-0.017
 Fasc-0.017
Token folded
Token ID24650
Feature activation-1.45e+01

Pos logit contributions
ies+0.144
ishing+0.136
els+0.130
ig+0.127
ish+0.126
Neg logit contributions
 Curious-0.088
Don-0.084
 Looking-0.081
 Fasc-0.076
 Who-0.075
Token up
Token ID510
Feature activation-1.23e+01

Pos logit contributions
ies+0.282
ishing+0.255
ins+0.247
ish+0.246
ig+0.246
Neg logit contributions
-0.206
 Curious-0.206
 Exc-0.205
Don-0.201
 Looking-0.198
Token those
Token ID883
Feature activation-1.17e+01

Pos logit contributions
ies+0.208
ishing+0.196
ish+0.188
ig+0.183
els+0.182
Neg logit contributions
 Curious-0.215
Don-0.212
-0.208
 Exc-0.201
 Looking-0.201
Token paper
Token ID3348
Feature activation-2.03e+01

Pos logit contributions
ies+0.174
ishing+0.163
ig+0.148
ooming+0.138
els+0.137
Neg logit contributions
 Curious-0.230
-0.225
 Who-0.218
 Exc-0.218
Don-0.214
Token towels
Token ID38494
Feature activation-1.99e+01

Pos logit contributions
ishing+0.271
ies+0.252
els+0.236
ig+0.230
ooming+0.227
Neg logit contributions
-0.422
 Exc-0.375
 Curious-0.372
 Who-0.359
 hadn-0.357
Token for
Token ID329
Feature activation-7.28e+00

Pos logit contributions
ies+0.259
ishing+0.251
els+0.243
ig+0.241
ins+0.217
Neg logit contributions
-0.410
 Curious-0.399
Don-0.396
 hadn-0.390
 Exc-0.389
Token our
Token ID674
Feature activation-8.23e+00

Pos logit contributions
ies+0.144
ishing+0.136
ish+0.132
ig+0.129
els+0.122
Neg logit contributions
Don-0.114
 Curious-0.112
 Who-0.109
 Looking-0.108
Would-0.105
Token nap
Token ID25422
Feature activation-1.59e+01

Pos logit contributions
ishing+0.123
ies+0.122
ier+0.114
els+0.113
ins+0.111
Neg logit contributions
 Curious-0.154
 Who-0.149
 hadn-0.149
Don-0.148
-0.148
Tokenkins
Token ID5331
Feature activation-1.64e+01

Pos logit contributions
els+0.222
ries+0.209
ishing+0.204
ies+0.200
ish+0.190
Neg logit contributions
-0.286
 hadn-0.285
 Curious-0.274
Don-0.266
 Something-0.261
Token?
Token ID30
Feature activation+8.08e-01

Pos logit contributions
ies+0.215
els+0.212
ishing+0.211
ig+0.196
ries+0.180
Neg logit contributions
-0.356
 Curious-0.336
Don-0.333
 hadn-0.331
 Exc-0.319
Token.
Token ID13
Feature activation+7.43e-01

Pos logit contributions
 Curious+0.039
Don+0.037
 Looking+0.036
 Fasc+0.035
 Who+0.035
Neg logit contributions
ies-0.025
ishing-0.022
ig-0.021
els-0.021
ie-0.020
Token I
Token ID314
Feature activation-2.31e+00

Pos logit contributions
 Curious+0.007
Don+0.007
 Looking+0.007
 Fasc+0.006
 Terr+0.006
Neg logit contributions
ies-0.018
els-0.016
ishing-0.016
ig-0.016
ish-0.016
Token ruined
Token ID20484
Feature activation-1.16e+01

Pos logit contributions
ies+0.049
ishing+0.044
ish+0.043
ig+0.043
els+0.042
Neg logit contributions
 Curious-0.035
Don-0.033
 Looking-0.033
 Fasc-0.031
Would-0.030
Token the
Token ID262
Feature activation-7.79e+00

Pos logit contributions
ies+0.244
ishing+0.240
ig+0.227
ish+0.219
ier+0.214
Neg logit contributions
Don-0.156
 Curious-0.151
 Who-0.148
 Are-0.145
 Looking-0.145
Token pretty
Token ID2495
Feature activation-1.40e+01

Pos logit contributions
ies+0.133
ishing+0.133
els+0.122
ins+0.121
ig+0.116
Neg logit contributions
 Curious-0.136
Don-0.135
 hadn-0.126
 Terr-0.126
-0.125
Token board
Token ID3096
Feature activation-1.98e+01

Pos logit contributions
ishing+0.234
ins+0.215
ies+0.214
ier+0.214
els+0.212
Neg logit contributions
 Curious-0.240
Hi-0.224
 Are-0.223
 Exc-0.222
 Looking-0.221
Token."
Token ID526
Feature activation+5.96e+00

Pos logit contributions
ishing+0.292
ies+0.290
els+0.276
ishes+0.255
ins+0.248
Neg logit contributions
-0.415
 Exc-0.390
 Curious-0.384
Don-0.371
 hadn-0.371
Token
Token ID198
Feature activation+6.94e+00

Pos logit contributions
 Curious+0.072
Don+0.069
 Looking+0.066
 Fasc+0.065
Follow+0.062
Neg logit contributions
ies-0.133
 again-0.118
ishing-0.117
--0.117
ig-0.116
Token
Token ID198
Feature activation+7.08e+00

Pos logit contributions
 Curious+0.105
 Fasc+0.097
 hadn+0.096
 Looking+0.096
Don+0.096
Neg logit contributions
ies-0.133
--0.123
ie-0.118
els-0.118
.-0.117
TokenThe
Token ID464
Feature activation-5.66e-01

Pos logit contributions
 Curious+0.114
 Looking+0.104
 Fasc+0.104
 hadn+0.102
 Terr+0.100
Neg logit contributions
ies-0.129
--0.119
ig-0.116
ins-0.116
els-0.115
Token family
Token ID1641
Feature activation-2.92e+00

Pos logit contributions
ies+0.011
ishing+0.011
els+0.010
ig+0.010
ish+0.009
Neg logit contributions
 Curious-0.010
Don-0.009
 Looking-0.009
 Who-0.009
Would-0.009
Token it
Token ID340
Feature activation-1.30e+01

Pos logit contributions
ies+0.212
ier+0.194
ishing+0.193
ins+0.186
ish+0.184
Neg logit contributions
 Exc-0.120
-0.120
Don-0.118
 Curious-0.118
 Looking-0.116
Token to
Token ID284
Feature activation-4.28e+00

Pos logit contributions
ies+0.213
ishing+0.213
els+0.201
ins+0.198
ooming+0.192
Neg logit contributions
-0.220
 On-0.194
Don-0.191
 hadn-0.189
 Curious-0.188
Token decor
Token ID11705
Feature activation-1.13e+01

Pos logit contributions
ies+0.095
ishing+0.092
ins+0.085
ig+0.085
ish+0.084
Neg logit contributions
 Curious-0.056
Don-0.055
 Looking-0.053
 Who-0.052
 Something-0.051
Tokenate
Token ID378
Feature activation-1.30e+01

Pos logit contributions
ies+0.196
ig+0.166
ishing+0.165
ish+0.164
ier+0.163
Neg logit contributions
 Curious-0.172
Don-0.170
 Looking-0.169
 Fasc-0.168
 Exc-0.165
Token your
Token ID534
Feature activation-9.52e+00

Pos logit contributions
ishing+0.276
ooming+0.266
ies+0.257
ig+0.255
ish+0.244
Neg logit contributions
-0.166
Don-0.152
 Who-0.147
 Is-0.142
 It-0.139
Token paper
Token ID3348
Feature activation-1.98e+01

Pos logit contributions
ier+0.168
ies+0.154
ing+0.149
ishing+0.148
ins+0.146
Neg logit contributions
 Curious-0.151
 Terr-0.146
 Something-0.143
 Who-0.142
 Exc-0.140
Token things
Token ID1243
Feature activation-1.41e+01

Pos logit contributions
ishing+0.284
ooming+0.251
ier+0.247
ins+0.243
ies+0.243
Neg logit contributions
-0.376
 Exc-0.342
 Curious-0.329
 Something-0.325
 Who-0.321
Token.
Token ID13
Feature activation+3.11e+00

Pos logit contributions
ies+0.217
ishing+0.207
els+0.192
ins+0.191
ig+0.184
Neg logit contributions
-0.293
Don-0.252
 hadn-0.247
 Exc-0.247
 Who-0.245
Token And
Token ID843
Feature activation-2.01e+00

Pos logit contributions
 Curious+0.036
Don+0.035
 Looking+0.031
 Fasc+0.031
 hadn+0.031
Neg logit contributions
ies-0.069
els-0.068
ishing-0.066
ig-0.066
ish-0.065
Token you
Token ID345
Feature activation-1.64e+00

Pos logit contributions
ies+0.039
ishing+0.037
els+0.034
ig+0.034
ish+0.034
Neg logit contributions
 Curious-0.033
Don-0.033
 Looking-0.031
 Who-0.031
Would-0.030
Token can
Token ID460
Feature activation-1.52e+00

Pos logit contributions
ies+0.036
ishing+0.034
els+0.032
ig+0.032
ish+0.032
Neg logit contributions
 Curious-0.024
Don-0.023
 Looking-0.022
 Who-0.021
 Fasc-0.021
Token for
Token ID329
Feature activation-7.24e+00

Pos logit contributions
 Curious+0.042
Don+0.040
 Looking+0.039
 Who+0.039
 Fasc+0.037
Neg logit contributions
ies-0.068
ishing-0.068
ish-0.062
els-0.062
ig-0.061
Token fighting
Token ID4330
Feature activation-1.41e+01

Pos logit contributions
ies+0.135
ishing+0.129
ish+0.127
ig+0.124
els+0.118
Neg logit contributions
Don-0.123
 Curious-0.121
 Looking-0.116
 Who-0.113
Would-0.111
Token over
Token ID625
Feature activation-9.77e+00

Pos logit contributions
ig+0.215
ishing+0.199
ies+0.188
els+0.177
ooming+0.176
Neg logit contributions
Don-0.263
-0.261
 Who-0.257
 shouldn-0.256
 hadn-0.255
Token the
Token ID262
Feature activation-8.75e+00

Pos logit contributions
ig+0.178
ies+0.173
ishing+0.166
ish+0.159
ier+0.149
Neg logit contributions
Don-0.163
 Curious-0.157
 Who-0.155
 hadn-0.151
 Are-0.149
Token yellow
Token ID7872
Feature activation-1.51e+01

Pos logit contributions
ies+0.149
ishing+0.148
ig+0.135
els+0.134
ins+0.129
Neg logit contributions
 Curious-0.160
Don-0.154
 Who-0.146
 Terr-0.146
 hadn-0.145
Token block
Token ID2512
Feature activation-1.97e+01

Pos logit contributions
ishing+0.254
ies+0.244
ier+0.235
ins+0.234
els+0.222
Neg logit contributions
 Curious-0.255
 Who-0.244
Hi-0.243
 Are-0.238
-0.237
Token.
Token ID13
Feature activation+3.94e+00

Pos logit contributions
ishing+0.279
ies+0.258
els+0.234
ins+0.224
ishes+0.220
Neg logit contributions
-0.399
 hadn-0.372
 Exc-0.368
 Curious-0.366
 Who-0.366
Token They
Token ID1119
Feature activation-1.69e+00

Pos logit contributions
 Curious+0.044
Don+0.044
 Looking+0.040
 Fasc+0.040
 hadn+0.038
Neg logit contributions
ies-0.085
els-0.077
 again-0.077
ishing-0.076
ig-0.076
Token realized
Token ID6939
Feature activation-2.60e+00

Pos logit contributions
ies+0.033
ishing+0.030
els+0.029
ish+0.029
ig+0.028
Neg logit contributions
 Curious-0.028
Don-0.027
 Looking-0.027
 Fasc-0.026
Would-0.025
Token they
Token ID484
Feature activation-3.14e+00

Pos logit contributions
ies+0.048
ishing+0.047
ig+0.043
ish+0.043
els+0.042
Neg logit contributions
 Curious-0.045
Don-0.045
 Looking-0.043
 Fasc-0.041
 Who-0.041
Token both
Token ID1111
Feature activation-4.10e+00

Pos logit contributions
ies+0.079
ishing+0.071
ish+0.069
els+0.069
ie+0.068
Neg logit contributions
 Curious-0.038
 Looking-0.035
Don-0.034
 Fasc-0.033
 Terr-0.032
Token for
Token ID329
Feature activation-7.24e+00

Pos logit contributions
 Curious+0.083
Don+0.081
 Looking+0.078
 Fasc+0.075
 Who+0.074
Neg logit contributions
ies-0.075
ishing-0.069
els-0.065
ig-0.064
ish-0.064
Token getting
Token ID1972
Feature activation-1.29e+01

Pos logit contributions
ies+0.135
ishing+0.129
ish+0.127
ig+0.124
els+0.118
Neg logit contributions
Don-0.123
 Curious-0.121
 Looking-0.116
 Who-0.113
Would-0.111
Token the
Token ID262
Feature activation-8.71e+00

Pos logit contributions
ig+0.230
ies+0.229
ishing+0.216
els+0.211
ish+0.204
Neg logit contributions
Don-0.216
 Curious-0.206
 hadn-0.193
 Fasc-0.189
 Looking-0.188
Token wild
Token ID4295
Feature activation-1.47e+01

Pos logit contributions
ies+0.153
ishing+0.151
els+0.148
ins+0.140
ig+0.135
Neg logit contributions
 Curious-0.148
Don-0.144
-0.140
 Terr-0.135
 hadn-0.135
Token new
Token ID649
Feature activation-1.19e+01

Pos logit contributions
ishing+0.213
els+0.210
ins+0.206
ies+0.201
ig+0.194
Neg logit contributions
 Curious-0.268
Don-0.259
-0.258
 Are-0.251
 Who-0.250
Token carpet
Token ID20710
Feature activation-1.97e+01

Pos logit contributions
ishing+0.189
ies+0.187
els+0.185
ins+0.181
ier+0.175
Neg logit contributions
 Curious-0.207
-0.199
 Looking-0.193
 Who-0.191
 Exc-0.190
Token.
Token ID13
Feature activation+3.94e+00

Pos logit contributions
els+0.264
ishing+0.255
ies+0.253
ins+0.226
ries+0.221
Neg logit contributions
-0.413
 hadn-0.398
Don-0.392
 Curious-0.390
 Exc-0.371
Token<|endoftext|>
Token ID50256
Feature activation+2.74e+00

Pos logit contributions
 Curious+0.041
Don+0.041
 Looking+0.037
 Fasc+0.036
Follow+0.034
Neg logit contributions
ies-0.089
ig-0.081
 again-0.081
ishing-0.081
ie-0.080
TokenOnce
Token ID7454
Feature activation+2.71e+00

Pos logit contributions
 Curious+0.033
Don+0.032
 Looking+0.030
 Fasc+0.028
 Terr+0.027
Neg logit contributions
ies-0.063
ishing-0.059
ig-0.058
els-0.057
--0.057
Token there
Token ID612
Feature activation+5.14e-01

Pos logit contributions
 Curious+0.046
Don+0.045
 Looking+0.043
 Fasc+0.042
 Who+0.041
Neg logit contributions
ies-0.053
ishing-0.049
ig-0.046
els-0.046
ish-0.045
Token was
Token ID373
Feature activation+2.18e+00

Pos logit contributions
 Curious+0.008
Don+0.008
 Looking+0.008
 Fasc+0.007
 Terr+0.007
Neg logit contributions
ies-0.011
ishing-0.010
ig-0.009
ish-0.009
els-0.009
Token do
Token ID466
Feature activation-8.99e+00

Pos logit contributions
ies+0.011
ishing+0.011
ig+0.010
ish+0.009
ie+0.009
Neg logit contributions
 Curious-0.015
Don-0.015
 Looking-0.014
 Who-0.014
 Fasc-0.014
Token not
Token ID407
Feature activation-5.75e+00

Pos logit contributions
ies+0.164
ishing+0.161
els+0.149
ish+0.146
ig+0.144
Neg logit contributions
 Curious-0.157
Don-0.151
 Looking-0.149
 Who-0.144
 Fasc-0.140
Token ruin
Token ID16866
Feature activation-1.27e+01

Pos logit contributions
ishing+0.127
ies+0.126
els+0.122
ig+0.122
ries+0.115
Neg logit contributions
 Curious-0.076
Don-0.076
 Who-0.071
 Looking-0.069
 Are-0.067
Token my
Token ID616
Feature activation-9.64e+00

Pos logit contributions
ies+0.241
ishing+0.240
ig+0.233
ier+0.227
ish+0.222
Neg logit contributions
 Who-0.178
 Curious-0.176
Don-0.176
 Looking-0.171
 Something-0.169
Token ice
Token ID4771
Feature activation-1.80e+01

Pos logit contributions
ies+0.151
ishing+0.150
ier+0.145
ins+0.139
els+0.138
Neg logit contributions
 Curious-0.179
 Terr-0.165
 Who-0.165
Don-0.164
 Looking-0.164
Token cream
Token ID8566
Feature activation-1.97e+01

Pos logit contributions
els+0.277
ishing+0.230
ooming+0.223
ities+0.222
 pilots+0.218
Neg logit contributions
LL-0.249
-0.243
Me-0.241
 had-0.238
I-0.229
Token.
Token ID13
Feature activation+2.24e+00

Pos logit contributions
els+0.289
ies+0.285
ishing+0.283
ig+0.258
ins+0.240
Neg logit contributions
-0.365
 Who-0.325
 Are-0.319
Don-0.318
 Exc-0.315
Token You
Token ID921
Feature activation-2.77e+00

Pos logit contributions
 Curious+0.028
Don+0.028
 Fasc+0.026
 Looking+0.025
 wond+0.025
Neg logit contributions
ies-0.041
els-0.041
."-0.039
ig-0.039
ish-0.038
Token can
Token ID460
Feature activation-1.86e+00

Pos logit contributions
ies+0.066
ishing+0.060
els+0.059
ish+0.059
ig+0.058
Neg logit contributions
 Curious-0.036
Don-0.033
 Looking-0.033
 Fasc-0.031
Would-0.030
Token stay
Token ID2652
Feature activation-9.23e+00

Pos logit contributions
ies+0.047
ishing+0.045
els+0.042
ig+0.042
ish+0.041
Neg logit contributions
 Curious-0.020
Don-0.019
 Looking-0.018
 Who-0.017
 Fasc-0.017
Token,
Token ID11
Feature activation-3.08e+00

Pos logit contributions
ies+0.148
els+0.145
ig+0.140
ries+0.137
ier+0.135
Neg logit contributions
Don-0.158
 relieved-0.154
 Curious-0.154
 Looking-0.142
-0.141
Token is
Token ID318
Feature activation-6.69e+00

Pos logit contributions
 Curious+0.245
 Looking+0.234
 Fasc+0.225
 Terr+0.221
 Int+0.215
Neg logit contributions
ies-0.205
ins-0.179
--0.173
els-0.167
ig-0.165
Token cheaper
Token ID11721
Feature activation-1.48e+01

Pos logit contributions
ies+0.127
ishing+0.121
ig+0.115
els+0.114
ish+0.110
Neg logit contributions
 Curious-0.113
Don-0.112
 Looking-0.106
 Who-0.103
 hadn-0.102
Token than
Token ID621
Feature activation-1.03e+01

Pos logit contributions
ig+0.213
ishing+0.205
ooming+0.194
els+0.189
ies+0.177
Neg logit contributions
-0.258
 Who-0.252
Don-0.247
 Are-0.244
 Is-0.243
Token sugar
Token ID7543
Feature activation-2.01e+01

Pos logit contributions
ies+0.181
ishing+0.173
ig+0.172
ooming+0.160
ish+0.159
Neg logit contributions
Don-0.178
 Who-0.169
 hadn-0.167
 Curious-0.167
Would-0.161
Token and
Token ID290
Feature activation-6.60e+00

Pos logit contributions
ishing+0.257
ies+0.255
ig+0.226
ooming+0.221
els+0.218
Neg logit contributions
-0.414
Don-0.390
 hadn-0.383
 shouldn-0.368
 Who-0.363
Token flour
Token ID10601
Feature activation-1.97e+01

Pos logit contributions
ies+0.107
ishing+0.104
ig+0.097
ish+0.091
els+0.089
Neg logit contributions
Don-0.131
 Curious-0.124
 Who-0.120
 Are-0.118
 hadn-0.117
Token.
Token ID13
Feature activation+3.17e+00

Pos logit contributions
ishing+0.273
ies+0.253
ig+0.252
ooming+0.233
els+0.210
Neg logit contributions
-0.382
Don-0.375
 He-0.344
 hadn-0.344
 Is-0.340
Token We
Token ID775
Feature activation-1.62e+00

Pos logit contributions
 Curious+0.036
Don+0.034
 Looking+0.032
 Fasc+0.032
 hadn+0.030
Neg logit contributions
ies-0.070
els-0.066
ishing-0.063
ins-0.063
ig-0.063
Token are
Token ID389
Feature activation-3.19e+00

Pos logit contributions
ies+0.036
ishing+0.033
els+0.032
ish+0.032
ig+0.031
Neg logit contributions
 Curious-0.024
Don-0.022
 Looking-0.022
 Fasc-0.021
 Terr-0.020
Token not
Token ID407
Feature activation-4.73e+00

Pos logit contributions
ies+0.067
ishing+0.064
ig+0.061
els+0.059
ish+0.058
Neg logit contributions
 Curious-0.047
Don-0.046
 Looking-0.044
 Who-0.043
 Fasc-0.043
Token very
Token ID845
Feature activation-5.44e+00

Pos logit contributions
ig+0.089
ishing+0.084
els+0.083
ies+0.082
ooming+0.081
Neg logit contributions
Don-0.073
 Curious-0.069
 hadn-0.067
 Who-0.066
 Are-0.065
Token the
Token ID262
Feature activation-7.54e+00

Pos logit contributions
 Curious+0.030
Don+0.029
 Looking+0.028
 Terr+0.027
 Fasc+0.027
Neg logit contributions
ies-0.046
ishing-0.041
ie-0.039
els-0.039
--0.039
Token water
Token ID1660
Feature activation-1.68e+01

Pos logit contributions
ies+0.128
ishing+0.126
els+0.116
ig+0.112
ier+0.110
Neg logit contributions
 Curious-0.139
Don-0.136
 Looking-0.129
 Terr-0.126
 Who-0.125
Token wash
Token ID13502
Feature activation-1.40e+01

Pos logit contributions
ishing+0.208
els+0.206
ies+0.204
ig+0.190
ries+0.174
Neg logit contributions
Don-0.355
 Curious-0.353
 hadn-0.343
 Looking-0.334
-0.329
Token away
Token ID1497
Feature activation-1.19e+01

Pos logit contributions
ies+0.199
ishing+0.199
ig+0.187
els+0.185
ooming+0.173
Neg logit contributions
Don-0.273
-0.263
 Curious-0.262
 Looking-0.250
 hadn-0.250
Token the
Token ID262
Feature activation-7.68e+00

Pos logit contributions
ig+0.152
ishing+0.151
els+0.146
ies+0.132
ish+0.132
Neg logit contributions
 Curious-0.251
Don-0.248
-0.242
 Looking-0.236
 hadn-0.236
Token shampoo
Token ID47259
Feature activation-1.96e+01

Pos logit contributions
ishing+0.126
ies+0.126
els+0.120
ig+0.115
ins+0.110
Neg logit contributions
 Curious-0.140
Don-0.139
 hadn-0.130
 Looking-0.128
 Terr-0.127
Token and
Token ID290
Feature activation-2.55e+00

Pos logit contributions
ishing+0.245
els+0.244
ies+0.223
ries+0.207
ig+0.205
Neg logit contributions
-0.398
 hadn-0.391
Don-0.386
 Curious-0.377
 Exc-0.359
Token she
Token ID673
Feature activation-2.35e+00

Pos logit contributions
ies+0.050
ishing+0.048
els+0.045
ig+0.045
ish+0.044
Neg logit contributions
 Curious-0.041
Don-0.041
 Looking-0.039
 Fasc-0.037
 Who-0.037
Token smiled
Token ID13541
Feature activation-6.74e+00

Pos logit contributions
ies+0.054
ishing+0.051
els+0.049
ig+0.048
ish+0.048
Neg logit contributions
 Curious-0.031
Don-0.030
 Looking-0.029
 Fasc-0.027
 Who-0.026
Token.
Token ID13
Feature activation+5.63e+00

Pos logit contributions
els+0.086
ig+0.080
ishing+0.078
ies+0.076
ries+0.075
Neg logit contributions
 Curious-0.158
Don-0.153
 Looking-0.148
 Fasc-0.143
 Who-0.140
Token Then
Token ID3244
Feature activation+4.25e-02

Pos logit contributions
Don+0.062
 Curious+0.061
 Looking+0.054
 Fasc+0.053
Follow+0.053
Neg logit contributions
ies-0.122
ig-0.112
 again-0.112
ishing-0.111
els-0.111
Token to
Token ID284
Feature activation-4.06e+00

Pos logit contributions
 Curious+0.036
Don+0.035
 Looking+0.033
 Fasc+0.032
 Who+0.031
Neg logit contributions
ies-0.053
ishing-0.050
els-0.047
ig-0.047
ish-0.046
Token work
Token ID670
Feature activation-1.33e+01

Pos logit contributions
ies+0.082
ishing+0.077
els+0.072
ig+0.072
ish+0.071
Neg logit contributions
 Curious-0.066
Don-0.063
 Looking-0.061
 Fasc-0.059
 Who-0.058
Token gathering
Token ID11228
Feature activation-1.41e+01

Pos logit contributions
ies+0.185
ish+0.178
ishing+0.173
ig+0.166
ishes+0.156
Neg logit contributions
-0.267
Don-0.265
 hadn-0.265
 Curious-0.259
 Exc-0.257
Token up
Token ID510
Feature activation-1.26e+01

Pos logit contributions
ies+0.203
ig+0.198
ishing+0.195
ooming+0.188
burn+0.181
Neg logit contributions
 Curious-0.264
Don-0.256
 Looking-0.253
 Exc-0.253
 Fasc-0.251
Token the
Token ID262
Feature activation-9.19e+00

Pos logit contributions
ies+0.196
ishing+0.194
ig+0.193
ish+0.185
ier+0.175
Neg logit contributions
 Curious-0.228
Don-0.221
 Fasc-0.215
 Looking-0.214
 hadn-0.211
Token sand
Token ID6450
Feature activation-1.96e+01

Pos logit contributions
ies+0.154
ishing+0.149
els+0.141
ig+0.138
ins+0.136
Neg logit contributions
 Curious-0.164
Don-0.154
 Terr-0.150
 Fasc-0.149
 Looking-0.149
Token and
Token ID290
Feature activation-3.88e+00

Pos logit contributions
ishing+0.244
els+0.223
ig+0.223
ooming+0.219
ies+0.209
Neg logit contributions
 hadn-0.338
 Curious-0.337
 Exc-0.334
-0.334
 He-0.331
Token carrying
Token ID6872
Feature activation-9.35e+00

Pos logit contributions
ies+0.071
ishing+0.067
ig+0.063
els+0.063
ish+0.062
Neg logit contributions
 Curious-0.069
Don-0.067
 Looking-0.065
 Fasc-0.062
 Who-0.062
Token it
Token ID340
Feature activation-1.06e+01

Pos logit contributions
ies+0.173
ishing+0.161
ig+0.151
ier+0.151
ins+0.148
Neg logit contributions
 Curious-0.156
 Looking-0.148
Don-0.145
 Exc-0.144
 Fasc-0.143
Token to
Token ID284
Feature activation-3.47e+00

Pos logit contributions
ies+0.155
els+0.150
ins+0.147
ishing+0.146
ig+0.141
Neg logit contributions
 Curious-0.203
 hadn-0.197
Don-0.194
 Looking-0.193
-0.192
Token the
Token ID262
Feature activation-5.01e+00

Pos logit contributions
ies+0.064
ishing+0.060
els+0.056
ig+0.056
ish+0.055
Neg logit contributions
 Curious-0.062
Don-0.060
 Looking-0.059
 Fasc-0.056
 Who-0.056
Token those
Token ID883
Feature activation-1.09e+01

Pos logit contributions
ies+0.145
ishing+0.140
ig+0.135
ish+0.133
ier+0.128
Neg logit contributions
Don-0.129
 Curious-0.129
 Looking-0.123
 Who-0.122
 Are-0.119
Token who
Token ID508
Feature activation-8.35e+00

Pos logit contributions
ishing+0.168
ies+0.161
ig+0.157
ooming+0.140
ish+0.136
Neg logit contributions
 Who-0.209
 Curious-0.201
-0.199
Don-0.197
 Are-0.191
Token might
Token ID1244
Feature activation-7.84e+00

Pos logit contributions
ies+0.176
ishing+0.168
ig+0.158
els+0.157
ish+0.153
Neg logit contributions
 Curious-0.124
Don-0.122
 Looking-0.115
 Who-0.111
 Fasc-0.109
Token invade
Token ID30419
Feature activation-1.14e+01

Pos logit contributions
ies+0.176
ishing+0.172
els+0.162
ig+0.161
ish+0.158
Neg logit contributions
 Curious-0.104
Don-0.103
 Looking-0.096
 Who-0.092
 hadn-0.090
Token your
Token ID534
Feature activation-9.67e+00

Pos logit contributions
ies+0.198
ishing+0.191
ig+0.179
ish+0.171
ooming+0.171
Neg logit contributions
Don-0.198
 Curious-0.198
 Who-0.195
 Looking-0.189
 Something-0.184
Token space
Token ID2272
Feature activation-1.95e+01

Pos logit contributions
ier+0.157
ies+0.156
ishing+0.154
ins+0.147
els+0.145
Neg logit contributions
 Curious-0.171
 Who-0.162
Don-0.161
 Terr-0.159
 Looking-0.158
Token.
Token ID13
Feature activation+2.96e+00

Pos logit contributions
ishing+0.268
els+0.259
ries+0.258
ies+0.239
ooming+0.232
Neg logit contributions
-0.373
 Who-0.359
 Curious-0.355
 hadn-0.349
Don-0.347
Token Being
Token ID11204
Feature activation-5.99e+00

Pos logit contributions
Don+0.032
 Curious+0.031
 Fasc+0.029
 Looking+0.028
 hadn+0.028
Neg logit contributions
ies-0.062
ig-0.059
els-0.059
ish-0.058
 again-0.057
Token quiet
Token ID5897
Feature activation-1.28e+01

Pos logit contributions
ies+0.120
ishing+0.112
els+0.105
ig+0.104
ish+0.103
Neg logit contributions
 Curious-0.098
Don-0.094
 Looking-0.091
 Fasc-0.088
 Who-0.086
Token can
Token ID460
Feature activation-5.63e+00

Pos logit contributions
ies+0.204
ishing+0.203
ig+0.189
els+0.187
ries+0.180
Neg logit contributions
 Curious-0.233
Don-0.231
 hadn-0.225
 Are-0.221
 Looking-0.221
Token help
Token ID1037
Feature activation-8.23e+00

Pos logit contributions
ies+0.119
ishing+0.114
els+0.108
ig+0.106
ish+0.106
Neg logit contributions
 Curious-0.083
Don-0.082
 Looking-0.077
 Who-0.073
 Fasc-0.073
Token You
Token ID921
Feature activation-3.29e+00

Pos logit contributions
 Curious+0.003
Don+0.003
 Fasc+0.002
 Looking+0.002
 Terr+0.002
Neg logit contributions
ies-0.005
els-0.004
ishing-0.004
ish-0.004
ig-0.004
Token should
Token ID815
Feature activation-1.87e+00

Pos logit contributions
ies+0.074
ishing+0.069
els+0.065
ig+0.065
ish+0.065
Neg logit contributions
 Curious-0.046
Don-0.045
 Looking-0.043
 Fasc-0.041
Would-0.040
Token never
Token ID1239
Feature activation-4.43e+00

Pos logit contributions
ies+0.041
ishing+0.039
els+0.036
ig+0.036
ish+0.036
Neg logit contributions
 Curious-0.027
Don-0.026
 Looking-0.026
 Who-0.024
 Fasc-0.024
Token touch
Token ID3638
Feature activation-9.99e+00

Pos logit contributions
ies+0.092
ishing+0.085
els+0.081
ig+0.080
ish+0.079
Neg logit contributions
 Curious-0.070
Don-0.068
 Looking-0.065
 Fasc-0.062
 Who-0.061
Token the
Token ID262
Feature activation-7.75e+00

Pos logit contributions
ies+0.205
ishing+0.200
ig+0.193
ier+0.188
ooming+0.186
Neg logit contributions
Don-0.132
 Who-0.126
 Curious-0.125
 Are-0.120
 It-0.117
Token razor
Token ID26811
Feature activation-1.94e+01

Pos logit contributions
ishing+0.135
ies+0.134
ins+0.121
ig+0.121
els+0.119
Neg logit contributions
 Curious-0.132
Don-0.132
 Who-0.125
 Are-0.123
 Terr-0.120
Token.
Token ID13
Feature activation+3.04e+00

Pos logit contributions
ishing+0.318
ies+0.312
els+0.296
ins+0.279
ishes+0.276
Neg logit contributions
-0.373
 Is-0.324
 Who-0.323
 He-0.297
 shouldn-0.297
Token It
Token ID632
Feature activation-4.45e+00

Pos logit contributions
 Curious+0.031
Don+0.031
 Fasc+0.028
 hadn+0.027
Would+0.027
Neg logit contributions
els-0.069
ies-0.068
ish-0.065
ig-0.065
ishing-0.064
Token is
Token ID318
Feature activation-4.38e+00

Pos logit contributions
ies+0.088
ishing+0.080
els+0.077
ish+0.076
ig+0.076
Neg logit contributions
 Curious-0.075
Don-0.072
 Looking-0.070
 Fasc-0.068
Would-0.067
Token very
Token ID845
Feature activation-4.88e+00

Pos logit contributions
ies+0.086
ishing+0.081
ig+0.076
els+0.075
ish+0.074
Neg logit contributions
 Curious-0.072
Don-0.072
 Looking-0.068
 Fasc-0.065
 Who-0.064
Token dangerous
Token ID4923
Feature activation-1.15e+01

Pos logit contributions
ies+0.095
ishing+0.088
els+0.082
ig+0.081
ish+0.081
Neg logit contributions
 Curious-0.084
Don-0.081
 Looking-0.079
 Fasc-0.076
Would-0.074
Token help
Token ID1037
Feature activation-1.04e+01

Pos logit contributions
 Curious+0.104
Don+0.096
 Looking+0.092
 hadn+0.091
 Terr+0.090
Neg logit contributions
ies-0.152
ishing-0.150
els-0.144
ig-0.142
ish-0.142
Token Mom
Token ID11254
Feature activation-8.19e+00

Pos logit contributions
ishing+0.189
ig+0.183
ies+0.175
ish+0.169
els+0.165
Neg logit contributions
-0.167
Don-0.164
 Who-0.160
 Curious-0.158
 Are-0.155
Token to
Token ID284
Feature activation-2.60e+00

Pos logit contributions
ishing+0.128
ies+0.127
ig+0.126
els+0.118
isher+0.110
Neg logit contributions
 Curious-0.151
-0.147
Don-0.145
 Who-0.140
 Looking-0.138
Token pour
Token ID12797
Feature activation-1.38e+01

Pos logit contributions
ies+0.050
ishing+0.047
ig+0.044
els+0.044
ish+0.043
Neg logit contributions
 Curious-0.044
Don-0.043
 Looking-0.042
 Fasc-0.040
 Who-0.040
Token the
Token ID262
Feature activation-9.16e+00

Pos logit contributions
ishing+0.278
ies+0.274
ig+0.272
ooming+0.266
ologist+0.264
Neg logit contributions
-0.191
Don-0.190
 Curious-0.168
 Exc-0.168
 -0.166
Token batter
Token ID4984
Feature activation-1.94e+01

Pos logit contributions
ishing+0.146
ies+0.145
els+0.132
ig+0.132
ier+0.125
Neg logit contributions
 Curious-0.163
-0.157
Don-0.155
 Terr-0.151
 Exc-0.149
Token into
Token ID656
Feature activation-8.71e+00

Pos logit contributions
ishing+0.297
ooming+0.276
ig+0.274
els+0.273
ies+0.270
Neg logit contributions
-0.329
 Exc-0.293
 Curious-0.286
 On-0.281
 -0.280
Token a
Token ID257
Feature activation-7.67e+00

Pos logit contributions
ishing+0.197
ies+0.196
ish+0.189
ig+0.184
ier+0.173
Neg logit contributions
Don-0.106
 Curious-0.105
 Exc-0.098
-0.097
 Fasc-0.096
Token pan
Token ID3425
Feature activation-1.71e+01

Pos logit contributions
ies+0.133
ishing+0.130
els+0.123
ries+0.112
ier+0.111
Neg logit contributions
 Curious-0.142
Don-0.135
 Who-0.131
-0.130
 Looking-0.130
Token.
Token ID13
Feature activation+5.33e+00

Pos logit contributions
ishing+0.243
els+0.225
ig+0.225
ies+0.224
ooming+0.212
Neg logit contributions
-0.329
 Curious-0.306
Don-0.303
 Is-0.299
 Who-0.297
Token Mom
Token ID11254
Feature activation-1.24e+00

Pos logit contributions
Don+0.065
 Curious+0.061
 hadn+0.057
 Looking+0.056
Follow+0.056
Neg logit contributions
ies-0.109
els-0.100
 again-0.099
ish-0.098
ries-0.098
Token You
Token ID921
Feature activation-2.94e+00

Pos logit contributions
Don+0.010
 Curious+0.010
 Fasc+0.009
Would+0.009
 Looking+0.009
Neg logit contributions
ies-0.016
els-0.015
ish-0.015
ig-0.015
ishing-0.015
Token need
Token ID761
Feature activation-6.06e+00

Pos logit contributions
ies+0.067
ishing+0.062
els+0.059
ig+0.059
ish+0.059
Neg logit contributions
 Curious-0.041
Don-0.039
 Looking-0.038
 Fasc-0.036
Would-0.035
Token to
Token ID284
Feature activation-3.45e+00

Pos logit contributions
ies+0.116
ishing+0.109
ish+0.103
ig+0.102
els+0.096
Neg logit contributions
Don-0.106
 Curious-0.103
 Looking-0.099
 Who-0.096
 Something-0.096
Token remove
Token ID4781
Feature activation-1.24e+01

Pos logit contributions
ies+0.077
ishing+0.075
els+0.071
ig+0.070
ins+0.069
Neg logit contributions
 Curious-0.046
Don-0.044
 Looking-0.043
 Who-0.040
 Fasc-0.039
Token the
Token ID262
Feature activation-7.04e+00

Pos logit contributions
ies+0.247
ishing+0.235
ig+0.223
ooming+0.221
ier+0.212
Neg logit contributions
Don-0.178
-0.171
 Curious-0.168
 Who-0.163
 Exc-0.160
Token glue
Token ID22749
Feature activation-1.94e+01

Pos logit contributions
ies+0.119
ishing+0.117
ins+0.105
ig+0.104
els+0.102
Neg logit contributions
 Curious-0.123
Don-0.121
 Who-0.115
 Terr-0.114
 Are-0.112
Token and
Token ID290
Feature activation-5.51e+00

Pos logit contributions
ishing+0.292
ins+0.266
ies+0.265
ooming+0.264
els+0.260
Neg logit contributions
-0.368
Don-0.288
 Exc-0.288
 shouldn-0.286
 hadn-0.281
Token put
Token ID1234
Feature activation-1.11e+01

Pos logit contributions
ies+0.110
ishing+0.106
ig+0.101
ins+0.098
ish+0.097
Neg logit contributions
Don-0.085
 Curious-0.083
 Who-0.077
 Looking-0.077
 Are-0.077
Token it
Token ID340
Feature activation-1.20e+01

Pos logit contributions
ies+0.249
ishing+0.225
ooming+0.217
ig+0.215
ins+0.212
Neg logit contributions
Don-0.143
 Who-0.128
 Curious-0.128
 It-0.125
 Looking-0.125
Token away
Token ID1497
Feature activation-1.15e+01

Pos logit contributions
ies+0.193
els+0.184
ins+0.183
ishing+0.172
ig+0.172
Neg logit contributions
Don-0.205
 hadn-0.197
-0.192
 Curious-0.188
In-0.183
Token,
Token ID11
Feature activation-3.98e+00

Pos logit contributions
ig+0.166
ishing+0.158
ies+0.155
els+0.154
ooming+0.152
Neg logit contributions
Don-0.218
-0.217
 Curious-0.210
 hadn-0.207
 Who-0.202
Token show
Token ID905
Feature activation+9.05e-01

Pos logit contributions
 Curious+0.069
Don+0.069
 Looking+0.065
 Fasc+0.063
Would+0.062
Neg logit contributions
ies-0.074
ishing-0.066
els-0.064
ish-0.062
ig-0.062
Token them
Token ID606
Feature activation-3.08e+00

Pos logit contributions
 Curious+0.017
 Looking+0.016
 Who+0.015
Don+0.015
 Exc+0.015
Neg logit contributions
ishing-0.016
ies-0.016
ig-0.015
ish-0.014
ins-0.014
Token her
Token ID607
Feature activation-1.99e+00

Pos logit contributions
ishing+0.052
ies+0.051
ig+0.048
ish+0.046
ier+0.046
Neg logit contributions
 Curious-0.058
 Looking-0.055
Don-0.053
 Fasc-0.053
 Who-0.053
Token fancy
Token ID14996
Feature activation-4.84e+00

Pos logit contributions
ishing+0.029
ies+0.027
ier+0.026
els+0.025
ish+0.024
Neg logit contributions
 Curious-0.042
 Looking-0.040
 Fasc-0.038
Don-0.038
 Who-0.038
Token dress
Token ID6576
Feature activation-7.84e+00

Pos logit contributions
ishing+0.076
ins+0.074
ier+0.074
ishes+0.069
els+0.069
Neg logit contributions
 Curious-0.091
 Looking-0.086
 Exc-0.080
 Fasc-0.079
Hi-0.076
Token.
Token ID13
Feature activation+1.17e+01

Pos logit contributions
ishing+0.103
ins+0.095
ies+0.094
els+0.093
ishes+0.087
Neg logit contributions
 Curious-0.171
 Looking-0.162
 hadn-0.161
Don-0.159
 Exc-0.158
Token So
Token ID1406
Feature activation+6.03e+00

Pos logit contributions
Don+0.124
 hadn+0.116
 Curious+0.110
Looks+0.105
Thanks+0.104
Neg logit contributions
ies-0.274
ish-0.251
ishing-0.251
els-0.250
--0.248
Token she
Token ID673
Feature activation+5.69e+00

Pos logit contributions
 Curious+0.114
Don+0.111
 Looking+0.108
 Fasc+0.104
 Who+0.102
Neg logit contributions
ies-0.107
ishing-0.099
els-0.092
ig-0.091
ish-0.090
Token raised
Token ID4376
Feature activation-4.19e-01

Pos logit contributions
 Curious+0.075
Don+0.072
 Looking+0.069
 Fasc+0.066
 Who+0.064
Neg logit contributions
ies-0.133
ishing-0.124
els-0.118
ig-0.117
ish-0.117
Token her
Token ID607
Feature activation+1.64e-01

Pos logit contributions
ies+0.008
ishing+0.008
ig+0.008
ish+0.008
ier+0.007
Neg logit contributions
 Curious-0.007
Don-0.007
 Looking-0.007
 Fasc-0.006
 Exc-0.006
Token hand
Token ID1021
Feature activation-5.72e+00

Pos logit contributions
 Curious+0.004
 Looking+0.003
Don+0.003
 Fasc+0.003
 Exc+0.003
Neg logit contributions
ies-0.002
ishing-0.002
ier-0.002
ins-0.002
ries-0.002
Token touching
Token ID15241
Feature activation-2.28e+00

Pos logit contributions
 Curious+0.042
Don+0.040
 Looking+0.040
 Fasc+0.037
Would+0.036
Neg logit contributions
ies-0.043
ishing-0.040
ig-0.040
els-0.039
ish-0.039
Token his
Token ID465
Feature activation-3.27e+00

Pos logit contributions
ies+0.055
ishing+0.054
ig+0.051
ier+0.050
els+0.049
Neg logit contributions
Don-0.027
 Curious-0.026
 Who-0.025
 Looking-0.024
Me-0.023
Token nose
Token ID9686
Feature activation-8.26e+00

Pos logit contributions
ier+0.058
ies+0.057
ishing+0.056
ins+0.056
els+0.051
Neg logit contributions
 Curious-0.058
 Terr-0.053
 Looking-0.053
Don-0.052
 Something-0.051
Token.
Token ID13
Feature activation+9.61e+00

Pos logit contributions
ishing+0.134
isher+0.124
ig+0.122
ins+0.117
ies+0.114
Neg logit contributions
 Curious-0.148
 He-0.146
-0.141
Don-0.139
 -0.134
Token
Token ID198
Feature activation+1.13e+01

Pos logit contributions
 Curious+0.124
Don+0.124
 Looking+0.116
 Fasc+0.115
 hadn+0.114
Neg logit contributions
ies-0.198
 again-0.181
--0.178
ins-0.176
ig-0.175
Token
Token ID198
Feature activation+1.17e+01

Pos logit contributions
 Curious+0.153
Don+0.141
 Looking+0.140
 Fasc+0.139
 Terr+0.131
Neg logit contributions
ies-0.255
ishing-0.225
--0.225
ig-0.225
els-0.221
Token"
Token ID1
Feature activation+8.90e+00

Pos logit contributions
 Curious+0.215
 Looking+0.204
Don+0.200
 Fasc+0.198
 Terr+0.193
Neg logit contributions
ies-0.216
ishing-0.194
ig-0.186
els-0.181
ish-0.179
TokenFl
Token ID7414
Feature activation+7.01e-02

Pos logit contributions
 Curious+0.101
 Looking+0.096
 Fasc+0.090
 Terr+0.089
 Exc+0.086
Neg logit contributions
ies-0.210
ishing-0.196
ins-0.194
els-0.188
--0.184
Tokenuffy
Token ID15352
Feature activation-2.04e+00

Pos logit contributions
 Curious+0.001
 Looking+0.001
Don+0.001
 Something+0.001
 Are+0.001
Neg logit contributions
ies-0.001
ishing-0.001
els-0.001
ier-0.001
ie-0.001
Token,
Token ID11
Feature activation+2.47e+00

Pos logit contributions
ies+0.028
ishing+0.026
els+0.024
ins+0.022
ig+0.022
Neg logit contributions
 Curious-0.045
Don-0.043
 Looking-0.043
 Are-0.041
 Terr-0.041
Token you
Token ID345
Feature activation+1.62e+00

Pos logit contributions
 Curious+0.040
Don+0.040
 Looking+0.038
 Who+0.036
 Are+0.036
Neg logit contributions
ies-0.049
ishing-0.047
els-0.044
ig-0.043
ish-0.042
Token she
Token ID673
Feature activation-1.76e+00

Pos logit contributions
ies+0.031
ishing+0.029
els+0.027
ig+0.027
ish+0.027
Neg logit contributions
 Curious-0.024
Don-0.023
 Looking-0.022
 Fasc-0.021
 Who-0.021
Token wanted
Token ID2227
Feature activation-3.69e+00

Pos logit contributions
ies+0.042
ishing+0.039
ig+0.037
ish+0.037
els+0.037
Neg logit contributions
 Curious-0.022
Don-0.021
 Looking-0.020
 Fasc-0.019
 Terr-0.019
Token to
Token ID284
Feature activation-1.88e-01

Pos logit contributions
ies+0.073
ishing+0.072
ins+0.066
ish+0.066
ishes+0.063
Neg logit contributions
 Who-0.060
 Curious-0.059
Don-0.058
 Looking-0.057
-0.057
Token.
Token ID13
Feature activation+1.03e+01

Pos logit contributions
ies+0.004
ishing+0.004
els+0.004
ig+0.004
ish+0.004
Neg logit contributions
 Curious-0.003
Don-0.003
 Looking-0.003
 Fasc-0.003
 Who-0.002
Token
Token ID220
Feature activation+5.12e+00

Pos logit contributions
 Curious+0.128
Don+0.128
Follow+0.124
Ah+0.120
 hadn+0.119
Neg logit contributions
ies-0.219
 again-0.196
--0.194
els-0.192
ishing-0.192
Token
Token ID198
Feature activation+1.04e+01

Pos logit contributions
 Curious+0.065
Don+0.062
 Looking+0.059
 Fasc+0.057
Would+0.054
Neg logit contributions
ies-0.118
els-0.106
ishing-0.106
ig-0.104
ins-0.103
Token
Token ID198
Feature activation+1.04e+01

Pos logit contributions
 Curious+0.153
Don+0.146
 Terr+0.142
 Looking+0.142
 Fasc+0.140
Neg logit contributions
ies-0.205
--0.190
ie-0.184
 again-0.182
.-0.179
TokenSarah
Token ID29284
Feature activation+3.43e+00

Pos logit contributions
 Curious+0.129
 Terr+0.121
 Looking+0.117
 Fasc+0.113
 Who+0.113
Neg logit contributions
ies-0.235
--0.216
ishing-0.210
els-0.207
ie-0.204
Token was
Token ID373
Feature activation+2.55e+00

Pos logit contributions
 Curious+0.050
Don+0.049
 Looking+0.047
 Who+0.045
 Fasc+0.045
Neg logit contributions
ies-0.074
ishing-0.069
ig-0.066
ish-0.065
els-0.065
Token so
Token ID523
Feature activation+2.41e+00

Pos logit contributions
Don+0.043
 Curious+0.043
 Looking+0.041
 Who+0.040
Would+0.039
Neg logit contributions
ies-0.046
ishing-0.043
--0.040
els-0.040
ish-0.039
Token excited
Token ID6568
Feature activation-3.13e+00

Pos logit contributions
 Curious+0.041
Don+0.040
 Looking+0.040
Would+0.039
 Terr+0.039
Neg logit contributions
ies-0.045
--0.038
ishing-0.038
ish-0.036
ins-0.035
Token it
Token ID340
Feature activation-7.63e+00

Pos logit contributions
ies+0.078
ig+0.076
ishing+0.072
els+0.069
ish+0.066
Neg logit contributions
Don-0.078
 Curious-0.076
 Looking-0.072
 hadn-0.071
 Who-0.070
Token.
Token ID13
Feature activation+1.02e+01

Pos logit contributions
els+0.108
ies+0.107
ishing+0.102
ig+0.101
ins+0.100
Neg logit contributions
-0.159
 Curious-0.152
Don-0.151
 hadn-0.150
 shouldn-0.145
Token<|endoftext|>
Token ID50256
Feature activation+7.21e+00

Pos logit contributions
 Curious+0.111
Don+0.109
 Fasc+0.100
 Looking+0.095
Would+0.093
Neg logit contributions
ies-0.225
ishing-0.215
 again-0.212
els-0.211
ish-0.210
TokenOnce
Token ID7454
Feature activation+7.18e+00

Pos logit contributions
 Curious+0.098
Don+0.096
 Looking+0.090
 Fasc+0.086
 Terr+0.084
Neg logit contributions
ies-0.153
ishing-0.143
ig-0.139
els-0.138
 again-0.136
Token upon
Token ID2402
Feature activation+7.27e+00

Pos logit contributions
 Curious+0.122
Don+0.119
 Looking+0.115
 Fasc+0.110
Would+0.108
Neg logit contributions
ies-0.140
ishing-0.129
els-0.121
ig-0.121
ish-0.119
Token a
Token ID257
Feature activation+6.76e+00

Pos logit contributions
 Curious+0.147
Don+0.144
 Looking+0.141
 Fasc+0.136
 Terr+0.136
Neg logit contributions
ies-0.111
ishing-0.098
els-0.091
--0.090
ig-0.089
Token time
Token ID640
Feature activation+5.92e+00

Pos logit contributions
 Curious+0.134
Don+0.130
 Looking+0.127
 Fasc+0.122
 Who+0.120
Neg logit contributions
ies-0.112
ishing-0.103
els-0.097
ig-0.094
ish-0.094
Token,
Token ID11
Feature activation+8.40e+00

Pos logit contributions
 Curious+0.160
Don+0.156
 Looking+0.154
 Fasc+0.150
 Who+0.148
Neg logit contributions
ies-0.055
ishing-0.049
els-0.042
ish-0.040
ig-0.040
Token there
Token ID612
Feature activation+6.44e+00

Pos logit contributions
 Curious+0.186
Don+0.181
 Looking+0.177
 Fasc+0.172
 Terr+0.171
Neg logit contributions
ies-0.119
ishing-0.104
ig-0.098
els-0.097
ish-0.096
Token was
Token ID373
Feature activation+6.75e+00

Pos logit contributions
 Curious+0.115
Don+0.111
 Looking+0.105
 Terr+0.104
 Fasc+0.102
Neg logit contributions
ies-0.115
ie-0.105
--0.103
 again-0.102
ig-0.102
Token a
Token ID257
Feature activation+5.09e+00

Pos logit contributions
 Curious+0.136
Don+0.133
 Looking+0.129
 Who+0.125
 Terr+0.124
Neg logit contributions
ies-0.106
ishing-0.096
els-0.090
ig-0.088
ish-0.087
Token friends
Token ID2460
Feature activation-6.48e+00

Pos logit contributions
ies+0.056
ig+0.052
ishing+0.051
els+0.051
ish+0.048
Neg logit contributions
 Curious-0.069
Don-0.069
 Looking-0.064
Thanks-0.064
 Fasc-0.063
Token again
Token ID757
Feature activation-6.93e+00

Pos logit contributions
ies+0.098
els+0.097
ishing+0.092
ries+0.088
ig+0.083
Neg logit contributions
Don-0.138
 Curious-0.132
-0.130
Would-0.129
 Who-0.126
Token.
Token ID13
Feature activation+6.89e+00

Pos logit contributions
ishing+0.095
els+0.091
isher+0.089
ish+0.088
ig+0.088
Neg logit contributions
Don-0.147
 Who-0.141
 Curious-0.140
 Are-0.136
-0.134
Token
Token ID198
Feature activation+7.96e+00

Pos logit contributions
 hadn+0.080
 Curious+0.077
Don+0.076
 Fasc+0.075
 Looking+0.072
Neg logit contributions
ies-0.137
.-0.137
 again-0.135
--0.129
 and-0.127
Token
Token ID198
Feature activation+8.48e+00

Pos logit contributions
 Curious+0.127
 Fasc+0.122
 Looking+0.118
 hadn+0.115
 Terr+0.115
Neg logit contributions
ies-0.149
--0.143
.-0.135
 again-0.132
els-0.131
Token"
Token ID1
Feature activation+6.20e+00

Pos logit contributions
 Curious+0.157
 Fasc+0.150
 Looking+0.148
 Terr+0.145
 Exc+0.140
Neg logit contributions
ies-0.147
--0.138
ig-0.127
els-0.125
ishing-0.125
TokenL
Token ID43
Feature activation-1.67e+00

Pos logit contributions
 Curious+0.077
 Looking+0.072
 Fasc+0.071
 Terr+0.070
 Exc+0.066
Neg logit contributions
ies-0.140
ishing-0.132
--0.127
ins-0.126
els-0.126
Tokenily
Token ID813
Feature activation-4.02e+00

Pos logit contributions
ies+0.037
ishing+0.035
els+0.033
ish+0.033
ishes+0.033
Neg logit contributions
Don-0.022
 Curious-0.022
 Looking-0.021
 Who-0.019
 Int-0.019
Token,
Token ID11
Feature activation+1.41e+00

Pos logit contributions
ies+0.047
ishing+0.046
els+0.042
ins+0.038
ig+0.038
Neg logit contributions
 Curious-0.096
Don-0.092
 Looking-0.091
 hadn-0.089
 Are-0.089
Token please
Token ID3387
Feature activation-2.41e+00

Pos logit contributions
 Curious+0.020
Don+0.020
 Looking+0.019
 Who+0.019
 Are+0.018
Neg logit contributions
ies-0.031
ishing-0.030
els-0.028
ins-0.027
ish-0.027
Token don
Token ID836
Feature activation-1.77e+00

Pos logit contributions
ies+0.041
ishing+0.040
els+0.039
ig+0.036
ins+0.036
Neg logit contributions
 Curious-0.045
Don-0.043
 Looking-0.043
 Are-0.041
 Who-0.041
Token,
Token ID11
Feature activation+1.23e+00

Pos logit contributions
 Curious+0.005
 Looking+0.005
Don+0.005
 Who+0.005
 Fasc+0.005
Neg logit contributions
ies-0.003
ishing-0.003
els-0.003
ishes-0.003
ries-0.003
Token "
Token ID366
Feature activation+4.25e+00

Pos logit contributions
 Curious+0.014
Don+0.014
 Looking+0.013
 Fasc+0.012
 Who+0.012
Neg logit contributions
ies-0.031
ishing-0.029
els-0.028
ig-0.028
ish-0.027
TokenNo
Token ID2949
Feature activation-2.99e+00

Pos logit contributions
 Curious+0.044
 Looking+0.041
 Fasc+0.038
 Terr+0.037
Don+0.037
Neg logit contributions
ies-0.104
ishing-0.100
els-0.096
ish-0.095
ins-0.094
Token,
Token ID11
Feature activation-5.33e-01

Pos logit contributions
ishing+0.043
ies+0.040
els+0.036
ins+0.035
ishes+0.034
Neg logit contributions
 Curious-0.065
 Looking-0.062
-0.059
 Are-0.059
 Who-0.059
Token Joe
Token ID5689
Feature activation-4.93e+00

Pos logit contributions
ies+0.012
ishing+0.011
els+0.010
ig+0.010
ins+0.010
Neg logit contributions
 Curious-0.008
Don-0.008
 Looking-0.007
 Who-0.007
 Are-0.007
Token.
Token ID13
Feature activation+2.28e+00

Pos logit contributions
ies+0.051
ishing+0.049
els+0.044
ig+0.040
ins+0.038
Neg logit contributions
 Curious-0.118
Don-0.116
 hadn-0.113
 Looking-0.113
 Who-0.112
Token That
Token ID1320
Feature activation-3.55e+00

Pos logit contributions
 Curious+0.026
Don+0.025
 Looking+0.023
 Fasc+0.023
Would+0.022
Neg logit contributions
ies-0.054
ishing-0.051
ish-0.051
els-0.050
ig-0.049
Token ornament
Token ID45618
Feature activation-1.02e+01

Pos logit contributions
ishing+0.076
ies+0.074
els+0.069
ig+0.066
ooming+0.065
Neg logit contributions
 Curious-0.053
Don-0.051
-0.048
 Exc-0.046
 Looking-0.046
Token is
Token ID318
Feature activation-4.04e+00

Pos logit contributions
ishing+0.218
ies+0.213
els+0.203
ooming+0.199
ig+0.188
Neg logit contributions
 Curious-0.119
 hadn-0.117
-0.116
 shouldn-0.114
Don-0.114
Token special
Token ID2041
Feature activation-1.04e+01

Pos logit contributions
ies+0.078
ishing+0.074
ig+0.070
els+0.068
ish+0.067
Neg logit contributions
 Curious-0.067
Don-0.067
 Looking-0.063
 hadn-0.062
 Fasc-0.061
Token and
Token ID290
Feature activation-2.10e+00

Pos logit contributions
ooming+0.165
ishing+0.162
ies+0.158
els+0.152
ig+0.151
Neg logit contributions
-0.198
 Would-0.190
 Who-0.189
 Could-0.186
 shouldn-0.183
TokenOnce
Token ID7454
Feature activation+2.27e+00

Pos logit contributions
 Curious+0.018
Don+0.017
 Looking+0.016
 Fasc+0.015
 Terr+0.015
Neg logit contributions
ies-0.029
ishing-0.027
els-0.026
ig-0.026
ish-0.026
Token upon
Token ID2402
Feature activation+2.20e+00

Pos logit contributions
 Curious+0.039
Don+0.038
 Looking+0.037
 Fasc+0.035
 Who+0.034
Neg logit contributions
ies-0.044
ishing-0.040
ig-0.038
els-0.038
ish-0.037
Token a
Token ID257
Feature activation+2.01e+00

Pos logit contributions
 Curious+0.045
Don+0.044
 Looking+0.043
 Fasc+0.042
 Terr+0.041
Neg logit contributions
ies-0.033
ishing-0.029
els-0.027
--0.027
ig-0.027
Token time
Token ID640
Feature activation+1.83e+00

Pos logit contributions
 Curious+0.039
Don+0.038
 Looking+0.037
 Fasc+0.036
Would+0.035
Neg logit contributions
ies-0.034
ishing-0.032
els-0.030
ish-0.029
ig-0.029
Token,
Token ID11
Feature activation+4.97e+00

Pos logit contributions
 Curious+0.049
Don+0.048
 Looking+0.047
 Fasc+0.046
 Who+0.045
Neg logit contributions
ies-0.018
ishing-0.016
els-0.014
ish-0.014
ig-0.013
Token there
Token ID612
Feature activation+2.15e+00

Pos logit contributions
 Curious+0.103
Don+0.101
 Looking+0.098
 Fasc+0.094
 Who+0.093
Neg logit contributions
ies-0.077
ishing-0.071
els-0.066
ish-0.064
ig-0.064
Token was
Token ID373
Feature activation+3.15e+00

Pos logit contributions
 Curious+0.039
Don+0.036
 Looking+0.036
 Terr+0.035
 Fasc+0.035
Neg logit contributions
ies-0.039
ie-0.035
--0.034
ig-0.033
ishing-0.033
Token a
Token ID257
Feature activation+1.15e+00

Pos logit contributions
 Curious+0.063
Don+0.061
 Looking+0.060
 Who+0.057
 Fasc+0.057
Neg logit contributions
ies-0.051
ishing-0.046
els-0.043
ig-0.043
ish-0.042
Token little
Token ID1310
Feature activation-2.54e+00

Pos logit contributions
 Curious+0.022
Don+0.021
 Looking+0.021
 Fasc+0.020
 Terr+0.020
Neg logit contributions
ies-0.020
ishing-0.018
ig-0.017
ish-0.017
els-0.017
Token girl
Token ID2576
Feature activation-1.08e+00

Pos logit contributions
ishing+0.053
els+0.052
ies+0.051
ier+0.049
ins+0.048
Neg logit contributions
 Curious-0.035
Don-0.033
 Are-0.032
Hi-0.031
 Looking-0.030
Token called
Token ID1444
Feature activation+1.46e+00

Pos logit contributions
ies+0.027
ishing+0.026
ier+0.026
els+0.025
ins+0.025
Neg logit contributions
Don-0.011
 Curious-0.011
 Looking-0.010
 Who-0.009
Would-0.009
Token discovered
Token ID5071
Feature activation-4.74e+00

Pos logit contributions
ies+0.037
ishing+0.034
els+0.033
ish+0.032
ig+0.032
Neg logit contributions
 Curious-0.034
Don-0.033
 Looking-0.032
 Fasc-0.031
 Who-0.030
Token something
Token ID1223
Feature activation-3.98e+00

Pos logit contributions
ies+0.099
ishing+0.093
ish+0.083
ins+0.083
els+0.083
Neg logit contributions
 Curious-0.074
 Looking-0.070
Don-0.069
 Exc-0.067
 Who-0.066
Token special
Token ID2041
Feature activation-7.89e+00

Pos logit contributions
ies+0.076
ishing+0.072
els+0.067
ig+0.066
ish+0.065
Neg logit contributions
 Curious-0.068
Don-0.066
 Looking-0.064
 Who-0.061
 Fasc-0.061
Token!"
Token ID2474
Feature activation+9.21e+00

Pos logit contributions
ies+0.116
ishing+0.116
els+0.107
ooming+0.107
ig+0.104
Neg logit contributions
-0.148
 Curious-0.146
 hadn-0.142
Don-0.142
 Would-0.142
Token Ivy
Token ID27386
Feature activation+2.98e+00

Pos logit contributions
 Curious+0.142
Would+0.142
Don+0.142
Thanks+0.138
 hadn+0.137
Neg logit contributions
ies-0.163
--0.146
ie-0.142
.-0.141
 again-0.140
Token shouted
Token ID17293
Feature activation+1.73e+00

Pos logit contributions
Don+0.037
 Curious+0.037
 Looking+0.034
 Fasc+0.033
Would+0.032
Neg logit contributions
ies-0.073
ishing-0.065
--0.064
ig-0.063
ish-0.063
Token with
Token ID351
Feature activation+1.52e+00

Pos logit contributions
 Curious+0.045
 Looking+0.043
Don+0.043
 Who+0.042
 Fasc+0.042
Neg logit contributions
ies-0.018
ishing-0.017
els-0.015
ig-0.015
ins-0.015
Token excitement
Token ID14067
Feature activation-5.48e+00

Pos logit contributions
Don+0.027
 Curious+0.027
Would+0.026
 Fasc+0.026
 Who+0.026
Neg logit contributions
ies-0.022
ishing-0.020
els-0.020
 again-0.020
--0.019
Token.
Token ID13
Feature activation+8.93e+00

Pos logit contributions
ig+0.081
ishing+0.077
ishes+0.074
els+0.073
ish+0.073
Neg logit contributions
 Curious-0.111
 Looking-0.105
-0.102
 hadn-0.100
 �-0.100
Token
Token ID198
Feature activation+1.08e+01

Pos logit contributions
Don+0.109
 hadn+0.105
 Curious+0.105
Looks+0.105
Ta+0.098
Neg logit contributions
ies-0.169
 again-0.167
--0.161
ig-0.157
ing-0.157
Token
Token ID198
Feature activation+1.11e+01

Pos logit contributions
 Curious+0.149
 hadn+0.137
Don+0.136
 Fasc+0.134
 Looking+0.133
Neg logit contributions
ies-0.212
--0.205
ig-0.204
 again-0.197
ins-0.196
Token to
Token ID284
Feature activation-2.15e-01

Pos logit contributions
 Curious+0.017
Don+0.017
 Looking+0.017
 Fasc+0.016
 Terr+0.016
Neg logit contributions
ies-0.007
ishing-0.006
--0.006
els-0.005
ig-0.005
Token do
Token ID466
Feature activation-3.65e+00

Pos logit contributions
ies+0.005
ishing+0.005
els+0.004
ig+0.004
ish+0.004
Neg logit contributions
 Curious-0.003
Don-0.003
 Looking-0.003
 Fasc-0.003
 Who-0.002
Token.
Token ID13
Feature activation+9.83e+00

Pos logit contributions
ies+0.049
ig+0.046
ishing+0.045
ish+0.045
ishes+0.044
Neg logit contributions
Don-0.083
 Curious-0.080
 Who-0.080
 hadn-0.079
 Looking-0.078
Token His
Token ID2399
Feature activation+1.77e+00

Pos logit contributions
Don+0.108
Ta+0.108
Follow+0.106
Take+0.102
Would+0.101
Neg logit contributions
ies-0.200
 again-0.197
--0.196
ishing-0.191
.-0.188
Token mom
Token ID1995
Feature activation+1.41e+00

Pos logit contributions
 Curious+0.034
Don+0.032
 Looking+0.032
 Who+0.031
 hadn+0.031
Neg logit contributions
ies-0.030
ishing-0.029
els-0.026
ier-0.026
ins-0.026
Tokenmy
Token ID1820
Feature activation+1.79e-01

Pos logit contributions
 Curious+0.030
Don+0.030
 Looking+0.029
 Fasc+0.028
Would+0.027
Neg logit contributions
ies-0.021
ishing-0.019
els-0.018
ig-0.018
ish-0.017
Token suggested
Token ID5220
Feature activation+2.45e+00

Pos logit contributions
 Curious+0.002
Don+0.002
 Looking+0.002
 Fasc+0.002
 Terr+0.002
Neg logit contributions
ies-0.004
ishing-0.004
els-0.004
ig-0.004
ish-0.004
Token that
Token ID326
Feature activation+3.12e+00

Pos logit contributions
 Curious+0.044
Don+0.044
 Looking+0.042
 Fasc+0.040
 Who+0.040
Neg logit contributions
ies-0.045
ishing-0.043
ig-0.041
els-0.040
ish-0.040
Token they
Token ID484
Feature activation+3.96e-01

Pos logit contributions
Don+0.041
 Curious+0.040
 Looking+0.037
Would+0.036
 Who+0.036
Neg logit contributions
ies-0.072
ishing-0.070
ig-0.067
els-0.065
ish-0.065
Token go
Token ID467
Feature activation-6.62e-01

Pos logit contributions
 Curious+0.005
Don+0.005
 Looking+0.005
 Who+0.005
 hadn+0.005
Neg logit contributions
ies-0.009
ishing-0.009
ig-0.008
els-0.008
ins-0.008
Token up
Token ID510
Feature activation-3.77e+00

Pos logit contributions
ies+0.010
els+0.010
ishing+0.010
ins+0.009
ish+0.009
Neg logit contributions
 Curious-0.013
Don-0.013
 Looking-0.013
 Who-0.012
Would-0.012
Token his
Token ID465
Feature activation-5.37e+00

Pos logit contributions
ies+0.097
ishing+0.090
ish+0.086
ig+0.084
els+0.082
Neg logit contributions
 Curious-0.078
Don-0.076
 Looking-0.073
 Fasc-0.071
 Exc-0.069
Token large
Token ID1588
Feature activation-8.90e+00

Pos logit contributions
ies+0.097
ishing+0.089
ier+0.085
els+0.083
ins+0.083
Neg logit contributions
 Curious-0.099
 Looking-0.093
Don-0.092
 Fasc-0.089
 Exc-0.089
Token pocket
Token ID10000
Feature activation-1.35e+01

Pos logit contributions
ies+0.155
ier+0.149
ins+0.142
ishing+0.141
els+0.129
Neg logit contributions
 Curious-0.164
 Looking-0.154
 Exc-0.151
 Fasc-0.145
 Are-0.143
Token and
Token ID290
Feature activation-3.90e-01

Pos logit contributions
ishing+0.172
ies+0.171
els+0.166
ries+0.159
ins+0.150
Neg logit contributions
 Curious-0.271
-0.261
 Exc-0.258
 hadn-0.258
Don-0.257
Token then
Token ID788
Feature activation-1.26e+00

Pos logit contributions
ies+0.008
ishing+0.008
els+0.007
ig+0.007
ish+0.007
Neg logit contributions
 Curious-0.006
Don-0.006
 Looking-0.006
 Fasc-0.005
 Who-0.005
Token followed
Token ID3940
Feature activation-3.02e+00

Pos logit contributions
ies+0.028
ishing+0.027
els+0.026
ig+0.025
ish+0.025
Neg logit contributions
 Curious-0.017
Don-0.017
 Looking-0.016
 Fasc-0.015
 Who-0.015
Token his
Token ID465
Feature activation-2.49e+00

Pos logit contributions
ies+0.061
ishing+0.057
ig+0.055
ins+0.054
ish+0.053
Neg logit contributions
 Curious-0.048
Don-0.045
 Looking-0.045
 Who-0.045
 Fasc-0.043
Token mom
Token ID1995
Feature activation-4.26e+00

Pos logit contributions
ies+0.043
ishing+0.041
ins+0.038
ier+0.038
els+0.037
Neg logit contributions
 Curious-0.046
 Who-0.044
 Looking-0.043
Don-0.041
 Exc-0.040
Token.
Token ID13
Feature activation+7.12e+00

Pos logit contributions
ies+0.064
ishing+0.060
ier+0.058
ins+0.056
els+0.055
Neg logit contributions
-0.084
 Curious-0.084
 Who-0.079
 Looking-0.079
Don-0.078
Token He
Token ID679
Feature activation-1.54e-01

Pos logit contributions
Don+0.065
 Curious+0.062
 Looking+0.052
 hadn+0.052
Would+0.052
Neg logit contributions
ies-0.177
ishing-0.166
els-0.163
--0.161
ig-0.158
Token hoped
Token ID10719
Feature activation-1.96e+00

Pos logit contributions
ies+0.003
ishing+0.003
ig+0.003
els+0.003
-+0.003
Neg logit contributions
 Curious-0.002
Don-0.002
 Looking-0.002
 Fasc-0.002
 Terr-0.002
Token but
Token ID475
Feature activation+2.92e+00

Pos logit contributions
 Curious+0.017
Don+0.016
 Looking+0.016
 Fasc+0.015
 Who+0.015
Neg logit contributions
ies-0.015
ishing-0.014
els-0.013
ig-0.013
ish-0.013
Token he
Token ID339
Feature activation+7.65e-01

Pos logit contributions
 Curious+0.049
Don+0.048
 Looking+0.046
 Fasc+0.044
 Who+0.043
Neg logit contributions
ies-0.057
ishing-0.053
els-0.050
ig-0.050
ish-0.049
Token didn
Token ID1422
Feature activation+3.14e-01

Pos logit contributions
 Curious+0.009
Don+0.008
 Looking+0.008
 Fasc+0.008
 Who+0.007
Neg logit contributions
ies-0.019
ishing-0.018
els-0.017
ig-0.017
ish-0.017
Token't
Token ID470
Feature activation+6.47e-01

Pos logit contributions
 Curious+0.004
Don+0.004
 Looking+0.004
Would+0.004
 Fasc+0.004
Neg logit contributions
ies-0.007
els-0.006
ishing-0.006
ig-0.006
--0.006
Token stay
Token ID2652
Feature activation-4.62e+00

Pos logit contributions
 Curious+0.008
Don+0.008
 Who+0.008
 Looking+0.008
 Are+0.007
Neg logit contributions
ishing-0.015
ies-0.015
els-0.014
ig-0.014
ries-0.014
Token long
Token ID890
Feature activation-1.12e+01

Pos logit contributions
ies+0.078
els+0.071
ig+0.070
ishing+0.070
ish+0.068
Neg logit contributions
 Curious-0.085
Don-0.083
 Looking-0.080
 Fasc-0.078
 hadn-0.076
Token.
Token ID13
Feature activation+8.80e+00

Pos logit contributions
ies+0.163
ish+0.158
els+0.157
ig+0.154
ishing+0.147
Neg logit contributions
-0.207
 hadn-0.203
 Curious-0.201
Don-0.196
 Would-0.183
Token He
Token ID679
Feature activation+1.32e+00

Pos logit contributions
Don+0.084
 Curious+0.074
Follow+0.069
Would+0.068
 Fasc+0.066
Neg logit contributions
ies-0.203
 again-0.200
ishing-0.199
ish-0.196
els-0.188
Token saw
Token ID2497
Feature activation-3.41e-01

Pos logit contributions
 Curious+0.017
Don+0.017
 Looking+0.016
 Fasc+0.016
 Terr+0.015
Neg logit contributions
ies-0.030
ishing-0.028
ig-0.027
ish-0.027
els-0.027
Token the
Token ID262
Feature activation-1.08e+00

Pos logit contributions
ies+0.007
ishing+0.006
ig+0.006
ish+0.006
els+0.006
Neg logit contributions
 Curious-0.006
Don-0.006
 Looking-0.006
 Who-0.005
Would-0.005
Token security
Token ID2324
Feature activation-7.43e+00

Pos logit contributions
ies+0.020
ishing+0.018
els+0.017
ig+0.017
ish+0.017
Neg logit contributions
 Curious-0.020
Don-0.019
 Looking-0.019
 Fasc-0.018
 Who-0.018
Token they
Token ID484
Feature activation+1.94e+00

Pos logit contributions
 Curious+0.060
 Looking+0.057
Don+0.057
 Terr+0.054
 Fasc+0.053
Neg logit contributions
ies-0.071
ishing-0.061
--0.059
ie-0.059
ish-0.057
Token decided
Token ID3066
Feature activation+4.87e-01

Pos logit contributions
 Curious+0.027
Don+0.026
 Looking+0.025
 Fasc+0.024
 Who+0.023
Neg logit contributions
ies-0.044
ishing-0.041
els-0.039
ig-0.039
ish-0.038
Token to
Token ID284
Feature activation+3.63e+00

Pos logit contributions
 Curious+0.011
Don+0.011
 Looking+0.010
 Fasc+0.010
 Who+0.010
Neg logit contributions
ies-0.007
ishing-0.006
els-0.006
ig-0.006
ish-0.006
Token have
Token ID423
Feature activation-1.28e-01

Pos logit contributions
 Curious+0.055
Don+0.051
 Looking+0.051
 Fasc+0.048
 Who+0.048
Neg logit contributions
ies-0.078
ishing-0.075
els-0.069
ig-0.068
ins-0.067
Token a
Token ID257
Feature activation+5.24e-01

Pos logit contributions
ies+0.003
ishing+0.002
els+0.002
ish+0.002
ig+0.002
Neg logit contributions
 Curious-0.002
Don-0.002
 Looking-0.002
 Fasc-0.002
Would-0.002
Token race
Token ID3234
Feature activation-6.96e+00

Pos logit contributions
 Curious+0.009
Don+0.008
 Looking+0.008
 Fasc+0.008
 Who+0.008
Neg logit contributions
ies-0.010
ishing-0.010
els-0.010
ier-0.009
ins-0.009
Token to
Token ID284
Feature activation+1.03e+00

Pos logit contributions
ies+0.104
ishing+0.099
els+0.095
ins+0.091
ish+0.089
Neg logit contributions
 Curious-0.129
 Who-0.128
Don-0.127
 Exc-0.123
-0.122
Token the
Token ID262
Feature activation+1.09e+00

Pos logit contributions
 Curious+0.018
Don+0.018
 Looking+0.017
 Fasc+0.016
 Terr+0.016
Neg logit contributions
ies-0.019
ishing-0.018
els-0.017
ig-0.017
ish-0.017
Token lovely
Token ID14081
Feature activation-4.35e+00

Pos logit contributions
 Curious+0.019
Don+0.018
 Looking+0.017
 Fasc+0.017
 Who+0.017
Neg logit contributions
ies-0.021
ishing-0.020
els-0.019
ig-0.019
ish-0.018
Token pond
Token ID16723
Feature activation-8.64e+00

Pos logit contributions
ies+0.095
els+0.093
ishing+0.089
ier+0.087
ins+0.087
Neg logit contributions
 Curious-0.059
 Fasc-0.053
 Looking-0.053
Hi-0.051
 Who-0.050
Token.
Token ID13
Feature activation+1.25e+01

Pos logit contributions
ishing+0.111
els+0.110
ies+0.107
ig+0.106
ries+0.095
Neg logit contributions
 Curious-0.179
Don-0.176
 Who-0.166
 hadn-0.165
-0.163
Token brother
Token ID3956
Feature activation-1.93e+00

Pos logit contributions
ishing+0.027
ies+0.026
ier+0.025
els+0.025
ins+0.025
Neg logit contributions
 Curious-0.024
 Looking-0.023
Don-0.023
 Who-0.022
 Fasc-0.021
Token Tom
Token ID4186
Feature activation-2.09e+00

Pos logit contributions
ies+0.039
ishing+0.035
els+0.033
ish+0.033
-+0.033
Neg logit contributions
 Curious-0.031
Don-0.031
 Looking-0.029
 Fasc-0.029
 Terr-0.028
Token came
Token ID1625
Feature activation-1.25e+00

Pos logit contributions
ies+0.046
ishing+0.042
els+0.040
ig+0.040
ish+0.040
Neg logit contributions
 Curious-0.031
Don-0.030
 Looking-0.028
 Fasc-0.027
 Who-0.026
Token into
Token ID656
Feature activation-2.54e+00

Pos logit contributions
ies+0.020
ig+0.020
ishing+0.019
els+0.019
ier+0.019
Neg logit contributions
 Curious-0.023
Don-0.021
 Looking-0.021
-0.021
 Fasc-0.021
Token her
Token ID607
Feature activation-5.45e+00

Pos logit contributions
ies+0.046
ishing+0.043
els+0.040
ig+0.040
ish+0.040
Neg logit contributions
 Curious-0.046
Don-0.045
 Looking-0.044
 Fasc-0.042
 Who-0.041
Token room
Token ID2119
Feature activation-9.37e+00

Pos logit contributions
ier+0.103
ies+0.099
ins+0.097
ishing+0.094
els+0.092
Neg logit contributions
 Curious-0.087
 Looking-0.079
 Exc-0.078
 Fasc-0.077
Don-0.077
Token.
Token ID13
Feature activation+7.27e+00

Pos logit contributions
ig+0.146
ies+0.144
ishes+0.143
ish+0.141
ishing+0.141
Neg logit contributions
 Curious-0.178
Don-0.168
 Looking-0.167
 On-0.164
 He-0.161
Token He
Token ID679
Feature activation+6.94e-01

Pos logit contributions
Don+0.097
Follow+0.096
 hadn+0.093
Would+0.086
 wond+0.085
Neg logit contributions
ies-0.143
--0.134
.-0.133
 blocks-0.130
 and-0.130
Token saw
Token ID2497
Feature activation+2.06e-02

Pos logit contributions
 Curious+0.011
Don+0.011
Follow+0.011
 Looking+0.011
 Fasc+0.011
Neg logit contributions
ies-0.014
--0.012
ie-0.012
 and-0.011
els-0.011
Token Jazz
Token ID21406
Feature activation-7.68e+00

Pos logit contributions
 Curious+0.000
Don+0.000
 Looking+0.000
 Fasc+0.000
 Who+0.000
Neg logit contributions
ies-0.000
ishing-0.000
ig-0.000
ish-0.000
els-0.000
Token on
Token ID319
Feature activation-3.52e+00

Pos logit contributions
ishing+0.114
ies+0.113
ishes+0.104
isher+0.103
els+0.103
Neg logit contributions
 Curious-0.154
-0.149
 Looking-0.148
Don-0.145
 Exc-0.141
Token girl
Token ID2576
Feature activation-7.98e-01

Pos logit contributions
 Curious+0.029
Don+0.027
 Looking+0.027
 Fasc+0.025
 Who+0.025
Neg logit contributions
ies-0.028
ishing-0.028
els-0.026
ig-0.025
ish-0.025
Token found
Token ID1043
Feature activation-2.22e+00

Pos logit contributions
ies+0.019
ishing+0.017
els+0.017
ig+0.017
ish+0.016
Neg logit contributions
 Curious-0.010
Don-0.010
 Looking-0.010
 Fasc-0.009
 Who-0.009
Token some
Token ID617
Feature activation-4.80e+00

Pos logit contributions
ies+0.041
ishing+0.039
ig+0.036
ish+0.035
els+0.035
Neg logit contributions
 Curious-0.039
Don-0.039
 Looking-0.037
 Fasc-0.035
 Exc-0.035
Token cr
Token ID1067
Feature activation-7.56e+00

Pos logit contributions
ies+0.084
ishing+0.083
ig+0.075
ish+0.073
ins+0.073
Neg logit contributions
 Curious-0.082
Don-0.078
 Fasc-0.076
 Looking-0.076
 Exc-0.074
Tokenay
Token ID323
Feature activation-6.39e+00

Pos logit contributions
ies+0.133
ishing+0.123
ish+0.112
ig+0.112
els+0.107
Neg logit contributions
 Curious-0.133
 Looking-0.130
 Exc-0.125
Don-0.124
 Fasc-0.121
Tokenons
Token ID684
Feature activation-9.06e+00

Pos logit contributions
ishing+0.101
ies+0.099
burn+0.089
ish+0.088
ier+0.086
Neg logit contributions
-0.108
 Looking-0.104
 Curious-0.103
Don-0.103
 hadn-0.101
Token and
Token ID290
Feature activation+5.96e-01

Pos logit contributions
ishing+0.117
ies+0.112
els+0.104
ins+0.103
ig+0.099
Neg logit contributions
-0.189
 hadn-0.186
Don-0.181
 Curious-0.180
 Looking-0.171
Token hard
Token ID1327
Feature activation-4.65e+00

Pos logit contributions
 Curious+0.009
Don+0.009
 Looking+0.008
 Fasc+0.008
Would+0.008
Neg logit contributions
ies-0.012
ishing-0.012
ig-0.011
ish-0.011
els-0.011
Token markers
Token ID19736
Feature activation-9.87e+00

Pos logit contributions
ies+0.074
ins+0.073
ish+0.072
ishing+0.071
ig+0.071
Neg logit contributions
 Curious-0.077
 Looking-0.077
 Who-0.073
Don-0.071
 Fasc-0.070
Token in
Token ID287
Feature activation-2.39e+00

Pos logit contributions
ishing+0.129
ig+0.117
ies+0.114
ish+0.107
ins+0.105
Neg logit contributions
 hadn-0.203
-0.197
Don-0.196
 Curious-0.191
 Exc-0.186
Token her
Token ID607
Feature activation-4.86e+00

Pos logit contributions
ies+0.045
ishing+0.044
ish+0.041
ig+0.041
ins+0.039
Neg logit contributions
 Curious-0.041
Don-0.040
 Looking-0.038
 Who-0.037
 Exc-0.037
Token and
Token ID290
Feature activation+9.81e-01

Pos logit contributions
els+0.134
ies+0.134
ishing+0.127
ins+0.126
ries+0.125
Neg logit contributions
 Curious-0.156
Don-0.156
 hadn-0.153
-0.149
 Sadly-0.147
Token watch
Token ID2342
Feature activation-3.65e+00

Pos logit contributions
 Curious+0.014
Don+0.014
 Looking+0.013
 Who+0.013
 Fasc+0.012
Neg logit contributions
ies-0.021
ishing-0.020
els-0.019
ig-0.019
ish-0.019
Token all
Token ID477
Feature activation-1.04e+00

Pos logit contributions
ies+0.082
ishing+0.080
ig+0.079
ins+0.078
els+0.075
Neg logit contributions
 Curious-0.046
 Who-0.044
Don-0.043
 Looking-0.042
 Fasc-0.041
Token the
Token ID262
Feature activation-3.17e-01

Pos logit contributions
ies+0.021
ishing+0.019
els+0.019
ig+0.018
ish+0.018
Neg logit contributions
 Curious-0.017
Don-0.016
 Looking-0.016
 Fasc-0.015
 Who-0.015
Token other
Token ID584
Feature activation-1.48e+00

Pos logit contributions
ies+0.005
ishing+0.005
els+0.004
ig+0.004
ish+0.004
Neg logit contributions
 Curious-0.006
Don-0.006
 Looking-0.006
Would-0.006
 Fasc-0.006
Token trains
Token ID13404
Feature activation-9.48e+00

Pos logit contributions
ies+0.020
ishing+0.019
ig+0.017
ish+0.017
els+0.017
Neg logit contributions
 Curious-0.032
Don-0.031
 Who-0.030
 Looking-0.030
 hadn-0.029
Token slide
Token ID10649
Feature activation-8.74e+00

Pos logit contributions
ies+0.160
ishing+0.159
els+0.156
ins+0.146
ig+0.144
Neg logit contributions
Don-0.163
-0.161
 Curious-0.159
 hadn-0.158
 Who-0.150
Token down
Token ID866
Feature activation-5.83e+00

Pos logit contributions
ies+0.113
ins+0.106
ishing+0.105
els+0.104
ig+0.097
Neg logit contributions
Don-0.172
 hadn-0.170
 Curious-0.169
-0.167
 Fasc-0.161
Token the
Token ID262
Feature activation+3.97e-01

Pos logit contributions
ies+0.122
ishing+0.112
ish+0.110
ig+0.109
ins+0.106
Neg logit contributions
Don-0.092
 Curious-0.092
 Fasc-0.087
 Looking-0.086
 Terr-0.082
Token tracks
Token ID8339
Feature activation-1.15e+01

Pos logit contributions
 Curious+0.007
Don+0.007
 Looking+0.006
 Fasc+0.006
 Who+0.006
Neg logit contributions
ies-0.008
ishing-0.007
els-0.007
ig-0.007
ish-0.007
Token.
Token ID13
Feature activation+1.04e+01

Pos logit contributions
ies+0.161
ishing+0.151
els+0.150
ries+0.137
ins+0.137
Neg logit contributions
 Curious-0.238
-0.233
Don-0.232
 hadn-0.231
 Looking-0.224
Token it
Token ID340
Feature activation-1.39e+01

Pos logit contributions
ishing+0.206
ies+0.200
ig+0.193
ish+0.189
els+0.182
Neg logit contributions
Don-0.165
 Curious-0.163
-0.161
 Who-0.158
 Looking-0.155
Token look
Token ID804
Feature activation-1.28e+01

Pos logit contributions
ishing+0.268
els+0.265
ies+0.251
ig+0.249
ries+0.245
Neg logit contributions
 Curious-0.209
-0.201
Don-0.201
 hadn-0.195
 Looking-0.186
Token like
Token ID588
Feature activation-1.04e+01

Pos logit contributions
ies+0.267
ig+0.266
els+0.265
ishing+0.265
ries+0.249
Neg logit contributions
 Curious-0.169
Don-0.162
 hadn-0.153
 Looking-0.151
 Fasc-0.150
Token a
Token ID257
Feature activation-8.53e+00

Pos logit contributions
ies+0.202
ishing+0.202
ig+0.192
ish+0.187
els+0.182
Neg logit contributions
Don-0.162
 Curious-0.156
 hadn-0.150
 Looking-0.146
 Who-0.145
Token real
Token ID1103
Feature activation-1.18e+01

Pos logit contributions
ishing+0.160
els+0.156
ies+0.156
ries+0.146
ins+0.139
Neg logit contributions
 Curious-0.145
 Who-0.139
Don-0.139
 Are-0.137
 hadn-0.133
Token spaceship
Token ID40663
Feature activation-1.89e+01

Pos logit contributions
els+0.206
ishing+0.204
ies+0.202
ins+0.196
ries+0.193
Neg logit contributions
 Curious-0.201
 Who-0.201
 Are-0.200
 Looking-0.185
 Would-0.185
Token.
Token ID13
Feature activation+3.83e+00

Pos logit contributions
ishing+0.272
els+0.262
ies+0.254
ries+0.241
ins+0.237
Neg logit contributions
-0.352
 hadn-0.351
 Curious-0.351
 Who-0.349
Don-0.344
Token
Token ID198
Feature activation+5.42e+00

Pos logit contributions
Don+0.040
 Curious+0.040
 Looking+0.036
 Fasc+0.035
 hadn+0.034
Neg logit contributions
ies-0.087
els-0.081
ig-0.080
ish-0.079
ishing-0.079
Token
Token ID198
Feature activation+5.96e+00

Pos logit contributions
 Curious+0.079
 Looking+0.073
 Fasc+0.072
Don+0.071
 Terr+0.069
Neg logit contributions
ies-0.109
ig-0.099
els-0.098
ie-0.098
ins-0.098
TokenOne
Token ID3198
Feature activation-2.05e+00

Pos logit contributions
 Curious+0.096
 Looking+0.089
 Fasc+0.088
 Terr+0.086
 hadn+0.082
Neg logit contributions
ies-0.109
ig-0.100
ins-0.098
els-0.097
--0.097
Token day
Token ID1110
Feature activation+8.17e-03

Pos logit contributions
ies+0.037
ishing+0.036
els+0.034
ries+0.032
ish+0.032
Neg logit contributions
 Curious-0.037
Don-0.035
 Looking-0.034
 Fasc-0.033
 Who-0.033
Token "
Token ID366
Feature activation+3.75e+00

Pos logit contributions
 Curious+0.038
Don+0.038
 Fasc+0.034
 hadn+0.034
 Looking+0.033
Neg logit contributions
ies-0.079
els-0.073
ishing-0.071
 again-0.070
ig-0.070
TokenYou
Token ID1639
Feature activation-2.97e+00

Pos logit contributions
 Curious+0.044
 Fasc+0.041
 Terr+0.040
 Looking+0.040
 hadn+0.040
Neg logit contributions
ies-0.078
ishing-0.077
els-0.074
--0.072
ins-0.071
Token are
Token ID389
Feature activation-3.52e+00

Pos logit contributions
ies+0.056
ishing+0.051
ish+0.049
els+0.048
ig+0.048
Neg logit contributions
 Curious-0.052
Don-0.051
 Looking-0.049
Would-0.047
 Fasc-0.047
Token ruining
Token ID39842
Feature activation-9.22e+00

Pos logit contributions
ies+0.060
ishing+0.055
els+0.052
ish+0.050
ig+0.049
Neg logit contributions
 Curious-0.068
Don-0.065
 Looking-0.064
 Fasc-0.062
 Exc-0.061
Token our
Token ID674
Feature activation-7.30e+00

Pos logit contributions
ies+0.184
ig+0.176
ishing+0.172
ier+0.166
ish+0.164
Neg logit contributions
Don-0.137
 Curious-0.132
 Who-0.131
 Looking-0.130
 Are-0.122
Token chess
Token ID19780
Feature activation-1.92e+01

Pos logit contributions
ier+0.123
ies+0.119
ishing+0.116
ins+0.109
els+0.107
Neg logit contributions
 Curious-0.124
 Looking-0.116
 Terr-0.114
 Who-0.114
Me-0.113
Token game
Token ID983
Feature activation-1.48e+01

Pos logit contributions
ies+0.306
ishing+0.303
els+0.302
ries+0.287
ishes+0.266
Neg logit contributions
-0.341
 Exc-0.333
 Curious-0.327
 hadn-0.304
 Looking-0.303
Token!"
Token ID2474
Feature activation+5.69e+00

Pos logit contributions
ishing+0.232
ies+0.219
els+0.206
ig+0.202
ish+0.200
Neg logit contributions
-0.287
 Exc-0.266
 Curious-0.263
 hadn-0.263
 Who-0.259
Token
Token ID198
Feature activation+7.51e+00

Pos logit contributions
 Curious+0.075
Don+0.071
 Looking+0.069
 Fasc+0.069
Would+0.067
Neg logit contributions
ies-0.127
ishing-0.112
--0.112
els-0.110
ig-0.109
Token
Token ID198
Feature activation+7.79e+00

Pos logit contributions
 Curious+0.099
Don+0.090
 Fasc+0.089
 Looking+0.089
 hadn+0.085
Neg logit contributions
ies-0.170
--0.152
ishing-0.152
els-0.151
ig-0.149
TokenThe
Token ID464
Feature activation-7.68e-01

Pos logit contributions
 Curious+0.127
 Looking+0.117
 Fasc+0.116
 Terr+0.113
 hadn+0.113
Neg logit contributions
ies-0.152
ishing-0.137
els-0.134
--0.133
ig-0.132
Token to
Token ID284
Feature activation-2.39e+00

Pos logit contributions
ies+0.100
ishing+0.098
ig+0.090
ish+0.088
els+0.086
Neg logit contributions
 Curious-0.104
Don-0.103
 Who-0.099
 Looking-0.098
Would-0.097
Token put
Token ID1234
Feature activation-1.07e+01

Pos logit contributions
ies+0.053
ishing+0.051
els+0.047
ig+0.047
ins+0.046
Neg logit contributions
 Curious-0.034
Don-0.032
 Looking-0.032
 Who-0.030
 Fasc-0.030
Token it
Token ID340
Feature activation-1.29e+01

Pos logit contributions
ies+0.236
ishing+0.217
ier+0.212
ins+0.206
ooming+0.202
Neg logit contributions
Don-0.133
 Curious-0.132
 Looking-0.125
 Who-0.123
 Exc-0.122
Token on
Token ID319
Feature activation-7.80e+00

Pos logit contributions
ins+0.224
els+0.218
ries+0.205
ishing+0.205
ies+0.202
Neg logit contributions
-0.189
Don-0.174
 had-0.167
 Curious-0.167
 hadn-0.164
Token our
Token ID674
Feature activation-6.78e+00

Pos logit contributions
ies+0.166
ishing+0.159
ig+0.148
ish+0.147
ins+0.144
Neg logit contributions
 Curious-0.114
Don-0.112
-0.109
 Exc-0.107
 Looking-0.106
Token castle
Token ID16669
Feature activation-1.81e+01

Pos logit contributions
ier+0.113
ies+0.109
ishing+0.105
ins+0.104
ing+0.095
Neg logit contributions
 Curious-0.122
Me-0.115
 Looking-0.112
 Exc-0.112
Don-0.111
Token."
Token ID526
Feature activation+6.77e+00

Pos logit contributions
ishing+0.271
ies+0.253
els+0.251
ins+0.243
ooming+0.235
Neg logit contributions
-0.320
 Exc-0.320
 Curious-0.311
 On-0.295
case-0.294
Token
Token ID198
Feature activation+7.27e+00

Pos logit contributions
 Curious+0.086
Don+0.084
 Looking+0.080
 hadn+0.078
 Fasc+0.078
Neg logit contributions
ies-0.146
 again-0.128
ie-0.127
--0.127
ig-0.127
Token
Token ID198
Feature activation+7.43e+00

Pos logit contributions
 Curious+0.110
 Looking+0.102
 hadn+0.102
 Fasc+0.101
Don+0.100
Neg logit contributions
ies-0.142
--0.131
ie-0.128
els-0.124
ig-0.124
TokenBen
Token ID11696
Feature activation+1.03e+00

Pos logit contributions
 Curious+0.124
 Looking+0.115
 Fasc+0.113
 hadn+0.112
 Terr+0.111
Neg logit contributions
ies-0.136
--0.122
ig-0.120
ins-0.118
ishing-0.118
Token nods
Token ID34833
Feature activation-2.95e+00

Pos logit contributions
 Curious+0.019
Don+0.019
 Looking+0.018
Would+0.017
 Who+0.017
Neg logit contributions
ies-0.018
ishing-0.016
els-0.015
ish-0.015
ig-0.015
Token how
Token ID703
Feature activation-5.87e+00

Pos logit contributions
ies+0.199
ishing+0.192
els+0.182
ig+0.180
ish+0.175
Neg logit contributions
 Curious-0.131
Don-0.125
 Looking-0.120
 Who-0.115
 Fasc-0.112
Token to
Token ID284
Feature activation-5.38e+00

Pos logit contributions
ies+0.090
ishing+0.083
els+0.077
ig+0.076
ish+0.074
Neg logit contributions
 Curious-0.123
Don-0.120
 Looking-0.117
 Fasc-0.113
 Who-0.112
Token match
Token ID2872
Feature activation-1.57e+01

Pos logit contributions
ies+0.116
ishing+0.111
ig+0.105
els+0.103
ins+0.101
Neg logit contributions
 Curious-0.077
Don-0.074
 Looking-0.072
 Who-0.068
 Fasc-0.068
Token the
Token ID262
Feature activation-9.04e+00

Pos logit contributions
ishing+0.275
ies+0.268
ig+0.257
ooming+0.251
ish+0.241
Neg logit contributions
Don-0.241
 Who-0.241
 Curious-0.238
-0.237
 Something-0.234
Token different
Token ID1180
Feature activation-1.43e+01

Pos logit contributions
ies+0.147
ishing+0.144
els+0.131
ins+0.128
ig+0.127
Neg logit contributions
 Curious-0.168
Don-0.159
-0.157
 hadn-0.155
 Who-0.152
Token colours
Token ID18915
Feature activation-1.81e+01

Pos logit contributions
ishing+0.195
ig+0.186
ier+0.181
ies+0.180
ooming+0.178
Neg logit contributions
 Curious-0.278
 Who-0.266
-0.261
 Are-0.261
 Would-0.257
Token and
Token ID290
Feature activation-5.42e+00

Pos logit contributions
ishing+0.252
ies+0.237
ig+0.222
ooming+0.215
ish+0.210
Neg logit contributions
-0.372
 Curious-0.339
 hadn-0.336
 Who-0.332
Don-0.332
Token shapes
Token ID15268
Feature activation-1.37e+01

Pos logit contributions
ies+0.098
ishing+0.094
ig+0.088
els+0.087
ish+0.086
Neg logit contributions
 Curious-0.097
Don-0.094
 Looking-0.091
 Who-0.089
 Fasc-0.087
Token of
Token ID286
Feature activation-7.94e+00

Pos logit contributions
ishing+0.187
ies+0.175
ig+0.165
ish+0.160
els+0.156
Neg logit contributions
-0.270
 Curious-0.263
Don-0.259
 Who-0.257
 Looking-0.257
Token the
Token ID262
Feature activation-7.46e+00

Pos logit contributions
ies+0.140
ishing+0.136
ig+0.130
ish+0.124
ier+0.122
Neg logit contributions
 Curious-0.140
Don-0.135
 Looking-0.134
 Who-0.132
 Fasc-0.130
Token gum
Token ID27142
Feature activation-1.63e+01

Pos logit contributions
ies+0.124
ishing+0.121
ig+0.112
els+0.110
ins+0.110
Neg logit contributions
 Curious-0.136
Don-0.129
 Looking-0.125
 hadn-0.124
 Who-0.124
Token pour
Token ID12797
Feature activation-1.58e+01

Pos logit contributions
ies+0.185
ishing+0.178
els+0.178
ig+0.173
isher+0.171
Neg logit contributions
-0.144
 Curious-0.137
 Something-0.124
 Who-0.123
 Are-0.121
Token the
Token ID262
Feature activation-9.49e+00

Pos logit contributions
ies+0.322
ishing+0.318
ooming+0.311
ig+0.308
ologist+0.296
Neg logit contributions
-0.219
Don-0.208
 Exc-0.181
 It-0.180
Oh-0.177
Token mix
Token ID5022
Feature activation-1.83e+01

Pos logit contributions
ishing+0.160
ies+0.159
els+0.143
ig+0.140
ier+0.138
Neg logit contributions
 Curious-0.163
Don-0.156
-0.155
 Exc-0.150
Me-0.149
Token into
Token ID656
Feature activation-1.00e+01

Pos logit contributions
ies+0.261
ishing+0.259
ooming+0.257
ig+0.235
els+0.229
Neg logit contributions
-0.341
 Exc-0.302
 Who-0.294
Don-0.292
This-0.292
Token the
Token ID262
Feature activation-7.30e+00

Pos logit contributions
ies+0.234
ishing+0.226
ig+0.219
ish+0.217
ooming+0.207
Neg logit contributions
Don-0.109
-0.107
 Curious-0.103
Ah-0.103
 Who-0.100
Token mixer
Token ID33938
Feature activation-1.88e+01

Pos logit contributions
ies+0.126
ishing+0.118
els+0.109
ig+0.105
ier+0.104
Neg logit contributions
 Curious-0.135
Don-0.125
 Terr-0.124
-0.123
 Are-0.123
Token.
Token ID13
Feature activation+3.46e+00

Pos logit contributions
ishing+0.272
ies+0.265
els+0.250
ig+0.246
ooming+0.232
Neg logit contributions
-0.375
 Exc-0.328
 Curious-0.326
 Who-0.321
Don-0.316
Token But
Token ID887
Feature activation-1.08e+00

Pos logit contributions
Don+0.037
 Curious+0.037
 Looking+0.033
 hadn+0.032
 Fasc+0.032
Neg logit contributions
ies-0.077
ish-0.074
els-0.074
ig-0.074
ishing-0.074
Token be
Token ID307
Feature activation-2.02e+00

Pos logit contributions
ies+0.021
ishing+0.021
ins+0.019
els+0.019
ier+0.018
Neg logit contributions
Don-0.017
 Curious-0.016
 Who-0.016
 Are-0.016
 Is-0.016
Token careful
Token ID8161
Feature activation-8.36e+00

Pos logit contributions
ies+0.039
ishing+0.035
els+0.034
ig+0.033
ish+0.033
Neg logit contributions
Don-0.034
 Curious-0.034
 Looking-0.032
 Fasc-0.031
Would-0.030
Token,
Token ID11
Feature activation-2.46e+00

Pos logit contributions
ishing+0.137
ishes+0.120
ig+0.119
ies+0.119
els+0.118
Neg logit contributions
-0.153
Don-0.142
 Curious-0.140
 hadn-0.134
 Looking-0.133