TOP ACTIVATIONS
MAX = 17.411

't let yourself become too dependent on one thing . It
bring the quarrel to a resolution , until everyone was happy
this time she wanted to compete with her brother . Her
better to cooperate than to compete . <|endoftext|> L ily and
I agree , we can do it
I think we can all agree that this popcorn tastes better
that sharing is better than competing . They learn that purple
first two friends started to compete . They ran
a boy who loved to compete with his friends . He
competitive and was excited to compete with her friend . So
not independent . You are dependent on us . We love
competitive . He wanted to compete in running , and win
had been too ready to joke around with the pan and
melodies and even her own compositions . Emma was so proud
competitive . He loved to compete with his friends and was
girl and she loved to compete with Jack in the mornings
. " I can 't compete with the other kids ."
brightest . She loved to compete with the other animals .
It was so ready to compete . At the
It makes me want to spit ." " Me

MIN ACTIVATIONS
MIN = -15.161

mess . But their mom said , " No
, Lily and Sam !" their mom calls . " You
ira and R avi and their friends ran to the policeman
, Ben and Lily ?" their mom asked . " We
, Tim and Mia ," their mom says . " The
, Ben and Anna ?" their mom asked . " Maybe
. Lily and Max and their mom talked to the elephant
. Tom and Jen and their wheels were broken and hurt
noticed Lily and Max and their mom . It lifted its
then , Tim my and his dad have made sure to
. Later , their friend Bunny came to play
By the end of their adventure , Lily had learned
you 're safe now ," their dad said . He drove
. Soon , their friends came . They played
very naughty of you !" their mom said .
up with each other ," their mom said . " And
. Suddenly , their mom came outside and saw
." Soon , their mom came to check on
, it 's okay ," their mom said . " You
Tom my and his friends all worked together to

INTERVAL 9.268 - 17.411
CONTAINS 1.559%

the day so cool ! <|endoftext|> Once upon a time ,
sister kept her promise . <|endoftext|> Once there was a big
Tom and Lily wished they had not fought .
happy to see how much joy the picture was bringing her
start exploring . Her mom handed her a pretty notebook so

INTERVAL 1.125 - 9.268
CONTAINS 45.591%

a piece of cloth and put it around the triangle .
up a pe b ble too . He threw it as
back out of the office . She was so excited to
so much fun , they didn 't want the day to
little girl had so much fun ! She laughed as the

INTERVAL -7.018 - 1.125
CONTAINS 40.330%

the park to play with other kids in a club .
the eggs away . The bird was very upset because it
can . Hold his paw and say ' shake '. He
the dry letter and he never got his magical castle .
brown and tall . It has a sign on it .

INTERVAL -15.161 - -7.018
CONTAINS 12.520%

Eventually , Lily and her mom had to leave
the cold . One day she put her warm m ittens
The squirrel shook his head and replied : "
out his camera to take a photo of the tasty dish
her friends . She had a large family with many people
Token't
Token ID470
Feature activation-6.89e-01

Pos logit contributions
used+0.178
eyed+0.139
icked+0.137
overed+0.136
wards+0.136
Neg logit contributions
 attic-0.315
 needle-0.314
 shelf-0.312
 garage-0.310
 puzzle-0.310
Token let
Token ID1309
Feature activation+8.27e+00

Pos logit contributions
 fort+0.020
 attic+0.019
 folder+0.019
 garage+0.019
 sheet+0.019
Neg logit contributions
used-0.025
uts-0.022
eyed-0.022
but-0.021
overed-0.021
Token yourself
Token ID3511
Feature activation-1.09e-01

Pos logit contributions
used+0.280
eyed+0.245
uts+0.240
all+0.232
ting+0.228
Neg logit contributions
 fort-0.272
 attic-0.268
 sheet-0.257
 folder-0.256
 garage-0.253
Token become
Token ID1716
Feature activation+9.10e-01

Pos logit contributions
 fort+0.004
 folder+0.004
 attic+0.004
 sheet+0.004
 garage+0.004
Neg logit contributions
used-0.003
eyed-0.003
unch-0.002
uts-0.002
ucked-0.002
Token too
Token ID1165
Feature activation+3.57e+00

Pos logit contributions
used+0.031
eyed+0.028
uts+0.026
 cheering+0.026
all+0.026
Neg logit contributions
 fort-0.029
 attic-0.029
 sheet-0.028
 folder-0.028
 garage-0.028
Token dependent
Token ID10795
Feature activation+1.74e+01

Pos logit contributions
used+0.151
eyed+0.135
but+0.135
overed+0.134
unch+0.133
Neg logit contributions
 attic-0.078
 folder-0.075
 fort-0.075
 sheet-0.074
 trash-0.073
Token on
Token ID319
Feature activation+1.59e+00

Pos logit contributions
used+0.401
eyed+0.338
uts+0.316
unch+0.308
 cheering+0.299
Neg logit contributions
 attic-0.739
 folder-0.734
 fort-0.732
 sheet-0.721
 garage-0.708
Token one
Token ID530
Feature activation-1.73e+00

Pos logit contributions
used+0.057
eyed+0.051
uts+0.050
all+0.047
but+0.047
Neg logit contributions
 fort-0.049
 attic-0.048
 folder-0.046
 sheet-0.046
 garage-0.045
Token thing
Token ID1517
Feature activation+7.56e+00

Pos logit contributions
 fort+0.071
 attic+0.070
 folder+0.069
 sheet+0.069
 garage+0.067
Neg logit contributions
used-0.044
eyed-0.038
uts-0.036
unch-0.034
but-0.033
Token.
Token ID13
Feature activation+4.87e+00

Pos logit contributions
used+0.168
eyed+0.139
unch+0.127
uts+0.125
 cheering+0.119
Neg logit contributions
 fort-0.331
 attic-0.329
 folder-0.326
 sheet-0.323
 garage-0.316
Token It
Token ID632
Feature activation-2.10e+00

Pos logit contributions
used+0.112
eyed+0.097
uts+0.089
overed+0.085
ting+0.085
Neg logit contributions
 attic-0.205
 garage-0.199
 folder-0.197
 fort-0.196
 closet-0.194
Token bring
Token ID2222
Feature activation+6.38e+00

Pos logit contributions
 fort+0.073
 attic+0.071
 folder+0.070
 sheet+0.069
 closet+0.067
Neg logit contributions
used-0.077
eyed-0.070
uts-0.065
 cheering-0.064
all-0.064
Token the
Token ID262
Feature activation-1.13e+01

Pos logit contributions
used+0.180
eyed+0.165
 cheering+0.155
all+0.152
uts+0.152
Neg logit contributions
 fort-0.238
 attic-0.230
 garage-0.225
 sheet-0.223
 closet-0.219
Token quarrel
Token ID41998
Feature activation+1.10e+01

Pos logit contributions
 fort+0.301
 attic+0.284
 sheet+0.273
 folder+0.266
 closet+0.259
Neg logit contributions
used-0.439
eyed-0.406
 cheering-0.390
uts-0.363
all-0.360
Token to
Token ID284
Feature activation-1.38e+00

Pos logit contributions
used+0.239
eyed+0.204
uts+0.203
but+0.181
all+0.180
Neg logit contributions
 fort-0.475
 attic-0.473
 folder-0.463
 sheet-0.462
 closet-0.455
Token a
Token ID257
Feature activation-7.68e+00

Pos logit contributions
 fort+0.048
 attic+0.046
 sheet+0.045
 folder+0.045
 garage+0.044
Neg logit contributions
used-0.043
eyed-0.038
uts-0.037
 cheering-0.036
all-0.036
Token resolution
Token ID6323
Feature activation+1.72e+01

Pos logit contributions
 attic+0.193
 fort+0.191
 sheet+0.181
 closet+0.176
 folder+0.175
Neg logit contributions
used-0.291
 cheering-0.279
eyed-0.272
 joining-0.260
uts-0.251
Token,
Token ID11
Feature activation-2.30e-01

Pos logit contributions
used+0.440
eyed+0.376
uts+0.358
unch+0.337
but+0.330
Neg logit contributions
 fort-0.694
 attic-0.690
 folder-0.678
 sheet-0.675
 garage-0.661
Token until
Token ID1566
Feature activation+2.73e+00

Pos logit contributions
 fort+0.009
 attic+0.009
 folder+0.009
 sheet+0.009
 closet+0.009
Neg logit contributions
used-0.006
eyed-0.005
uts-0.005
but-0.005
 cheering-0.005
Token everyone
Token ID2506
Feature activation+5.88e-02

Pos logit contributions
used+0.075
eyed+0.060
all+0.059
 cheering+0.058
uts+0.058
Neg logit contributions
 fort-0.108
 attic-0.105
 sheet-0.102
 folder-0.102
 closet-0.102
Token was
Token ID373
Feature activation-2.60e+00

Pos logit contributions
used+0.002
eyed+0.002
uts+0.002
unch+0.002
but+0.002
Neg logit contributions
 fort-0.002
 attic-0.002
 folder-0.002
 sheet-0.002
 garage-0.002
Token happy
Token ID3772
Feature activation+3.07e+00

Pos logit contributions
 folder+0.079
 attic+0.079
 fort+0.078
 closet+0.078
 drawer+0.078
Neg logit contributions
used-0.085
uts-0.077
ting-0.073
but-0.073
eyed-0.073
Token this
Token ID428
Feature activation-8.08e+00

Pos logit contributions
 fort+0.003
 attic+0.003
 sheet+0.003
 garage+0.003
 folder+0.003
Neg logit contributions
used-0.002
eyed-0.002
all-0.002
but-0.002
 cheering-0.002
Token time
Token ID640
Feature activation+4.36e+00

Pos logit contributions
 fort+0.181
 attic+0.167
 closet+0.164
 garage+0.164
 folder+0.155
Neg logit contributions
used-0.331
all-0.307
eyed-0.294
 cheering-0.291
:-0.290
Token she
Token ID673
Feature activation-7.30e+00

Pos logit contributions
used+0.123
uts+0.101
eyed+0.098
 cheering+0.095
all+0.095
Neg logit contributions
 fort-0.168
 attic-0.162
 sheet-0.161
 folder-0.159
 garage-0.159
Token wanted
Token ID2227
Feature activation+9.00e+00

Pos logit contributions
 fort+0.256
 attic+0.253
 sheet+0.244
 folder+0.243
 garage+0.238
Neg logit contributions
used-0.233
eyed-0.203
uts-0.196
all-0.191
but-0.182
Token to
Token ID284
Feature activation-1.82e+00

Pos logit contributions
used+0.184
eyed+0.150
uts+0.146
all+0.136
but+0.133
Neg logit contributions
 fort-0.411
 attic-0.404
 sheet-0.394
 folder-0.393
 garage-0.390
Token compete
Token ID9320
Feature activation+1.65e+01

Pos logit contributions
 fort+0.053
 attic+0.053
 garage+0.052
 folder+0.052
 sheet+0.051
Neg logit contributions
used-0.067
eyed-0.060
uts-0.059
unch-0.058
ting-0.057
Token with
Token ID351
Feature activation+1.88e+00

Pos logit contributions
used+0.372
eyed+0.310
unch+0.292
uts+0.291
aring+0.262
Neg logit contributions
 attic-0.711
 folder-0.703
 fort-0.700
 sheet-0.689
 garage-0.680
Token her
Token ID607
Feature activation-1.07e+01

Pos logit contributions
used+0.060
eyed+0.052
uts+0.051
 cheering+0.051
all+0.050
Neg logit contributions
 fort-0.064
 attic-0.062
 folder-0.059
 garage-0.059
 sheet-0.059
Token brother
Token ID3956
Feature activation+3.82e+00

Pos logit contributions
 fort+0.314
 attic+0.310
 sheet+0.299
 folder+0.299
 garage+0.292
Neg logit contributions
used-0.395
eyed-0.354
uts-0.347
but-0.331
all-0.329
Token.
Token ID13
Feature activation+4.83e+00

Pos logit contributions
used+0.080
eyed+0.066
uts+0.063
all+0.057
but+0.057
Neg logit contributions
 fort-0.172
 attic-0.171
 sheet-0.167
 folder-0.167
 garage-0.165
Token Her
Token ID2332
Feature activation-1.00e+01

Pos logit contributions
used+0.131
eyed+0.113
uts+0.108
unch+0.105
 cheering+0.102
Neg logit contributions
 attic-0.189
 fort-0.186
 folder-0.184
 garage-0.181
 closet-0.180
Token better
Token ID1365
Feature activation+2.85e+00

Pos logit contributions
 fort+0.054
 attic+0.053
 sheet+0.052
 folder+0.052
 journal+0.051
Neg logit contributions
used-0.064
eyed-0.059
all-0.057
uts-0.057
 cheering-0.057
Token to
Token ID284
Feature activation-2.06e+00

Pos logit contributions
used+0.068
eyed+0.058
uts+0.056
but+0.051
unch+0.051
Neg logit contributions
 fort-0.121
 attic-0.120
 folder-0.118
 sheet-0.118
 garage-0.116
Token cooperate
Token ID21270
Feature activation+9.36e+00

Pos logit contributions
 fort+0.060
 attic+0.059
 folder+0.058
 garage+0.058
 sheet+0.057
Neg logit contributions
used-0.077
uts-0.068
eyed-0.068
but-0.066
unch-0.065
Token than
Token ID621
Feature activation+1.06e+00

Pos logit contributions
used+0.250
eyed+0.218
uts+0.209
but+0.195
all+0.194
Neg logit contributions
 fort-0.368
 attic-0.364
 folder-0.357
 sheet-0.355
 garage-0.349
Token to
Token ID284
Feature activation-2.02e+00

Pos logit contributions
used+0.031
eyed+0.027
uts+0.026
unch+0.025
but+0.025
Neg logit contributions
 fort-0.039
 attic-0.039
 folder-0.038
 sheet-0.038
 garage-0.037
Token compete
Token ID9320
Feature activation+1.63e+01

Pos logit contributions
 fort+0.055
 attic+0.054
 folder+0.053
 garage+0.053
 sheet+0.052
Neg logit contributions
used-0.080
eyed-0.070
uts-0.070
but-0.068
unch-0.067
Token.
Token ID13
Feature activation+5.49e+00

Pos logit contributions
used+0.381
eyed+0.315
uts+0.295
unch+0.293
but+0.274
Neg logit contributions
 attic-0.695
 fort-0.695
 folder-0.684
 sheet-0.676
 garage-0.670
Token<|endoftext|>
Token ID50256
Feature activation+1.38e+01

Pos logit contributions
used+0.169
eyed+0.139
overed+0.136
uts+0.133
ting+0.133
Neg logit contributions
 attic-0.200
 garage-0.197
 folder-0.184
 closet-0.183
 shelf-0.181
TokenL
Token ID43
Feature activation-1.66e+00

Pos logit contributions
used+0.418
eyed+0.372
uts+0.342
but+0.340
shadow+0.338
Neg logit contributions
 attic-0.527
 fort-0.510
 sheet-0.505
 folder-0.505
 garage-0.498
Tokenily
Token ID813
Feature activation-3.40e+00

Pos logit contributions
 fort+0.059
 proper+0.057
 attic+0.057
 garage+0.056
 folder+0.055
Neg logit contributions
used-0.045
all-0.042
eyed-0.041
uts-0.040
:-0.040
Token and
Token ID290
Feature activation-6.02e+00

Pos logit contributions
 fort+0.125
 sheet+0.120
 folder+0.118
 attic+0.117
 seas+0.116
Neg logit contributions
used-0.091
eyed-0.084
all-0.077
:-0.077
uts-0.073
Token
Token ID95
Feature activation+7.67e-01

Pos logit contributions
used+0.065
eyed+0.057
uts+0.056
unch+0.054
 cheering+0.053
Neg logit contributions
 attic-0.085
 fort-0.083
 folder-0.082
 garage-0.081
 closet-0.080
Token
Token ID26391
Feature activation+1.02e-02

Pos logit contributions
used+0.021
uts+0.017
unch+0.017
eyed+0.017
 cheering+0.016
Neg logit contributions
 attic-0.030
 garage-0.029
 folder-0.029
 closet-0.029
 fort-0.029
Token
Token ID129
Feature activation-5.05e-01

Pos logit contributions
used+0.000
ting+0.000
eyed+0.000
 joining+0.000
ucked+0.000
Neg logit contributions
 attic-0.000
 fort-0.000
 closet-0.000
 folder-0.000
 sheet-0.000
Token
Token ID241
Feature activation+4.65e+00

Pos logit contributions
 attic+0.014
 shelf+0.014
 folder+0.014
 closet+0.013
 fort+0.013
Neg logit contributions
used-0.018
ting-0.015
icked-0.015
 cheering-0.015
unch-0.015
TokenI
Token ID40
Feature activation-2.13e-01

Pos logit contributions
used+0.118
icked+0.093
uts+0.092
eyed+0.089
ting+0.087
Neg logit contributions
 attic-0.188
 garage-0.178
 sheet-0.175
 fort-0.173
 folder-0.172
Token agree
Token ID4236
Feature activation+1.62e+01

Pos logit contributions
 attic+0.007
 fort+0.007
 folder+0.007
 sheet+0.007
 garage+0.007
Neg logit contributions
used-0.007
uts-0.006
eyed-0.006
ting-0.006
but-0.005
Token,
Token ID11
Feature activation-3.74e-01

Pos logit contributions
used+0.407
eyed+0.348
unch+0.323
uts+0.322
ucked+0.302
Neg logit contributions
 attic-0.657
 fort-0.656
 sheet-0.653
 folder-0.649
 closet-0.630
Token we
Token ID356
Feature activation-1.75e+00

Pos logit contributions
 fort+0.014
 attic+0.014
 folder+0.014
 sheet+0.013
 garage+0.013
Neg logit contributions
used-0.011
uts-0.009
eyed-0.009
all-0.009
 cheering-0.009
Token can
Token ID460
Feature activation+7.18e+00

Pos logit contributions
 fort+0.049
 attic+0.048
 folder+0.047
 sheet+0.047
 garage+0.046
Neg logit contributions
used-0.067
eyed-0.061
uts-0.060
unch-0.057
but-0.057
Token do
Token ID466
Feature activation+5.56e+00

Pos logit contributions
used+0.282
eyed+0.253
uts+0.250
ting+0.248
unch+0.245
Neg logit contributions
 garage-0.187
 fort-0.184
 attic-0.183
 sheet-0.180
 folder-0.175
Token it
Token ID340
Feature activation+4.00e-01

Pos logit contributions
used+0.165
eyed+0.143
uts+0.141
all+0.132
 cheering+0.131
Neg logit contributions
 fort-0.202
 attic-0.200
 folder-0.195
 sheet-0.194
 garage-0.192
TokenI
Token ID40
Feature activation-5.34e-01

Pos logit contributions
used+0.090
uts+0.081
eyed+0.074
icked+0.072
unch+0.070
Neg logit contributions
 attic-0.107
 sheet-0.105
 folder-0.105
 garage-0.103
 fort-0.102
Token think
Token ID892
Feature activation+7.89e+00

Pos logit contributions
 fort+0.019
 folder+0.018
 attic+0.018
 sheet+0.018
 garage+0.018
Neg logit contributions
used-0.017
uts-0.014
eyed-0.014
ting-0.013
but-0.013
Token we
Token ID356
Feature activation-1.61e+00

Pos logit contributions
used+0.234
eyed+0.204
uts+0.197
unch+0.186
all+0.185
Neg logit contributions
 fort-0.290
 attic-0.287
 folder-0.281
 sheet-0.279
 garage-0.274
Token can
Token ID460
Feature activation+7.05e+00

Pos logit contributions
 fort+0.048
 attic+0.047
 folder+0.046
 sheet+0.046
 garage+0.044
Neg logit contributions
used-0.059
eyed-0.054
uts-0.052
unch-0.049
all-0.049
Token all
Token ID477
Feature activation+1.03e+00

Pos logit contributions
used+0.298
overed+0.271
ting+0.270
uts+0.265
smart+0.265
Neg logit contributions
 garage-0.167
 fort-0.162
 sheet-0.159
 attic-0.155
 shelf-0.152
Token agree
Token ID4236
Feature activation+1.62e+01

Pos logit contributions
used+0.036
uts+0.032
eyed+0.032
unch+0.030
but+0.030
Neg logit contributions
 fort-0.032
 attic-0.031
 sheet-0.031
 folder-0.030
 garage-0.030
Token that
Token ID326
Feature activation-1.79e+00

Pos logit contributions
used+0.402
eyed+0.342
unch+0.324
uts+0.305
ucked+0.294
Neg logit contributions
 fort-0.656
 folder-0.650
 attic-0.649
 sheet-0.648
 journal-0.632
Token this
Token ID428
Feature activation-6.75e+00

Pos logit contributions
 fort+0.061
 attic+0.060
 folder+0.059
 sheet+0.058
 closet+0.057
Neg logit contributions
used-0.057
eyed-0.049
uts-0.049
 cheering-0.047
all-0.046
Token popcorn
Token ID38914
Feature activation+4.10e+00

Pos logit contributions
 fort+0.180
 attic+0.177
 folder+0.172
 closet+0.167
 garage+0.166
Neg logit contributions
used-0.257
eyed-0.228
uts-0.227
 cheering-0.222
all-0.220
Token tastes
Token ID18221
Feature activation+7.04e+00

Pos logit contributions
used+0.138
uts+0.129
eyed+0.126
all+0.119
 cheering+0.119
Neg logit contributions
 attic-0.124
 fort-0.120
 garage-0.116
 folder-0.116
 closet-0.116
Token better
Token ID1365
Feature activation+4.17e+00

Pos logit contributions
used+0.241
eyed+0.214
uts+0.206
unch+0.197
but+0.195
Neg logit contributions
 fort-0.227
 attic-0.224
 folder-0.219
 sheet-0.219
 garage-0.211
Token that
Token ID326
Feature activation-2.97e+00

Pos logit contributions
used+0.187
eyed+0.147
unch+0.147
uts+0.135
oned+0.134
Neg logit contributions
 folder-0.230
 attic-0.229
 fort-0.225
 sheet-0.222
 journal-0.220
Token sharing
Token ID7373
Feature activation+8.39e+00

Pos logit contributions
 fort+0.111
 attic+0.109
 folder+0.106
 sheet+0.106
 closet+0.104
Neg logit contributions
used-0.083
uts-0.073
eyed-0.072
 cheering-0.070
all-0.067
Token is
Token ID318
Feature activation-1.38e+00

Pos logit contributions
used+0.264
eyed+0.250
uts+0.249
 cheering+0.236
but+0.228
Neg logit contributions
 fort-0.269
 attic-0.266
 sheet-0.255
 garage-0.251
 closet-0.250
Token better
Token ID1365
Feature activation+3.24e+00

Pos logit contributions
 fort+0.039
 attic+0.039
 folder+0.038
 sheet+0.037
 garage+0.036
Neg logit contributions
used-0.054
eyed-0.047
uts-0.046
unch-0.045
but-0.044
Token than
Token ID621
Feature activation+9.62e-01

Pos logit contributions
used+0.122
unch+0.106
eyed+0.099
all+0.098
qu+0.097
Neg logit contributions
 fort-0.100
 folder-0.098
 attic-0.098
 proper-0.092
 journal-0.092
Token competing
Token ID11780
Feature activation+1.62e+01

Pos logit contributions
used+0.023
eyed+0.020
uts+0.019
all+0.018
unch+0.018
Neg logit contributions
 fort-0.041
 attic-0.040
 folder-0.039
 sheet-0.039
 garage-0.039
Token.
Token ID13
Feature activation+5.05e+00

Pos logit contributions
used+0.400
unch+0.326
eyed+0.311
but+0.285
all+0.281
Neg logit contributions
 folder-0.679
 attic-0.672
 fort-0.662
 sheet-0.654
 garage-0.652
Token They
Token ID1119
Feature activation-5.03e+00

Pos logit contributions
used+0.158
eyed+0.125
ting+0.123
but+0.119
pr+0.119
Neg logit contributions
 attic-0.192
 fort-0.181
 closet-0.180
 folder-0.180
 garage-0.180
Token learn
Token ID2193
Feature activation+5.87e+00

Pos logit contributions
 fort+0.170
 attic+0.168
 sheet+0.163
 folder+0.163
 garage+0.159
Neg logit contributions
used-0.163
eyed-0.146
uts-0.140
all-0.132
 cheering-0.132
Token that
Token ID326
Feature activation-3.09e+00

Pos logit contributions
used+0.198
unch+0.156
eyed+0.156
oned+0.145
uts+0.144
Neg logit contributions
 folder-0.215
 attic-0.213
 fort-0.209
 sheet-0.206
 journal-0.205
Token purple
Token ID14032
Feature activation-2.68e-01

Pos logit contributions
 fort+0.117
 attic+0.115
 folder+0.112
 sheet+0.112
 closet+0.110
Neg logit contributions
used-0.086
eyed-0.075
uts-0.075
 cheering-0.070
all-0.069
Token first
Token ID717
Feature activation-3.91e+00

Pos logit contributions
 fort+0.335
 attic+0.332
 folder+0.321
 sheet+0.317
 journal+0.309
Neg logit contributions
used-0.444
 cheering-0.404
eyed-0.396
 joining-0.379
uts-0.376
Token two
Token ID734
Feature activation-5.19e+00

Pos logit contributions
 attic+0.112
 fort+0.110
 folder+0.106
 garage+0.105
 closet+0.105
Neg logit contributions
used-0.145
eyed-0.128
uts-0.126
all-0.123
 cheering-0.122
Token friends
Token ID2460
Feature activation+3.87e+00

Pos logit contributions
 attic+0.196
 fort+0.188
 folder+0.186
 garage+0.184
 sheet+0.184
Neg logit contributions
used-0.148
uts-0.128
 cheering-0.127
eyed-0.124
 joining-0.120
Token started
Token ID2067
Feature activation+4.95e+00

Pos logit contributions
used+0.134
eyed+0.118
uts+0.117
unch+0.114
all+0.111
Neg logit contributions
 fort-0.120
 attic-0.119
 folder-0.117
 sheet-0.116
 garage-0.115
Token to
Token ID284
Feature activation-2.12e+00

Pos logit contributions
used+0.135
unch+0.115
uts+0.115
eyed+0.115
but+0.106
Neg logit contributions
 fort-0.187
 attic-0.182
 garage-0.180
 sheet-0.180
 folder-0.179
Token compete
Token ID9320
Feature activation+1.60e+01

Pos logit contributions
 fort+0.059
 attic+0.058
 garage+0.056
 sheet+0.056
 folder+0.055
Neg logit contributions
used-0.081
uts-0.074
eyed-0.073
unch-0.070
but-0.070
Token.
Token ID13
Feature activation+5.15e+00

Pos logit contributions
used+0.372
eyed+0.303
unch+0.302
uts+0.288
aring+0.269
Neg logit contributions
 attic-0.676
 fort-0.671
 folder-0.662
 sheet-0.655
 garage-0.652
Token
Token ID220
Feature activation+7.13e+00

Pos logit contributions
used+0.138
eyed+0.118
uts+0.114
unch+0.112
all+0.106
Neg logit contributions
 attic-0.200
 fort-0.197
 garage-0.196
 folder-0.195
 closet-0.194
Token
Token ID198
Feature activation+3.56e+00

Pos logit contributions
used+0.176
ting+0.146
eyed+0.130
 believing+0.124
 cheering+0.123
Neg logit contributions
 attic-0.292
 garage-0.290
 fort-0.282
 closet-0.274
 folder-0.271
TokenThey
Token ID2990
Feature activation-3.79e+00

Pos logit contributions
used+0.128
ting+0.105
icked+0.103
owing+0.097
uts+0.097
Neg logit contributions
 garage-0.102
 attic-0.099
 shelf-0.096
pron-0.095
 sheet-0.095
Token ran
Token ID4966
Feature activation+5.54e+00

Pos logit contributions
 fort+0.126
 attic+0.124
 folder+0.121
 sheet+0.120
 garage+0.118
Neg logit contributions
used-0.126
eyed-0.113
uts-0.108
all-0.103
but-0.103
Token a
Token ID257
Feature activation-7.96e+00

Pos logit contributions
 fort+0.091
 folder+0.089
 drawer+0.087
 closet+0.087
 attic+0.087
Neg logit contributions
used-0.060
eyed-0.053
unch-0.053
all-0.053
uts-0.051
Token boy
Token ID2933
Feature activation+1.73e-01

Pos logit contributions
 attic+0.189
 folder+0.181
 fort+0.179
 sheet+0.173
 closet+0.170
Neg logit contributions
used-0.292
unch-0.285
eyed-0.283
 cheering-0.279
-0.276
Token who
Token ID508
Feature activation-3.30e+00

Pos logit contributions
used+0.006
uts+0.006
overed+0.005
eyed+0.005
but+0.005
Neg logit contributions
 fort-0.005
 attic-0.005
 garage-0.005
 folder-0.005
 closet-0.005
Token loved
Token ID6151
Feature activation+4.29e+00

Pos logit contributions
 folder+0.121
 sheet+0.121
 fort+0.121
 attic+0.118
 puzzle+0.111
Neg logit contributions
eyed-0.093
used-0.088
-0.079
uts-0.072
 cheering-0.072
Token to
Token ID284
Feature activation-2.51e+00

Pos logit contributions
used+0.082
eyed+0.068
uts+0.066
 cheering+0.061
all+0.061
Neg logit contributions
 fort-0.197
 attic-0.195
 folder-0.192
 sheet-0.190
 closet-0.188
Token compete
Token ID9320
Feature activation+1.60e+01

Pos logit contributions
 fort+0.075
 attic+0.075
 folder+0.072
 sheet+0.071
 garage+0.070
Neg logit contributions
used-0.091
eyed-0.081
uts-0.079
unch-0.076
all-0.075
Token with
Token ID351
Feature activation+1.74e+00

Pos logit contributions
used+0.420
eyed+0.353
unch+0.352
uts+0.338
aring+0.333
Neg logit contributions
 attic-0.635
 folder-0.609
 journal-0.609
 garage-0.606
 fort-0.605
Token his
Token ID465
Feature activation-1.19e+01

Pos logit contributions
used+0.061
eyed+0.055
uts+0.053
 cheering+0.051
all+0.051
Neg logit contributions
 fort-0.054
 attic-0.053
 folder-0.052
 sheet-0.051
 closet-0.050
Token friends
Token ID2460
Feature activation+4.36e+00

Pos logit contributions
 fort+0.325
 attic+0.322
 folder+0.315
 sheet+0.313
 closet+0.305
Neg logit contributions
used-0.446
uts-0.395
eyed-0.392
 cheering-0.381
but-0.368
Token.
Token ID13
Feature activation+5.44e+00

Pos logit contributions
used+0.103
unch+0.093
uts+0.091
eyed+0.086
qu+0.080
Neg logit contributions
 attic-0.182
 folder-0.182
 fort-0.180
 garage-0.177
 drawer-0.176
Token He
Token ID679
Feature activation-6.78e+00

Pos logit contributions
used+0.164
eyed+0.142
uts+0.137
unch+0.129
but+0.126
Neg logit contributions
 fort-0.202
 attic-0.201
 garage-0.195
 folder-0.194
 sheet-0.194
Token competitive
Token ID7606
Feature activation+2.65e+00

Pos logit contributions
 attic+0.093
 fort+0.092
 folder+0.091
 sheet+0.091
 closet+0.090
Neg logit contributions
used-0.076
uts-0.068
eyed-0.067
unch-0.065
but-0.062
Token and
Token ID290
Feature activation-6.12e+00

Pos logit contributions
used+0.059
eyed+0.049
uts+0.046
unch+0.043
but+0.042
Neg logit contributions
 fort-0.117
 attic-0.116
 folder-0.114
 sheet-0.114
 garage-0.112
Token was
Token ID373
Feature activation-2.25e+00

Pos logit contributions
 fort+0.230
 attic+0.224
 sheet+0.221
 folder+0.219
 closet+0.213
Neg logit contributions
used-0.181
eyed-0.159
uts-0.154
all-0.147
 cheering-0.146
Token excited
Token ID6568
Feature activation+5.43e+00

Pos logit contributions
 attic+0.075
 fort+0.074
 folder+0.073
 sheet+0.073
 closet+0.072
Neg logit contributions
used-0.070
uts-0.062
eyed-0.061
unch-0.061
but-0.057
Token to
Token ID284
Feature activation-2.13e+00

Pos logit contributions
used+0.129
eyed+0.110
uts+0.100
unch+0.098
ucked+0.092
Neg logit contributions
 fort-0.233
 attic-0.229
 folder-0.226
 sheet-0.224
 closet-0.220
Token compete
Token ID9320
Feature activation+1.59e+01

Pos logit contributions
 garage+0.058
 attic+0.058
 fort+0.057
 folder+0.057
 sheet+0.056
Neg logit contributions
used-0.079
uts-0.074
unch-0.073
but-0.070
eyed-0.070
Token with
Token ID351
Feature activation+1.63e+00

Pos logit contributions
used+0.390
unch+0.328
eyed+0.326
uts+0.307
aring+0.290
Neg logit contributions
 attic-0.646
 folder-0.632
 fort-0.629
 garage-0.620
 sheet-0.620
Token her
Token ID607
Feature activation-1.09e+01

Pos logit contributions
used+0.052
eyed+0.046
uts+0.045
 cheering+0.043
all+0.043
Neg logit contributions
 fort-0.056
 attic-0.055
 folder-0.053
 sheet-0.053
 garage-0.052
Token friend
Token ID1545
Feature activation+3.70e+00

Pos logit contributions
 fort+0.300
 attic+0.297
 folder+0.287
 sheet+0.286
 garage+0.278
Neg logit contributions
used-0.418
eyed-0.375
uts-0.368
all-0.350
but-0.350
Token.
Token ID13
Feature activation+4.74e+00

Pos logit contributions
used+0.085
eyed+0.072
uts+0.069
unch+0.064
but+0.063
Neg logit contributions
 fort-0.159
 attic-0.157
 folder-0.155
 sheet-0.154
 garage-0.152
Token So
Token ID1406
Feature activation+1.77e+00

Pos logit contributions
used+0.142
eyed+0.125
uts+0.115
qu+0.112
unch+0.111
Neg logit contributions
 attic-0.171
 fort-0.168
 folder-0.165
 garage-0.165
 closet-0.164
Token not
Token ID407
Feature activation-1.35e+00

Pos logit contributions
 fort+0.085
 sheet+0.081
 attic+0.079
 garage+0.078
 folder+0.077
Neg logit contributions
used-0.063
eyed-0.060
 joining-0.054
all-0.054
 cheering-0.053
Token independent
Token ID4795
Feature activation+6.89e-01

Pos logit contributions
 fort+0.039
 attic+0.038
 sheet+0.037
 folder+0.037
 garage+0.036
Neg logit contributions
used-0.051
eyed-0.046
uts-0.045
all-0.043
 cheering-0.043
Token.
Token ID13
Feature activation+4.89e+00

Pos logit contributions
used+0.017
eyed+0.014
uts+0.013
unch+0.013
but+0.012
Neg logit contributions
 fort-0.029
 attic-0.029
 folder-0.028
 sheet-0.028
 garage-0.028
Token You
Token ID921
Feature activation-5.13e-01

Pos logit contributions
used+0.137
eyed+0.117
uts+0.112
unch+0.110
 cheering+0.108
Neg logit contributions
 attic-0.189
 folder-0.182
 fort-0.182
 sheet-0.179
 closet-0.179
Token are
Token ID389
Feature activation-2.16e+00

Pos logit contributions
 fort+0.018
 attic+0.018
 sheet+0.017
 folder+0.017
 closet+0.017
Neg logit contributions
used-0.016
eyed-0.014
uts-0.014
all-0.013
unch-0.013
Token dependent
Token ID10795
Feature activation+1.58e+01

Pos logit contributions
 fort+0.082
 sheet+0.077
 attic+0.076
 garage+0.075
 folder+0.074
Neg logit contributions
used-0.061
eyed-0.058
 joining-0.053
uts-0.053
all-0.053
Token on
Token ID319
Feature activation+3.51e-01

Pos logit contributions
used+0.397
eyed+0.330
unch+0.309
uts+0.305
 cheering+0.302
Neg logit contributions
 folder-0.641
 attic-0.640
 fort-0.628
 sheet-0.623
 garage-0.611
Token us
Token ID514
Feature activation-2.47e+00

Pos logit contributions
used+0.013
eyed+0.012
uts+0.012
 cheering+0.011
all+0.011
Neg logit contributions
 fort-0.010
 attic-0.010
 folder-0.009
 garage-0.009
 sheet-0.009
Token.
Token ID13
Feature activation+4.91e+00

Pos logit contributions
 fort+0.105
 attic+0.104
 folder+0.103
 sheet+0.101
 garage+0.100
Neg logit contributions
used-0.060
unch-0.047
eyed-0.047
uts-0.044
 cheering-0.043
Token We
Token ID775
Feature activation-1.59e+00

Pos logit contributions
used+0.141
eyed+0.119
uts+0.115
unch+0.113
 cheering+0.111
Neg logit contributions
 attic-0.186
 folder-0.178
 fort-0.177
 garage-0.175
 sheet-0.175
Token love
Token ID1842
Feature activation+6.15e+00

Pos logit contributions
 fort+0.045
 attic+0.044
 folder+0.044
 garage+0.043
 sheet+0.042
Neg logit contributions
used-0.061
uts-0.053
eyed-0.053
unch-0.051
but-0.051
Token competitive
Token ID7606
Feature activation+2.76e+00

Pos logit contributions
 fort+0.077
 attic+0.075
 sheet+0.074
 folder+0.073
 garage+0.071
Neg logit contributions
used-0.084
eyed-0.078
uts-0.073
unch-0.070
all-0.070
Token.
Token ID13
Feature activation+4.79e+00

Pos logit contributions
used+0.068
eyed+0.058
uts+0.056
all+0.052
unch+0.051
Neg logit contributions
 fort-0.115
 attic-0.114
 folder-0.112
 sheet-0.112
 garage-0.109
Token He
Token ID679
Feature activation-7.02e+00

Pos logit contributions
used+0.171
eyed+0.152
uts+0.145
qu+0.145
ting+0.143
Neg logit contributions
 attic-0.154
 garage-0.153
 closet-0.146
 shelf-0.146
 fort-0.145
Token wanted
Token ID2227
Feature activation+8.21e+00

Pos logit contributions
 fort+0.276
 attic+0.266
 folder+0.262
 sheet+0.261
 journal+0.258
Neg logit contributions
used-0.186
eyed-0.169
all-0.153
uts-0.151
't-0.149
Token to
Token ID284
Feature activation-2.20e+00

Pos logit contributions
used+0.157
eyed+0.128
uts+0.124
all+0.113
but+0.112
Neg logit contributions
 fort-0.386
 attic-0.380
 folder-0.375
 sheet-0.373
 closet-0.366
Token compete
Token ID9320
Feature activation+1.58e+01

Pos logit contributions
 attic+0.065
 fort+0.065
 folder+0.064
 garage+0.063
 sheet+0.063
Neg logit contributions
used-0.081
eyed-0.071
uts-0.070
unch-0.068
but-0.067
Token in
Token ID287
Feature activation-2.09e+00

Pos logit contributions
used+0.434
eyed+0.372
unch+0.370
uts+0.357
qu+0.347
Neg logit contributions
 attic-0.603
 folder-0.581
 fort-0.579
 garage-0.577
 journal-0.576
Token running
Token ID2491
Feature activation+8.31e+00

Pos logit contributions
 attic+0.064
 fort+0.064
 folder+0.062
 sheet+0.062
 garage+0.061
Neg logit contributions
used-0.074
eyed-0.067
uts-0.065
unch-0.062
but-0.061
Token,
Token ID11
Feature activation-3.07e-01

Pos logit contributions
used+0.200
eyed+0.171
uts+0.163
but+0.151
all+0.150
Neg logit contributions
 fort-0.350
 attic-0.347
 folder-0.340
 sheet-0.340
 garage-0.333
Token and
Token ID290
Feature activation-5.41e+00

Pos logit contributions
 fort+0.011
 attic+0.011
 folder+0.011
 sheet+0.011
 garage+0.010
Neg logit contributions
used-0.009
eyed-0.008
uts-0.008
unch-0.008
but-0.008
Token win
Token ID1592
Feature activation+9.03e+00

Pos logit contributions
 attic+0.178
 fort+0.177
 garage+0.177
 folder+0.173
 sheet+0.170
Neg logit contributions
used-0.169
uts-0.156
unch-0.155
eyed-0.152
ucked-0.142
Token had
Token ID550
Feature activation-7.87e-02

Pos logit contributions
 attic+0.259
 fort+0.258
 folder+0.254
 sheet+0.250
 garage+0.241
Neg logit contributions
used-0.232
eyed-0.207
uts-0.186
 cheering-0.181
all-0.179
Token been
Token ID587
Feature activation-1.13e+00

Pos logit contributions
 fort+0.003
 attic+0.003
 sheet+0.003
 folder+0.003
 closet+0.003
Neg logit contributions
used-0.002
eyed-0.002
uts-0.002
all-0.002
 cheering-0.002
Token too
Token ID1165
Feature activation+2.56e+00

Pos logit contributions
 fort+0.040
 attic+0.040
 sheet+0.039
 folder+0.038
 garage+0.038
Neg logit contributions
used-0.036
eyed-0.032
uts-0.029
 cheering-0.029
all-0.028
Token ready
Token ID3492
Feature activation+7.80e+00

Pos logit contributions
used+0.102
uts+0.091
eyed+0.090
ting+0.088
but+0.087
Neg logit contributions
 attic-0.064
 fort-0.061
 folder-0.060
 garage-0.059
 sheet-0.058
Token to
Token ID284
Feature activation-1.70e+00

Pos logit contributions
used+0.219
eyed+0.186
uts+0.178
unch+0.172
qu+0.170
Neg logit contributions
 fort-0.296
 attic-0.292
 sheet-0.290
 folder-0.289
 closet-0.285
Token joke
Token ID9707
Feature activation+1.57e+01

Pos logit contributions
 fort+0.048
 attic+0.047
 sheet+0.047
 folder+0.046
 garage+0.046
Neg logit contributions
used-0.064
uts-0.058
eyed-0.058
but-0.056
unch-0.055
Token around
Token ID1088
Feature activation+6.02e+00

Pos logit contributions
used+0.364
eyed+0.304
uts+0.280
unch+0.271
all+0.257
Neg logit contributions
 fort-0.685
 attic-0.674
 folder-0.670
 sheet-0.669
 journal-0.648
Token with
Token ID351
Feature activation+1.68e+00

Pos logit contributions
used+0.129
eyed+0.106
uts+0.101
all+0.092
but+0.092
Neg logit contributions
 fort-0.270
 attic-0.268
 folder-0.263
 sheet-0.262
 garage-0.258
Token the
Token ID262
Feature activation-1.15e+01

Pos logit contributions
used+0.051
eyed+0.044
uts+0.043
 cheering+0.042
all+0.041
Neg logit contributions
 fort-0.060
 attic-0.060
 folder-0.058
 sheet-0.057
 garage-0.057
Token pan
Token ID3425
Feature activation+6.26e+00

Pos logit contributions
 attic+0.275
 fort+0.269
 folder+0.251
 garage+0.250
 proper+0.247
Neg logit contributions
used-0.430
 cheering-0.414
eyed-0.383
 joining-0.382
ting-0.374
Token and
Token ID290
Feature activation-4.98e+00

Pos logit contributions
used+0.127
eyed+0.104
uts+0.099
but+0.089
unch+0.089
Neg logit contributions
 fort-0.286
 attic-0.285
 folder-0.279
 sheet-0.277
 garage-0.274
Token melodies
Token ID47077
Feature activation+8.79e+00

Pos logit contributions
 fort+0.093
 attic+0.093
 garage+0.090
 closet+0.089
 folder+0.088
Neg logit contributions
used-0.095
eyed-0.083
 cheering-0.083
ting-0.080
 joining-0.079
Token and
Token ID290
Feature activation-6.00e+00

Pos logit contributions
used+0.202
eyed+0.169
uts+0.160
unch+0.149
all+0.145
Neg logit contributions
 fort-0.381
 attic-0.378
 folder-0.372
 sheet-0.371
 garage-0.363
Token even
Token ID772
Feature activation-3.14e+00

Pos logit contributions
 fort+0.212
 attic+0.208
 folder+0.199
 sheet+0.199
 garage+0.197
Neg logit contributions
used-0.184
eyed-0.164
 cheering-0.160
uts-0.154
all-0.149
Token her
Token ID607
Feature activation-1.22e+01

Pos logit contributions
 fort+0.119
 attic+0.118
 garage+0.111
 sheet+0.110
 closet+0.109
Neg logit contributions
used-0.091
eyed-0.081
 cheering-0.079
ting-0.074
 joining-0.073
Token own
Token ID898
Feature activation-5.24e+00

Pos logit contributions
 fort+0.330
 attic+0.320
 garage+0.305
 folder+0.298
 sheet+0.297
Neg logit contributions
used-0.469
uts-0.423
eyed-0.418
 cheering-0.412
all-0.391
Token compositions
Token ID33543
Feature activation+1.57e+01

Pos logit contributions
 fort+0.136
 attic+0.134
 folder+0.130
 sheet+0.129
 garage+0.126
Neg logit contributions
used-0.211
eyed-0.192
uts-0.187
unch-0.180
but-0.179
Token.
Token ID13
Feature activation+5.56e+00

Pos logit contributions
used+0.362
eyed+0.289
unch+0.269
uts+0.248
 cheering+0.244
Neg logit contributions
 attic-0.685
 fort-0.685
 folder-0.683
 sheet-0.674
 closet-0.662
Token Emma
Token ID18966
Feature activation-5.24e+00

Pos logit contributions
used+0.168
eyed+0.147
uts+0.139
unch+0.136
 cheering+0.130
Neg logit contributions
 attic-0.207
 folder-0.202
 fort-0.199
 garage-0.195
 closet-0.195
Token was
Token ID373
Feature activation-2.27e+00

Pos logit contributions
 fort+0.186
 attic+0.186
 folder+0.180
 garage+0.176
 sheet+0.175
Neg logit contributions
used-0.162
eyed-0.144
uts-0.137
all-0.132
 cheering-0.132
Token so
Token ID523
Feature activation-1.45e+00

Pos logit contributions
 fort+0.089
 attic+0.088
 folder+0.086
 sheet+0.086
 garage+0.085
Neg logit contributions
used-0.062
eyed-0.053
uts-0.051
all-0.048
but-0.047
Token proud
Token ID6613
Feature activation+5.07e+00

Pos logit contributions
 fort+0.052
 attic+0.052
 folder+0.051
 sheet+0.050
 garage+0.049
Neg logit contributions
used-0.044
eyed-0.039
uts-0.037
unch-0.035
but-0.035
Token competitive
Token ID7606
Feature activation+2.55e+00

Pos logit contributions
 fort+0.085
 attic+0.084
 sheet+0.083
 folder+0.082
 garage+0.080
Neg logit contributions
used-0.090
eyed-0.082
uts-0.077
unch-0.075
all-0.073
Token.
Token ID13
Feature activation+5.15e+00

Pos logit contributions
used+0.058
eyed+0.048
uts+0.042
but+0.041
unch+0.041
Neg logit contributions
 attic-0.116
 fort-0.116
 folder-0.113
 sheet-0.113
 closet-0.111
Token He
Token ID679
Feature activation-6.93e+00

Pos logit contributions
used+0.121
uts+0.102
eyed+0.100
unch+0.094
all+0.093
Neg logit contributions
 fort-0.217
 folder-0.213
 attic-0.212
 sheet-0.212
 journal-0.204
Token loved
Token ID6151
Feature activation+4.35e+00

Pos logit contributions
 fort+0.247
 folder+0.241
 attic+0.239
 sheet+0.238
 closet+0.231
Neg logit contributions
used-0.199
eyed-0.192
uts-0.181
all-0.175
't-0.174
Token to
Token ID284
Feature activation-2.70e+00

Pos logit contributions
used+0.086
eyed+0.071
uts+0.068
all+0.063
 cheering+0.062
Neg logit contributions
 fort-0.199
 attic-0.198
 folder-0.195
 sheet-0.194
 closet-0.191
Token compete
Token ID9320
Feature activation+1.57e+01

Pos logit contributions
 fort+0.082
 attic+0.082
 folder+0.080
 sheet+0.078
 closet+0.077
Neg logit contributions
used-0.094
eyed-0.085
uts-0.082
 cheering-0.079
all-0.078
Token with
Token ID351
Feature activation+1.51e+00

Pos logit contributions
used+0.411
eyed+0.345
unch+0.343
uts+0.330
qu+0.319
Neg logit contributions
 attic-0.621
 folder-0.596
 fort-0.596
 garage-0.591
 journal-0.590
Token his
Token ID465
Feature activation-1.21e+01

Pos logit contributions
used+0.053
eyed+0.048
uts+0.047
all+0.045
 cheering+0.044
Neg logit contributions
 fort-0.047
 attic-0.046
 folder-0.045
 sheet-0.045
 garage-0.044
Token friends
Token ID2460
Feature activation+4.23e+00

Pos logit contributions
 attic+0.402
 fort+0.398
 folder+0.396
 sheet+0.390
 closet+0.384
Neg logit contributions
used-0.380
uts-0.330
eyed-0.328
 cheering-0.309
all-0.307
Token and
Token ID290
Feature activation-5.44e+00

Pos logit contributions
used+0.112
unch+0.097
uts+0.096
eyed+0.094
qu+0.086
Neg logit contributions
 attic-0.167
 folder-0.165
 fort-0.165
 sheet-0.161
 garage-0.160
Token was
Token ID373
Feature activation-2.46e+00

Pos logit contributions
 fort+0.178
 attic+0.177
 folder+0.172
 sheet+0.172
 closet+0.167
Neg logit contributions
used-0.181
eyed-0.163
uts-0.157
 cheering-0.153
but-0.149
Token girl
Token ID2576
Feature activation-2.13e+00

Pos logit contributions
 proper+0.009
 attic+0.008
pron+0.008
 sheet+0.008
 journal+0.008
Neg logit contributions
 cheering-0.013
used-0.012
.-0.012
 joining-0.012
aring-0.012
Token and
Token ID290
Feature activation-6.59e+00

Pos logit contributions
 fort+0.073
 folder+0.072
 attic+0.072
 sheet+0.070
 closet+0.069
Neg logit contributions
used-0.070
eyed-0.058
uts-0.057
unch-0.057
ucked-0.055
Token she
Token ID673
Feature activation-7.57e+00

Pos logit contributions
 fort+0.257
 attic+0.240
 sheet+0.238
 garage+0.236
 folder+0.232
Neg logit contributions
used-0.183
eyed-0.159
uts-0.157
 cheering-0.157
all-0.155
Token loved
Token ID6151
Feature activation+4.04e+00

Pos logit contributions
 fort+0.223
 attic+0.220
 sheet+0.214
 folder+0.214
 garage+0.207
Neg logit contributions
used-0.278
eyed-0.255
uts-0.247
all-0.235
but-0.235
Token to
Token ID284
Feature activation-2.80e+00

Pos logit contributions
used+0.107
eyed+0.091
uts+0.084
unch+0.081
but+0.078
Neg logit contributions
 fort-0.166
 attic-0.165
 folder-0.161
 sheet-0.160
 garage-0.157
Token compete
Token ID9320
Feature activation+1.57e+01

Pos logit contributions
 fort+0.084
 attic+0.083
 folder+0.080
 sheet+0.080
 garage+0.078
Neg logit contributions
used-0.103
eyed-0.092
uts-0.090
unch-0.086
but-0.086
Token with
Token ID351
Feature activation+1.14e+00

Pos logit contributions
used+0.401
unch+0.343
eyed+0.339
uts+0.322
aring+0.307
Neg logit contributions
 attic-0.625
 folder-0.610
 fort-0.597
 journal-0.594
 sheet-0.586
Token Jack
Token ID3619
Feature activation-3.83e+00

Pos logit contributions
used+0.031
 cheering+0.028
uts+0.028
all+0.027
ting+0.027
Neg logit contributions
 fort-0.041
 attic-0.039
 garage-0.039
 closet-0.038
 folder-0.037
Token in
Token ID287
Feature activation-3.71e+00

Pos logit contributions
 fort+0.183
 attic+0.182
 folder+0.178
 sheet+0.178
 garage+0.176
Neg logit contributions
used-0.070
eyed-0.056
uts-0.053
but-0.047
all-0.047
Token the
Token ID262
Feature activation-1.14e+01

Pos logit contributions
 fort+0.125
 attic+0.121
 folder+0.119
 sheet+0.119
 closet+0.116
Neg logit contributions
used-0.121
eyed-0.107
uts-0.102
 cheering-0.100
all-0.099
Token mornings
Token ID31143
Feature activation+7.15e+00

Pos logit contributions
 sheet+0.315
 fort+0.311
kin+0.305
 folder+0.299
pron+0.298
Neg logit contributions
 cheering-0.355
used-0.349
eyed-0.332
 joining-0.320
ting-0.319
Token.
Token ID13
Feature activation+4.23e+00

Pos logit contributions
used+0.107
unch+0.089
qu+0.089
all+0.086
aro+0.086
Neg logit contributions
 folder-0.148
 drawer-0.145
 sheet-0.142
 journal-0.142
 closet-0.138
Token "
Token ID366
Feature activation+2.30e+00

Pos logit contributions
used+0.143
eyed+0.126
qu+0.119
aro+0.119
pr+0.115
Neg logit contributions
 folder-0.139
 attic-0.136
 closet-0.136
 drawer-0.134
 garage-0.132
TokenI
Token ID40
Feature activation-1.18e+00

Pos logit contributions
used+0.062
eyed+0.052
uts+0.050
unch+0.047
 cheering+0.047
Neg logit contributions
 attic-0.090
 fort-0.089
 folder-0.088
 sheet-0.087
 garage-0.087
Token can
Token ID460
Feature activation+6.15e+00

Pos logit contributions
 fort+0.042
 attic+0.041
 folder+0.040
 sheet+0.040
 garage+0.039
Neg logit contributions
used-0.037
eyed-0.033
uts-0.032
unch-0.030
all-0.030
Token't
Token ID470
Feature activation-1.23e+00

Pos logit contributions
used+0.142
eyed+0.119
uts+0.114
unch+0.112
ting+0.111
Neg logit contributions
 fort-0.254
 garage-0.254
 attic-0.252
 sheet-0.252
 folder-0.248
Token compete
Token ID9320
Feature activation+1.56e+01

Pos logit contributions
 fort+0.037
 attic+0.037
 folder+0.036
 sheet+0.036
 garage+0.036
Neg logit contributions
used-0.043
eyed-0.038
uts-0.038
unch-0.036
but-0.036
Token with
Token ID351
Feature activation+1.64e+00

Pos logit contributions
used+0.406
eyed+0.349
unch+0.342
uts+0.318
ucked+0.306
Neg logit contributions
 attic-0.612
 folder-0.600
 fort-0.598
 sheet-0.582
 garage-0.581
Token the
Token ID262
Feature activation-1.17e+01

Pos logit contributions
used+0.054
uts+0.045
eyed+0.045
ting+0.044
all+0.043
Neg logit contributions
 fort-0.056
 attic-0.053
 folder-0.053
 garage-0.052
 closet-0.052
Token other
Token ID584
Feature activation-5.12e+00

Pos logit contributions
 attic+0.239
 fort+0.238
 closet+0.232
 proper+0.227
 folder+0.222
Neg logit contributions
used-0.441
 cheering-0.426
 joining-0.409
ting-0.393
't-0.388
Token kids
Token ID3988
Feature activation+3.77e+00

Pos logit contributions
 attic+0.125
 fort+0.122
 folder+0.120
 closet+0.118
 sheet+0.117
Neg logit contributions
used-0.205
uts-0.178
 cheering-0.177
eyed-0.176
all-0.173
Token."
Token ID526
Feature activation+7.40e+00

Pos logit contributions
used+0.088
eyed+0.075
uts+0.071
unch+0.068
all+0.065
Neg logit contributions
 fort-0.160
 attic-0.159
 folder-0.157
 sheet-0.154
 garage-0.152
Token brightest
Token ID33871
Feature activation-1.79e+00

Pos logit contributions
 fort+0.302
 folder+0.281
 garage+0.278
 journal+0.275
 closet+0.269
Neg logit contributions
used-0.364
ting-0.357
eyed-0.333
all-0.333
 cheering-0.329
Token.
Token ID13
Feature activation+5.12e+00

Pos logit contributions
 fort+0.061
 attic+0.060
 folder+0.059
 sheet+0.059
 garage+0.058
Neg logit contributions
used-0.057
eyed-0.051
uts-0.049
unch-0.047
but-0.046
Token She
Token ID1375
Feature activation-7.64e+00

Pos logit contributions
used+0.116
eyed+0.096
uts+0.095
unch+0.088
all+0.087
Neg logit contributions
 fort-0.221
 folder-0.214
 sheet-0.213
 attic-0.213
 journal-0.208
Token loved
Token ID6151
Feature activation+4.58e+00

Pos logit contributions
 fort+0.287
 attic+0.278
 folder+0.274
 garage+0.270
 sheet+0.264
Neg logit contributions
used-0.204
eyed-0.195
all-0.173
icked-0.170
:-0.170
Token to
Token ID284
Feature activation-2.56e+00

Pos logit contributions
used+0.103
eyed+0.086
uts+0.083
all+0.077
 cheering+0.076
Neg logit contributions
 fort-0.200
 attic-0.198
 folder-0.195
 sheet-0.193
 garage-0.191
Token compete
Token ID9320
Feature activation+1.56e+01

Pos logit contributions
 fort+0.074
 attic+0.073
 folder+0.071
 sheet+0.071
 garage+0.069
Neg logit contributions
used-0.096
eyed-0.087
uts-0.084
unch-0.081
but-0.080
Token with
Token ID351
Feature activation+1.40e+00

Pos logit contributions
used+0.385
eyed+0.326
uts+0.311
unch+0.311
aring+0.290
Neg logit contributions
 attic-0.649
 folder-0.636
 fort-0.631
 sheet-0.624
 journal-0.621
Token the
Token ID262
Feature activation-1.12e+01

Pos logit contributions
used+0.047
eyed+0.041
uts+0.041
 cheering+0.039
all+0.039
Neg logit contributions
 fort-0.046
 attic-0.044
 folder-0.043
 garage-0.043
 sheet-0.042
Token other
Token ID584
Feature activation-5.07e+00

Pos logit contributions
 fort+0.287
 journal+0.271
 attic+0.262
 folder+0.261
 garage+0.258
Neg logit contributions
used-0.419
 cheering-0.380
eyed-0.376
ting-0.375
uts-0.375
Token animals
Token ID4695
Feature activation+1.90e+00

Pos logit contributions
 fort+0.126
 attic+0.122
 folder+0.120
 journal+0.120
 sheet+0.118
Neg logit contributions
used-0.206
eyed-0.184
uts-0.181
ting-0.178
all-0.174
Token.
Token ID13
Feature activation+4.94e+00

Pos logit contributions
used+0.052
eyed+0.045
uts+0.044
unch+0.043
all+0.041
Neg logit contributions
 fort-0.072
 attic-0.072
 folder-0.071
 sheet-0.070
 garage-0.068
Token It
Token ID632
Feature activation-2.58e+00

Pos logit contributions
used+0.156
eyed+0.141
uts+0.132
unch+0.127
 cheering+0.126
Neg logit contributions
 attic-0.157
 garage-0.150
 fort-0.149
 folder-0.148
 sheet-0.148
Token was
Token ID373
Feature activation-2.76e+00

Pos logit contributions
 fort+0.085
 attic+0.085
 folder+0.083
 sheet+0.082
 garage+0.080
Neg logit contributions
used-0.086
eyed-0.076
uts-0.074
but-0.070
all-0.070
Token so
Token ID523
Feature activation-1.51e+00

Pos logit contributions
 fort+0.109
 attic+0.108
 folder+0.107
 sheet+0.105
 journal+0.104
Neg logit contributions
used-0.073
eyed-0.063
uts-0.060
 cheering-0.059
all-0.059
Token ready
Token ID3492
Feature activation+6.17e+00

Pos logit contributions
 fort+0.042
 attic+0.042
 folder+0.040
 sheet+0.039
 garage+0.039
Neg logit contributions
used-0.059
eyed-0.052
uts-0.050
ting-0.049
but-0.049
Token to
Token ID284
Feature activation-2.29e+00

Pos logit contributions
used+0.138
eyed+0.116
uts+0.110
unch+0.101
but+0.100
Neg logit contributions
 fort-0.272
 attic-0.269
 folder-0.264
 sheet-0.264
 garage-0.259
Token compete
Token ID9320
Feature activation+1.56e+01

Pos logit contributions
 fort+0.061
 attic+0.060
 sheet+0.058
 folder+0.058
 garage+0.058
Neg logit contributions
used-0.091
eyed-0.082
uts-0.081
unch-0.078
but-0.078
Token.
Token ID13
Feature activation+5.14e+00

Pos logit contributions
used+0.375
eyed+0.319
unch+0.302
uts+0.293
aring+0.271
Neg logit contributions
 attic-0.644
 fort-0.638
 folder-0.628
 sheet-0.624
 garage-0.616
Token
Token ID198
Feature activation+3.97e+00

Pos logit contributions
used+0.140
eyed+0.121
uts+0.113
 cheering+0.108
ting+0.107
Neg logit contributions
 attic-0.200
 garage-0.193
 sheet-0.189
 fort-0.188
 folder-0.188
Token
Token ID198
Feature activation+3.48e+00

Pos logit contributions
used+0.133
ting+0.110
icked+0.096
beh+0.096
ucked+0.093
Neg logit contributions
 attic-0.136
 garage-0.133
 sheet-0.131
 fort-0.130
 shelf-0.128
TokenAt
Token ID2953
Feature activation+5.75e-01

Pos logit contributions
used+0.114
uts+0.091
eyed+0.091
icked+0.090
ting+0.088
Neg logit contributions
 attic-0.119
 fort-0.115
 garage-0.114
 sheet-0.113
 folder-0.113
Token the
Token ID262
Feature activation-1.23e+01

Pos logit contributions
used+0.020
eyed+0.017
uts+0.017
unch+0.016
but+0.016
Neg logit contributions
 fort-0.019
 attic-0.019
 folder-0.018
 sheet-0.018
 garage-0.018
TokenIt
Token ID1026
Feature activation+2.37e-01

Pos logit contributions
used+0.058
eyed+0.048
uts+0.046
 cheering+0.045
all+0.044
Neg logit contributions
 attic-0.075
 fort-0.075
 sheet-0.074
 folder-0.073
 garage-0.072
Token makes
Token ID1838
Feature activation+4.48e+00

Pos logit contributions
used+0.008
uts+0.007
eyed+0.007
all+0.006
't+0.006
Neg logit contributions
 attic-0.008
 folder-0.008
 fort-0.008
 closet-0.008
 garage-0.008
Token me
Token ID502
Feature activation-2.25e+00

Pos logit contributions
used+0.144
uts+0.128
eyed+0.128
all+0.124
unch+0.121
Neg logit contributions
 attic-0.149
 fort-0.148
 garage-0.142
 folder-0.141
 closet-0.141
Token want
Token ID765
Feature activation+9.38e+00

Pos logit contributions
 attic+0.070
 fort+0.068
 folder+0.065
 sheet+0.065
 garage+0.064
Neg logit contributions
used-0.081
eyed-0.074
uts-0.072
unch-0.069
 cheering-0.068
Token to
Token ID284
Feature activation-1.57e+00

Pos logit contributions
used+0.177
uts+0.157
eyed+0.154
all+0.147
 cheering+0.138
Neg logit contributions
 attic-0.417
 fort-0.414
 garage-0.404
 closet-0.399
 folder-0.395
Token spit
Token ID27591
Feature activation+1.56e+01

Pos logit contributions
 fort+0.049
 attic+0.049
 folder+0.047
 sheet+0.047
 garage+0.046
Neg logit contributions
used-0.055
eyed-0.049
uts-0.047
unch-0.045
all-0.045
Token."
Token ID526
Feature activation+6.87e+00

Pos logit contributions
used+0.419
eyed+0.362
uts+0.349
all+0.327
but+0.324
Neg logit contributions
 attic-0.608
 fort-0.608
 folder-0.585
 garage-0.583
 sheet-0.581
Token
Token ID198
Feature activation+2.96e+00

Pos logit contributions
used+0.146
eyed+0.111
ting+0.102
uts+0.097
unch+0.096
Neg logit contributions
 fort-0.314
 attic-0.314
 garage-0.309
 sheet-0.308
 closet-0.308
Token
Token ID198
Feature activation+2.55e+00

Pos logit contributions
used+0.088
ting+0.070
ucked+0.060
eyed+0.059
overed+0.059
Neg logit contributions
 fort-0.116
 attic-0.116
 garage-0.115
 sheet-0.114
 closet-0.110
Token"
Token ID1
Feature activation+1.80e+00

Pos logit contributions
used+0.073
eyed+0.062
uts+0.060
unch+0.057
all+0.056
Neg logit contributions
 fort-0.098
 attic-0.097
 folder-0.095
 sheet-0.095
 garage-0.093
TokenMe
Token ID5308
Feature activation-1.36e+00

Pos logit contributions
used+0.053
eyed+0.046
uts+0.044
unch+0.042
all+0.041
Neg logit contributions
 fort-0.066
 attic-0.066
 folder-0.064
 sheet-0.064
 garage-0.063
Token mess
Token ID2085
Feature activation+9.79e+00

Pos logit contributions
 attic+0.218
 fort+0.211
 proper+0.197
iper+0.194
 folders+0.191
Neg logit contributions
 cheering-0.331
used-0.320
 joining-0.297
 deciding-0.293
-0.285
Token.
Token ID13
Feature activation+3.65e+00

Pos logit contributions
used+0.188
eyed+0.152
uts+0.145
all+0.129
but+0.129
Neg logit contributions
 fort-0.459
 attic-0.456
 folder-0.447
 sheet-0.445
 garage-0.440
Token
Token ID198
Feature activation+2.02e+00

Pos logit contributions
used+0.113
eyed+0.101
uts+0.095
ting+0.093
unch+0.092
Neg logit contributions
 attic-0.128
 garage-0.123
 closet-0.123
 folder-0.123
 fort-0.123
Token
Token ID198
Feature activation+1.80e+00

Pos logit contributions
used+0.051
ting+0.042
uts+0.038
overed+0.035
but+0.034
Neg logit contributions
 attic-0.089
 garage-0.086
 fort-0.085
 closet-0.084
 sheet-0.083
TokenBut
Token ID1537
Feature activation-1.43e-02

Pos logit contributions
used+0.053
uts+0.045
eyed+0.044
ting+0.042
unch+0.042
Neg logit contributions
 attic-0.066
 fort-0.065
 folder-0.064
 garage-0.063
 sheet-0.063
Token their
Token ID511
Feature activation-1.52e+01

Pos logit contributions
 fort+0.001
 attic+0.001
 folder+0.001
 sheet+0.001
 proper+0.001
Neg logit contributions
used-0.000
eyed-0.000
all-0.000
 cheering-0.000
ting-0.000
Token mom
Token ID1995
Feature activation-9.77e-01

Pos logit contributions
pron+0.430
 attic+0.428
 folder+0.411
 fort+0.408
 sheet+0.405
Neg logit contributions
used-0.441
 cheering-0.407
eyed-0.401
uts-0.375
:-0.368
Token said
Token ID531
Feature activation+4.39e+00

Pos logit contributions
 fort+0.033
 attic+0.033
 folder+0.032
 sheet+0.032
 garage+0.031
Neg logit contributions
used-0.032
eyed-0.028
uts-0.028
unch-0.026
all-0.026
Token,
Token ID11
Feature activation-1.95e+00

Pos logit contributions
used+0.109
eyed+0.091
uts+0.091
 cheering+0.084
but+0.083
Neg logit contributions
 fort-0.184
 attic-0.183
 folder-0.176
 sheet-0.176
 garage-0.175
Token "
Token ID366
Feature activation+2.29e+00

Pos logit contributions
 fort+0.078
 attic+0.077
 sheet+0.077
 folder+0.077
 closet+0.075
Neg logit contributions
used-0.051
eyed-0.044
uts-0.040
unch-0.039
all-0.038
TokenNo
Token ID2949
Feature activation-7.89e-01

Pos logit contributions
used+0.055
eyed+0.049
but+0.045
uts+0.045
all+0.044
Neg logit contributions
 fort-0.101
 attic-0.098
 folder-0.096
 sheet-0.096
 closet-0.095
Token,
Token ID11
Feature activation-2.58e-01

Pos logit contributions
 fort+0.103
 sheet+0.100
 attic+0.099
 folder+0.098
 closet+0.096
Neg logit contributions
used-0.075
eyed-0.068
uts-0.065
but-0.062
ucked-0.060
Token Lily
Token ID20037
Feature activation-5.67e+00

Pos logit contributions
pron+0.009
 journal+0.009
 attic+0.009
 proper+0.009
 puzzle+0.009
Neg logit contributions
used-0.007
 cheering-0.006
 joining-0.006
-0.006
ting-0.006
Token and
Token ID290
Feature activation-6.86e+00

Pos logit contributions
 fort+0.226
 attic+0.219
 sheet+0.217
 folder+0.214
 garage+0.210
Neg logit contributions
used-0.155
eyed-0.134
uts-0.128
but-0.121
unch-0.120
Token Sam
Token ID3409
Feature activation-5.20e+00

Pos logit contributions
 attic+0.245
 fort+0.230
pron+0.230
 sheet+0.227
 journal+0.227
Neg logit contributions
used-0.206
eyed-0.171
uts-0.169
ting-0.169
all-0.164
Token!"
Token ID2474
Feature activation+5.72e+00

Pos logit contributions
 fort+0.233
 attic+0.226
 sheet+0.224
 folder+0.222
 garage+0.219
Neg logit contributions
used-0.114
eyed-0.097
uts-0.093
but-0.086
unch-0.084
Token their
Token ID511
Feature activation-1.47e+01

Pos logit contributions
used+0.146
 cheering+0.144
+0.142
uts+0.134
eyed+0.133
Neg logit contributions
 attic-0.207
 fort-0.205
 drawer-0.197
pron-0.196
 folder-0.194
Token mom
Token ID1995
Feature activation-1.52e+00

Pos logit contributions
 attic+0.392
 fort+0.386
 folder+0.383
 sheet+0.372
 garage+0.360
Neg logit contributions
used-0.555
eyed-0.516
uts-0.500
 cheering-0.487
but-0.469
Token calls
Token ID3848
Feature activation+7.20e+00

Pos logit contributions
 closet+0.037
 fort+0.036
 sheet+0.034
 paper+0.032
 trash+0.032
Neg logit contributions
used-0.064
uts-0.061
unch-0.059
but-0.059
eyed-0.059
Token.
Token ID13
Feature activation+4.61e+00

Pos logit contributions
used+0.178
uts+0.142
unch+0.140
eyed+0.139
all+0.132
Neg logit contributions
 fort-0.315
 sheet-0.308
 folder-0.306
 closet-0.301
 attic-0.300
Token "
Token ID366
Feature activation+2.16e+00

Pos logit contributions
used+0.152
unch+0.116
all+0.115
eyed+0.115
shadow+0.111
Neg logit contributions
 closet-0.176
 attic-0.176
 folder-0.176
 fort-0.173
 sheet-0.172
TokenYou
Token ID1639
Feature activation-5.64e-01

Pos logit contributions
used+0.059
uts+0.049
eyed+0.047
unch+0.045
all+0.044
Neg logit contributions
 attic-0.084
 fort-0.083
 sheet-0.081
 garage-0.081
 folder-0.080
Tokenira
Token ID8704
Feature activation-7.78e+00

Pos logit contributions
 fort+0.163
 attic+0.161
 folder+0.158
 sheet+0.157
 garage+0.155
Neg logit contributions
used-0.122
eyed-0.106
uts-0.102
but-0.096
unch-0.095
Token and
Token ID290
Feature activation-7.75e+00

Pos logit contributions
 fort+0.263
 attic+0.259
 sheet+0.256
 folder+0.255
 garage+0.248
Neg logit contributions
used-0.250
eyed-0.220
uts-0.215
unch-0.204
but-0.201
Token R
Token ID371
Feature activation-3.90e+00

Pos logit contributions
 fort+0.293
 attic+0.285
 folder+0.276
 proper+0.269
 sheet+0.267
Neg logit contributions
used-0.245
eyed-0.198
 cheering-0.193
ting-0.193
uts-0.189
Tokenavi
Token ID15820
Feature activation-5.71e+00

Pos logit contributions
 fort+0.116
 attic+0.114
 sheet+0.111
 folder+0.109
 garage+0.108
Neg logit contributions
used-0.140
eyed-0.125
uts-0.122
but-0.119
 cheering-0.118
Token and
Token ID290
Feature activation-8.47e+00

Pos logit contributions
 sheet+0.153
 fort+0.151
 folder+0.147
 attic+0.147
 garage+0.147
Neg logit contributions
used-0.207
uts-0.194
eyed-0.183
but-0.180
unch-0.179
Token their
Token ID511
Feature activation-1.47e+01

Pos logit contributions
 fort+0.327
 attic+0.325
 folder+0.316
 sheet+0.313
 garage+0.309
Neg logit contributions
used-0.241
eyed-0.208
uts-0.196
 cheering-0.188
all-0.186
Token friends
Token ID2460
Feature activation+3.09e+00

Pos logit contributions
 attic+0.456
 folder+0.452
 sheet+0.439
 fort+0.434
 drawer+0.426
Neg logit contributions
used-0.473
 cheering-0.439
eyed-0.435
ting-0.411
 joining-0.393
Token ran
Token ID4966
Feature activation+5.21e+00

Pos logit contributions
used+0.113
uts+0.104
eyed+0.099
but+0.097
unch+0.096
Neg logit contributions
 sheet-0.087
 folder-0.086
 fort-0.085
 garage-0.085
 attic-0.085
Token to
Token ID284
Feature activation-2.97e+00

Pos logit contributions
used+0.156
unch+0.137
uts+0.134
qu+0.123
icked+0.115
Neg logit contributions
 fort-0.196
 folder-0.188
 attic-0.188
 garage-0.184
 sheet-0.184
Token the
Token ID262
Feature activation-1.19e+01

Pos logit contributions
 attic+0.072
 garage+0.071
 sheet+0.071
 fort+0.071
 closet+0.069
Neg logit contributions
used-0.125
uts-0.121
eyed-0.112
aring-0.112
unch-0.112
Token policeman
Token ID32166
Feature activation+4.77e+00

Pos logit contributions
 fort+0.257
 attic+0.255
 folder+0.252
 sheet+0.248
 journal+0.235
Neg logit contributions
used-0.524
eyed-0.483
uts-0.465
 cheering-0.454
but-0.451
Token,
Token ID11
Feature activation-1.53e+00

Pos logit contributions
 attic+0.155
 closet+0.152
 sheet+0.152
 fort+0.150
 folder+0.148
Neg logit contributions
used-0.129
eyed-0.123
uts-0.123
but-0.111
ucked-0.110
Token Ben
Token ID3932
Feature activation-4.68e+00

Pos logit contributions
 attic+0.051
 fort+0.049
 proper+0.048
 garage+0.048
 journal+0.047
Neg logit contributions
 cheering-0.043
used-0.043
 joining-0.042
all-0.042
ting-0.039
Token and
Token ID290
Feature activation-7.59e+00

Pos logit contributions
 fort+0.177
 attic+0.176
 folder+0.172
 sheet+0.171
 garage+0.167
Neg logit contributions
used-0.131
eyed-0.115
uts-0.111
unch-0.104
all-0.104
Token Lily
Token ID20037
Feature activation-6.53e+00

Pos logit contributions
 attic+0.240
 fort+0.227
 garage+0.220
 sheet+0.213
 journal+0.210
Neg logit contributions
used-0.239
all-0.222
 joining-0.220
 cheering-0.218
smart-0.215
Token?"
Token ID1701
Feature activation+5.50e+00

Pos logit contributions
 fort+0.234
 attic+0.232
 folder+0.227
 sheet+0.226
 garage+0.222
Neg logit contributions
used-0.199
eyed-0.174
uts-0.169
unch-0.159
but-0.158
Token their
Token ID511
Feature activation-1.47e+01

Pos logit contributions
used+0.216
eyed+0.195
uts+0.191
unch+0.183
all+0.182
Neg logit contributions
 fort-0.150
 attic-0.149
 folder-0.145
 sheet-0.144
 garage-0.140
Token mom
Token ID1995
Feature activation-1.59e+00

Pos logit contributions
 fort+0.439
 attic+0.433
 folder+0.430
 puzzle+0.411
 sheet+0.409
Neg logit contributions
used-0.466
eyed-0.443
 cheering-0.442
uts-0.427
 joining-0.387
Token asked
Token ID1965
Feature activation+5.44e+00

Pos logit contributions
 closet+0.049
 sheet+0.048
 drawer+0.047
 pen+0.046
 scarf+0.046
Neg logit contributions
used-0.058
overed-0.050
seek-0.050
wards-0.048
ucked-0.047
Token.
Token ID13
Feature activation+4.36e+00

Pos logit contributions
used+0.149
uts+0.123
unch+0.121
all+0.121
eyed+0.116
Neg logit contributions
 folder-0.205
 fort-0.205
 closet-0.203
 sheet-0.202
 drawer-0.200
Token "
Token ID366
Feature activation+2.10e+00

Pos logit contributions
used+0.167
qu+0.137
all+0.134
eyed+0.133
beh+0.131
Neg logit contributions
 closet-0.142
 folder-0.137
 attic-0.137
 garage-0.135
 drawer-0.135
TokenWe
Token ID1135
Feature activation-1.58e+00

Pos logit contributions
used+0.055
eyed+0.044
uts+0.044
unch+0.042
 cheering+0.040
Neg logit contributions
 attic-0.084
 fort-0.083
 folder-0.082
 garage-0.081
 sheet-0.081
Token,
Token ID11
Feature activation-1.71e+00

Pos logit contributions
used+0.159
eyed+0.140
uts+0.130
but+0.123
unch+0.123
Neg logit contributions
 fort-0.235
 attic-0.230
 sheet-0.225
 folder-0.225
 garage-0.220
Token Tim
Token ID5045
Feature activation-6.25e+00

Pos logit contributions
 proper+0.055
 folder+0.054
 attic+0.053
 drawer+0.052
 journal+0.052
Neg logit contributions
used-0.050
 cheering-0.047
unch-0.047
uts-0.047
't-0.046
Token and
Token ID290
Feature activation-7.49e+00

Pos logit contributions
 fort+0.276
 attic+0.274
 folder+0.269
 sheet+0.268
 garage+0.264
Neg logit contributions
used-0.139
eyed-0.116
uts-0.110
unch-0.101
but-0.100
Token Mia
Token ID32189
Feature activation-6.61e+00

Pos logit contributions
 attic+0.267
 folder+0.259
 garage+0.257
 fort+0.256
 closet+0.253
Neg logit contributions
used-0.228
eyed-0.193
uts-0.188
all-0.186
but-0.182
Token,"
Token ID553
Feature activation+4.95e+00

Pos logit contributions
 fort+0.311
 attic+0.308
 folder+0.303
 sheet+0.302
 garage+0.298
Neg logit contributions
used-0.128
eyed-0.103
uts-0.098
unch-0.088
but-0.087
Token their
Token ID511
Feature activation-1.46e+01

Pos logit contributions
used+0.167
eyed+0.148
uts+0.143
 cheering+0.140
unch+0.137
Neg logit contributions
 fort-0.162
 attic-0.159
 folder-0.157
 sheet-0.154
 garage-0.151
Token mom
Token ID1995
Feature activation-1.70e+00

Pos logit contributions
 attic+0.404
 folder+0.400
 fort+0.396
 sheet+0.388
 journal+0.376
Neg logit contributions
used-0.521
eyed-0.488
uts-0.469
 cheering-0.460
unch-0.438
Token says
Token ID1139
Feature activation+2.77e+00

Pos logit contributions
 fort+0.047
 closet+0.046
 sheet+0.044
 attic+0.044
 garage+0.043
Neg logit contributions
used-0.067
uts-0.063
eyed-0.061
unch-0.059
but-0.059
Token.
Token ID13
Feature activation+4.16e+00

Pos logit contributions
used+0.092
all+0.077
unch+0.071
Nice+0.070
aro+0.070
Neg logit contributions
 fort-0.089
 folder-0.088
 sheet-0.086
 drawer-0.085
 closet-0.083
Token "
Token ID366
Feature activation+1.76e+00

Pos logit contributions
used+0.188
unch+0.167
eyed+0.160
alling+0.160
all+0.159
Neg logit contributions
 girl-0.104
 attic-0.104
 closet-0.101
 folder-0.100
 fort-0.100
TokenThe
Token ID464
Feature activation-1.11e+01

Pos logit contributions
used+0.048
eyed+0.039
uts+0.038
unch+0.036
all+0.035
Neg logit contributions
 attic-0.069
 fort-0.069
 folder-0.067
 garage-0.067
 sheet-0.067
Token,
Token ID11
Feature activation-9.21e-01

Pos logit contributions
used+0.115
eyed+0.096
uts+0.093
all+0.090
but+0.088
Neg logit contributions
 fort-0.158
 attic-0.156
 folder-0.154
 sheet-0.152
 garage-0.149
Token Ben
Token ID3932
Feature activation-4.67e+00

Pos logit contributions
 fort+0.027
 attic+0.027
 garage+0.026
 folder+0.025
 proper+0.025
Neg logit contributions
used-0.033
 cheering-0.029
uts-0.029
all-0.029
 joining-0.029
Token and
Token ID290
Feature activation-7.28e+00

Pos logit contributions
 fort+0.173
 attic+0.172
 folder+0.168
 sheet+0.167
 garage+0.164
Neg logit contributions
used-0.134
eyed-0.118
uts-0.114
unch-0.107
all-0.107
Token Anna
Token ID11735
Feature activation-7.50e+00

Pos logit contributions
 garage+0.242
 attic+0.240
 fort+0.238
 seas+0.225
 sheet+0.223
Neg logit contributions
used-0.217
eyed-0.183
all-0.178
:-0.178
 joining-0.177
Token?"
Token ID1701
Feature activation+5.62e+00

Pos logit contributions
 fort+0.260
 attic+0.259
 folder+0.253
 sheet+0.251
 garage+0.248
Neg logit contributions
used-0.237
eyed-0.209
uts-0.203
but-0.194
unch-0.192
Token their
Token ID511
Feature activation-1.46e+01

Pos logit contributions
used+0.214
eyed+0.194
uts+0.188
but+0.179
 cheering+0.179
Neg logit contributions
 fort-0.161
 attic-0.158
 folder-0.154
 sheet-0.153
 garage-0.149
Token mom
Token ID1995
Feature activation-1.48e+00

Pos logit contributions
 fort+0.406
 attic+0.402
 folder+0.396
 sheet+0.387
 garage+0.377
Neg logit contributions
used-0.539
eyed-0.504
uts-0.486
 cheering-0.471
unch-0.448
Token asked
Token ID1965
Feature activation+5.48e+00

Pos logit contributions
 closet+0.046
 drawer+0.045
 sheet+0.043
 garage+0.043
 paper+0.042
Neg logit contributions
used-0.049
oor-0.046
wards-0.045
overed-0.044
seek-0.044
Token.
Token ID13
Feature activation+4.56e+00

Pos logit contributions
used+0.142
unch+0.125
uts+0.125
all+0.123
eyed+0.118
Neg logit contributions
 fort-0.209
 folder-0.207
 sheet-0.206
 closet-0.205
 drawer-0.203
Token "
Token ID366
Feature activation+2.09e+00

Pos logit contributions
used+0.161
unch+0.136
all+0.135
eyed+0.135
shadow+0.134
Neg logit contributions
 closet-0.150
 attic-0.149
 sheet-0.149
 folder-0.149
 drawer-0.147
TokenMaybe
Token ID13300
Feature activation+4.15e+00

Pos logit contributions
used+0.063
uts+0.049
eyed+0.049
unch+0.048
icked+0.048
Neg logit contributions
 attic-0.075
 sheet-0.073
 garage-0.072
 folder-0.072
 fort-0.072
Token.
Token ID13
Feature activation+4.72e+00

Pos logit contributions
used+0.239
unch+0.183
eyed+0.170
ucked+0.165
all+0.164
Neg logit contributions
 fort-0.287
 sheet-0.279
 paper-0.272
 folder-0.271
 puzzle-0.263
Token Lily
Token ID20037
Feature activation-6.59e+00

Pos logit contributions
used+0.161
eyed+0.142
uts+0.134
unch+0.129
all+0.129
Neg logit contributions
 fort-0.157
 attic-0.157
 folder-0.153
 sheet-0.152
 garage-0.149
Token and
Token ID290
Feature activation-7.97e+00

Pos logit contributions
 fort+0.241
 attic+0.239
 folder+0.234
 sheet+0.234
 garage+0.229
Neg logit contributions
used-0.196
eyed-0.171
uts-0.164
unch-0.156
but-0.154
Token Max
Token ID5436
Feature activation-5.67e+00

Pos logit contributions
 fort+0.263
 attic+0.260
 seas+0.259
pron+0.248
 sheet+0.247
Neg logit contributions
used-0.241
ting-0.216
 cheering-0.210
uts-0.204
't-0.203
Token and
Token ID290
Feature activation-8.43e+00

Pos logit contributions
 fort+0.164
 sheet+0.161
 folder+0.157
 closet+0.157
 garage+0.156
Neg logit contributions
used-0.207
uts-0.183
eyed-0.182
unch-0.181
all-0.179
Token their
Token ID511
Feature activation-1.46e+01

Pos logit contributions
 attic+0.291
 seas+0.284
 fort+0.284
pron+0.274
 sheet+0.273
Neg logit contributions
used-0.233
ting-0.222
 cheering-0.212
eyed-0.208
smart-0.205
Token mom
Token ID1995
Feature activation-1.49e+00

Pos logit contributions
 attic+0.468
 sheet+0.457
 folder+0.455
 fort+0.444
 drawer+0.441
Neg logit contributions
used-0.447
eyed-0.421
 cheering-0.414
uts-0.406
aro-0.370
Token talked
Token ID6619
Feature activation+8.83e+00

Pos logit contributions
 fort+0.045
 sheet+0.044
 closet+0.043
 garage+0.043
 folder+0.042
Neg logit contributions
used-0.055
uts-0.048
eyed-0.047
but-0.046
unch-0.046
Token to
Token ID284
Feature activation-2.24e+00

Pos logit contributions
used+0.297
unch+0.240
eyed+0.232
uts+0.230
qu+0.222
Neg logit contributions
 fort-0.322
 sheet-0.314
 folder-0.313
 attic-0.309
 journal-0.300
Token the
Token ID262
Feature activation-1.13e+01

Pos logit contributions
 fort+0.073
 attic+0.073
 folder+0.071
 sheet+0.071
 garage+0.069
Neg logit contributions
used-0.076
eyed-0.067
uts-0.066
unch-0.062
but-0.062
Token elephant
Token ID20950
Feature activation+1.57e+00

Pos logit contributions
 fort+0.260
 attic+0.255
 folder+0.243
 sheet+0.239
 garage+0.237
Neg logit contributions
used-0.500
eyed-0.458
uts-0.453
unch-0.437
but-0.431
Token.
Token ID13
Feature activation+4.61e+00

Pos logit contributions
used+0.079
unch+0.064
eyed+0.062
uts+0.061
qu+0.060
Neg logit contributions
 fort-0.112
 attic-0.108
 sheet-0.106
 closet-0.105
 shelf-0.104
Token Tom
Token ID4186
Feature activation-5.88e+00

Pos logit contributions
used+0.155
eyed+0.134
ting+0.124
uts+0.124
all+0.121
Neg logit contributions
 attic-0.158
 fort-0.155
 garage-0.151
 sheet-0.150
 folder-0.150
Token and
Token ID290
Feature activation-8.20e+00

Pos logit contributions
 fort+0.216
 attic+0.214
 folder+0.209
 sheet+0.209
 garage+0.204
Neg logit contributions
used-0.174
eyed-0.152
uts-0.146
unch-0.139
all-0.138
Token Jen
Token ID13364
Feature activation-6.84e+00

Pos logit contributions
 attic+0.359
 fort+0.353
 journal+0.346
 folder+0.343
 puzzle+0.340
Neg logit contributions
used-0.167
eyed-0.159
 cheering-0.141
aro-0.136
all-0.136
Token and
Token ID290
Feature activation-8.24e+00

Pos logit contributions
 fort+0.205
 sheet+0.203
 attic+0.200
 closet+0.197
 garage+0.196
Neg logit contributions
used-0.239
uts-0.210
all-0.209
eyed-0.207
unch-0.206
Token their
Token ID511
Feature activation-1.46e+01

Pos logit contributions
 attic+0.342
 fort+0.339
 folder+0.331
 journal+0.330
 sheet+0.325
Neg logit contributions
used-0.204
eyed-0.184
 cheering-0.168
uts-0.162
ting-0.157
Token wheels
Token ID13666
Feature activation+5.19e+00

Pos logit contributions
 attic+0.443
 folder+0.441
 sheet+0.425
 drawer+0.414
 needle+0.410
Neg logit contributions
eyed-0.433
 cheering-0.432
used-0.431
uts-0.386
 hugging-0.370
Token were
Token ID547
Feature activation-1.99e+00

Pos logit contributions
used+0.186
eyed+0.162
uts+0.156
all+0.154
unch+0.154
Neg logit contributions
 fort-0.158
 sheet-0.154
 attic-0.153
 garage-0.150
 folder-0.149
Token broken
Token ID5445
Feature activation+2.75e+00

Pos logit contributions
 fort+0.063
 attic+0.062
 sheet+0.060
 folder+0.060
 closet+0.059
Neg logit contributions
used-0.070
eyed-0.062
uts-0.060
unch-0.058
but-0.057
Token and
Token ID290
Feature activation-6.36e+00

Pos logit contributions
used+0.062
eyed+0.051
uts+0.047
unch+0.046
all+0.044
Neg logit contributions
 fort-0.124
 attic-0.123
 sheet-0.119
 garage-0.118
 closet-0.117
Token hurt
Token ID5938
Feature activation+3.36e+00

Pos logit contributions
 attic+0.218
 fort+0.218
 folder+0.218
 journal+0.206
 garage+0.205
Neg logit contributions
used-0.180
 cheering-0.169
eyed-0.161
uts-0.160
icked-0.155
Token noticed
Token ID6810
Feature activation+3.31e+00

Pos logit contributions
used+0.002
eyed+0.002
uts+0.002
 cheering+0.002
all+0.002
Neg logit contributions
 attic-0.003
 fort-0.002
 folder-0.002
 sheet-0.002
 garage-0.002
Token Lily
Token ID20037
Feature activation-6.63e+00

Pos logit contributions
used+0.107
eyed+0.096
uts+0.090
unch+0.087
but+0.084
Neg logit contributions
 attic-0.116
 sheet-0.116
 fort-0.115
 folder-0.114
 closet-0.111
Token and
Token ID290
Feature activation-7.85e+00

Pos logit contributions
 fort+0.229
 attic+0.223
 sheet+0.222
 folder+0.222
 closet+0.218
Neg logit contributions
used-0.209
eyed-0.184
uts-0.177
unch-0.171
but-0.170
Token Max
Token ID5436
Feature activation-5.68e+00

Pos logit contributions
 fort+0.244
 attic+0.243
 folder+0.228
 garage+0.226
 sheet+0.223
Neg logit contributions
used-0.273
uts-0.238
 cheering-0.237
ting-0.234
eyed-0.233
Token and
Token ID290
Feature activation-8.32e+00

Pos logit contributions
 fort+0.198
 closet+0.193
 sheet+0.192
 folder+0.189
 garage+0.186
Neg logit contributions
used-0.166
but-0.152
eyed-0.149
all-0.148
uts-0.146
Token their
Token ID511
Feature activation-1.46e+01

Pos logit contributions
 attic+0.301
 fort+0.296
 garage+0.281
 sheet+0.277
 folder+0.274
Neg logit contributions
used-0.243
 cheering-0.215
ting-0.212
eyed-0.207
uts-0.202
Token mom
Token ID1995
Feature activation-1.61e+00

Pos logit contributions
 attic+0.389
 folder+0.382
 sheet+0.379
 fort+0.375
 drawer+0.360
Neg logit contributions
used-0.556
eyed-0.506
 cheering-0.483
uts-0.482
but-0.454
Token.
Token ID13
Feature activation+3.79e+00

Pos logit contributions
 closet+0.055
 fort+0.054
 sheet+0.053
 garage+0.052
 folder+0.052
Neg logit contributions
used-0.051
but-0.045
uts-0.045
eyed-0.044
all-0.043
Token It
Token ID632
Feature activation-3.20e+00

Pos logit contributions
used+0.131
eyed+0.113
uts+0.107
all+0.105
unch+0.105
Neg logit contributions
 attic-0.127
 fort-0.127
 folder-0.125
 sheet-0.123
 garage-0.122
Token lifted
Token ID13663
Feature activation+7.61e+00

Pos logit contributions
 fort+0.099
 attic+0.099
 folder+0.096
 sheet+0.095
 garage+0.093
Neg logit contributions
used-0.112
eyed-0.101
uts-0.098
 cheering-0.093
all-0.093
Token its
Token ID663
Feature activation-1.22e+01

Pos logit contributions
used+0.153
eyed+0.148
ting+0.136
ucked+0.133
 cheering+0.128
Neg logit contributions
 attic-0.338
 fort-0.336
pron-0.328
 garage-0.327
 journal-0.322
Token then
Token ID788
Feature activation-5.25e-01

Pos logit contributions
 fort+0.127
 attic+0.124
 sheet+0.123
 folder+0.122
 closet+0.121
Neg logit contributions
used-0.050
eyed-0.039
all-0.035
uts-0.034
 cheering-0.034
Token,
Token ID11
Feature activation-2.08e+00

Pos logit contributions
 fort+0.021
 attic+0.021
 folder+0.020
 sheet+0.020
 garage+0.020
Neg logit contributions
used-0.014
eyed-0.012
uts-0.012
unch-0.011
but-0.011
Token Tim
Token ID5045
Feature activation-6.97e+00

Pos logit contributions
 fort+0.079
 attic+0.079
 folder+0.078
 sheet+0.078
 closet+0.076
Neg logit contributions
used-0.059
eyed-0.050
all-0.049
uts-0.048
but-0.048
Tokenmy
Token ID1820
Feature activation-4.82e+00

Pos logit contributions
 fort+0.259
 attic+0.252
 folder+0.249
 sheet+0.246
 spaceship+0.243
Neg logit contributions
used-0.199
all-0.182
eyed-0.181
:-0.161
ting-0.157
Token and
Token ID290
Feature activation-8.13e+00

Pos logit contributions
 fort+0.172
 attic+0.171
 folder+0.168
 sheet+0.166
 closet+0.160
Neg logit contributions
used-0.147
eyed-0.131
uts-0.120
all-0.119
 cheering-0.115
Token his
Token ID465
Feature activation-1.45e+01

Pos logit contributions
 fort+0.301
 attic+0.281
 sheet+0.278
 folder+0.271
 closet+0.263
Neg logit contributions
used-0.254
eyed-0.227
all-0.215
uts-0.208
 cheering-0.204
Token dad
Token ID9955
Feature activation+3.08e-01

Pos logit contributions
 fort+0.440
 attic+0.405
 sheet+0.396
 folder+0.390
 proper+0.380
Neg logit contributions
used-0.506
eyed-0.449
uts-0.438
 cheering-0.436
:-0.417
Token have
Token ID423
Feature activation-9.83e-01

Pos logit contributions
used+0.010
eyed+0.009
 cheering+0.008
all+0.008
ting+0.007
Neg logit contributions
 fort-0.011
 attic-0.011
 sheet-0.010
 folder-0.010
 closet-0.010
Token made
Token ID925
Feature activation+3.33e+00

Pos logit contributions
 fort+0.034
 attic+0.034
 folder+0.033
 sheet+0.032
 garage+0.032
Neg logit contributions
used-0.031
eyed-0.028
uts-0.027
 cheering-0.026
all-0.026
Token sure
Token ID1654
Feature activation+3.50e+00

Pos logit contributions
used+0.078
eyed+0.070
 cheering+0.066
all+0.065
uts+0.063
Neg logit contributions
 attic-0.135
 fort-0.135
 folder-0.134
 closet-0.132
 sheet-0.131
Token to
Token ID284
Feature activation-3.05e+00

Pos logit contributions
used+0.060
eyed+0.050
uts+0.048
all+0.043
 cheering+0.042
Neg logit contributions
 fort-0.171
 attic-0.169
 sheet-0.168
 folder-0.167
 closet-0.165
Token.
Token ID13
Feature activation+4.36e+00

Pos logit contributions
used+0.075
eyed+0.063
uts+0.063
 cheering+0.051
all+0.047
Neg logit contributions
 attic-0.290
 fort-0.288
 folder-0.283
 sheet-0.281
 closet-0.279
Token
Token ID198
Feature activation+2.64e+00

Pos logit contributions
used+0.133
ting+0.103
eyed+0.101
unch+0.100
pr+0.099
Neg logit contributions
 attic-0.174
 garage-0.167
 closet-0.164
 fort-0.161
 sheet-0.159
Token
Token ID198
Feature activation+2.28e+00

Pos logit contributions
used+0.093
ting+0.076
ified+0.066
ucked+0.065
uts+0.064
Neg logit contributions
 attic-0.098
 garage-0.093
 fort-0.093
 sheet-0.091
 closet-0.088
TokenLater
Token ID18602
Feature activation+2.77e+00

Pos logit contributions
used+0.067
uts+0.056
ting+0.053
icked+0.052
eyed+0.051
Neg logit contributions
 attic-0.086
 fort-0.083
 garage-0.082
 sheet-0.081
 folder-0.080
Token,
Token ID11
Feature activation-2.42e+00

Pos logit contributions
used+0.056
eyed+0.043
uts+0.042
all+0.039
 cheering+0.039
Neg logit contributions
 fort-0.127
 attic-0.126
 folder-0.123
 garage-0.123
 sheet-0.122
Token their
Token ID511
Feature activation-1.45e+01

Pos logit contributions
 fort+0.089
 attic+0.087
 folder+0.085
 journal+0.085
 sheet+0.084
Neg logit contributions
used-0.071
 cheering-0.058
't-0.058
eyed-0.057
uts-0.057
Token friend
Token ID1545
Feature activation+2.28e+00

Pos logit contributions
pron+0.439
 attic+0.419
 sheet+0.418
 folder+0.416
 drawer+0.409
Neg logit contributions
used-0.391
eyed-0.379
 cheering-0.370
uts-0.362
 joining-0.345
Token Bunny
Token ID28683
Feature activation-2.91e+00

Pos logit contributions
used+0.062
eyed+0.051
uts+0.050
 cheering+0.049
ting+0.048
Neg logit contributions
 attic-0.089
 fort-0.088
 sheet-0.085
 folder-0.084
 garage-0.083
Token came
Token ID1625
Feature activation+5.29e+00

Pos logit contributions
 fort+0.094
 folder+0.090
 sheet+0.089
 closet+0.089
 attic+0.089
Neg logit contributions
used-0.100
uts-0.088
eyed-0.086
but-0.084
unch-0.084
Token to
Token ID284
Feature activation-2.79e+00

Pos logit contributions
used+0.180
uts+0.168
unch+0.162
qu+0.160
overed+0.160
Neg logit contributions
 fort-0.168
 closet-0.155
 attic-0.153
 garage-0.153
 sheet-0.152
Token play
Token ID711
Feature activation+7.84e+00

Pos logit contributions
 fort+0.085
 attic+0.084
 folder+0.082
 sheet+0.081
 garage+0.080
Neg logit contributions
used-0.101
eyed-0.090
uts-0.089
unch-0.084
but-0.084
Token
Token ID198
Feature activation+2.91e+00

Pos logit contributions
used+0.104
ting+0.077
ucked+0.069
uts+0.069
eyed+0.068
Neg logit contributions
 fort-0.132
 attic-0.129
 sheet-0.126
 folder-0.123
 garage-0.122
TokenBy
Token ID3886
Feature activation+4.96e+00

Pos logit contributions
used+0.110
eyed+0.102
uts+0.098
but+0.095
all+0.095
Neg logit contributions
 fort-0.083
 attic-0.081
 folder-0.079
 sheet-0.079
 closet-0.077
Token the
Token ID262
Feature activation-1.31e+01

Pos logit contributions
used+0.169
uts+0.154
eyed+0.153
unch+0.142
but+0.141
Neg logit contributions
 attic-0.158
 fort-0.156
 folder-0.154
 sheet-0.150
 garage-0.146
Token end
Token ID886
Feature activation+2.29e+00

Pos logit contributions
 drawer+0.407
 closet+0.405
 attic+0.403
 fort+0.400
 sheet+0.397
Neg logit contributions
used-0.377
 joining-0.361
all-0.361
 cheering-0.360
eyed-0.339
Token of
Token ID286
Feature activation-5.38e+00

Pos logit contributions
used+0.076
eyed+0.067
ucked+0.058
but+0.058
ting+0.057
Neg logit contributions
 fort-0.087
 sheet-0.080
 folder-0.078
 closet-0.077
 attic-0.077
Token their
Token ID511
Feature activation-1.45e+01

Pos logit contributions
 fort+0.193
 attic+0.190
 folder+0.185
 garage+0.184
 sheet+0.183
Neg logit contributions
used-0.162
eyed-0.143
 cheering-0.136
all-0.134
uts-0.131
Token adventure
Token ID8855
Feature activation+7.57e+00

Pos logit contributions
 attic+0.409
 closet+0.379
 sheet+0.378
 drawer+0.378
 folder+0.371
Neg logit contributions
used-0.487
 cheering-0.467
eyed-0.456
 joining-0.442
aring-0.437
Token,
Token ID11
Feature activation-2.06e+00

Pos logit contributions
used+0.132
 cheering+0.101
uts+0.099
eyed+0.097
all+0.096
Neg logit contributions
 attic-0.360
 fort-0.359
 sheet-0.355
 garage-0.349
 folder-0.348
Token Lily
Token ID20037
Feature activation-6.76e+00

Pos logit contributions
 attic+0.066
 journal+0.065
 sheet+0.065
 fort+0.064
 drawer+0.064
Neg logit contributions
used-0.063
all-0.059
-0.058
 cheering-0.058
but-0.058
Token had
Token ID550
Feature activation-6.06e-01

Pos logit contributions
 fort+0.249
 attic+0.248
 folder+0.241
 sheet+0.239
 garage+0.235
Neg logit contributions
used-0.197
eyed-0.174
uts-0.168
all-0.159
but-0.157
Token learned
Token ID4499
Feature activation+5.26e+00

Pos logit contributions
 fort+0.023
 attic+0.022
 sheet+0.021
 garage+0.021
 folder+0.021
Neg logit contributions
used-0.017
eyed-0.016
uts-0.015
all-0.015
 cheering-0.015
Token you
Token ID345
Feature activation-4.65e+00

Pos logit contributions
 fort+0.053
 proper+0.053
 attic+0.053
 folder+0.052
 puzzle+0.050
Neg logit contributions
used-0.034
uts-0.032
 cheering-0.031
all-0.029
but-0.029
Token're
Token ID821
Feature activation-1.92e+00

Pos logit contributions
 fort+0.147
 attic+0.146
 folder+0.142
 sheet+0.142
 garage+0.138
Neg logit contributions
used-0.161
eyed-0.144
uts-0.140
unch-0.133
but-0.133
Token safe
Token ID3338
Feature activation+3.10e+00

Pos logit contributions
 fort+0.065
 attic+0.065
 folder+0.063
 sheet+0.063
 closet+0.062
Neg logit contributions
used-0.062
eyed-0.055
uts-0.054
unch-0.051
but-0.051
Token now
Token ID783
Feature activation-1.63e+00

Pos logit contributions
used+0.069
eyed+0.057
uts+0.054
unch+0.052
but+0.050
Neg logit contributions
 fort-0.136
 attic-0.135
 folder-0.132
 sheet-0.132
 garage-0.130
Token,"
Token ID553
Feature activation+5.53e+00

Pos logit contributions
 fort+0.071
 attic+0.069
 sheet+0.068
 folder+0.068
 garage+0.067
Neg logit contributions
used-0.037
eyed-0.031
unch-0.029
uts-0.029
but-0.029
Token their
Token ID511
Feature activation-1.45e+01

Pos logit contributions
used+0.149
 cheering+0.142
eyed+0.135
+0.135
uts+0.133
Neg logit contributions
 fort-0.207
 attic-0.196
 proper-0.191
 sheet-0.190
 folder-0.189
Token dad
Token ID9955
Feature activation+6.42e-01

Pos logit contributions
 fort+0.369
 attic+0.360
 folder+0.355
 sheet+0.350
 garage+0.332
Neg logit contributions
used-0.587
eyed-0.547
uts-0.534
 cheering-0.511
unch-0.500
Token said
Token ID531
Feature activation+3.67e+00

Pos logit contributions
used+0.025
uts+0.022
eyed+0.021
unch+0.021
but+0.020
Neg logit contributions
 sheet-0.017
 closet-0.017
 fort-0.017
 paper-0.016
 drawer-0.016
Token.
Token ID13
Feature activation+4.46e+00

Pos logit contributions
used+0.114
all+0.096
unch+0.094
qu+0.091
Nice+0.091
Neg logit contributions
 fort-0.121
 drawer-0.118
 sheet-0.116
 folder-0.115
 journal-0.115
Token He
Token ID679
Feature activation-7.22e+00

Pos logit contributions
used+0.146
eyed+0.114
unch+0.112
qu+0.111
all+0.111
Neg logit contributions
 fort-0.166
 attic-0.166
 garage-0.165
 closet-0.162
 folder-0.160
Token drove
Token ID10357
Feature activation+7.40e+00

Pos logit contributions
 fort+0.256
 attic+0.255
 folder+0.249
 sheet+0.246
 garage+0.241
Neg logit contributions
used-0.221
eyed-0.197
uts-0.188
 cheering-0.178
but-0.178
Token.
Token ID13
Feature activation+4.42e+00

Pos logit contributions
 attic+0.220
 fort+0.213
 folder+0.209
 puzzle+0.196
 sheet+0.196
Neg logit contributions
used-0.455
uts-0.419
eyed-0.415
but-0.401
unch-0.399
Token
Token ID198
Feature activation+2.45e+00

Pos logit contributions
used+0.117
eyed+0.103
uts+0.096
unch+0.092
ting+0.091
Neg logit contributions
 attic-0.173
 fort-0.169
 folder-0.167
 garage-0.167
 closet-0.165
Token
Token ID198
Feature activation+2.26e+00

Pos logit contributions
used+0.054
ting+0.041
uts+0.041
eyed+0.039
but+0.036
Neg logit contributions
 attic-0.112
 fort-0.109
 garage-0.108
 sheet-0.107
 folder-0.106
TokenSoon
Token ID28093
Feature activation-2.60e-01

Pos logit contributions
used+0.065
eyed+0.055
uts+0.055
ting+0.051
unch+0.051
Neg logit contributions
 attic-0.084
 fort-0.083
 folder-0.081
 sheet-0.080
 garage-0.080
Token,
Token ID11
Feature activation-2.37e+00

Pos logit contributions
 attic+0.012
 fort+0.012
 garage+0.011
 folder+0.011
 sheet+0.011
Neg logit contributions
used-0.006
uts-0.004
 cheering-0.004
all-0.004
eyed-0.004
Token their
Token ID511
Feature activation-1.45e+01

Pos logit contributions
 fort+0.090
 proper+0.088
 attic+0.088
 folder+0.086
 garage+0.085
Neg logit contributions
used-0.061
 cheering-0.052
wards-0.051
ting-0.050
all-0.050
Token friends
Token ID2460
Feature activation+3.27e+00

Pos logit contributions
 attic+0.404
 fort+0.393
 sheet+0.387
 folder+0.387
 garage+0.385
Neg logit contributions
used-0.481
 cheering-0.434
eyed-0.423
 joining-0.398
uts-0.395
Token came
Token ID1625
Feature activation+5.35e+00

Pos logit contributions
used+0.108
eyed+0.096
uts+0.093
unch+0.089
but+0.088
Neg logit contributions
 fort-0.108
 attic-0.107
 folder-0.105
 sheet-0.104
 garage-0.103
Token.
Token ID13
Feature activation+4.11e+00

Pos logit contributions
used+0.161
unch+0.144
uts+0.144
qu+0.138
but+0.133
Neg logit contributions
 fort-0.188
 folder-0.180
 attic-0.180
 garage-0.180
 closet-0.179
Token They
Token ID1119
Feature activation-5.82e+00

Pos logit contributions
used+0.155
eyed+0.139
unch+0.134
uts+0.131
qu+0.130
Neg logit contributions
 attic-0.121
 closet-0.119
 folder-0.117
 garage-0.117
 fort-0.113
Token played
Token ID2826
Feature activation+8.17e+00

Pos logit contributions
 fort+0.189
 attic+0.187
 folder+0.182
 sheet+0.181
 garage+0.177
Neg logit contributions
used-0.197
eyed-0.176
uts-0.170
unch-0.162
but-0.161
Token very
Token ID845
Feature activation-2.17e+00

Pos logit contributions
 fort+0.103
 attic+0.098
 garage+0.097
 folder+0.097
 sheet+0.096
Neg logit contributions
used-0.050
all-0.048
't-0.045
 joining-0.044
uts-0.044
Token naughty
Token ID45128
Feature activation+9.54e-01

Pos logit contributions
 fort+0.055
 attic+0.053
 sheet+0.053
 garage+0.051
 folder+0.051
Neg logit contributions
used-0.084
eyed-0.080
uts-0.077
all-0.076
unch-0.075
Token of
Token ID286
Feature activation-3.64e+00

Pos logit contributions
used+0.016
eyed+0.012
uts+0.012
 cheering+0.010
all+0.010
Neg logit contributions
 fort-0.047
 attic-0.047
 folder-0.046
 sheet-0.046
 garage-0.045
Token you
Token ID345
Feature activation-4.33e+00

Pos logit contributions
 fort+0.135
 folder+0.132
 attic+0.132
 sheet+0.129
 garage+0.128
Neg logit contributions
used-0.106
uts-0.090
eyed-0.087
all-0.086
unch-0.085
Token!"
Token ID2474
Feature activation+6.23e+00

Pos logit contributions
 fort+0.196
 attic+0.192
 sheet+0.191
 folder+0.189
 garage+0.185
Neg logit contributions
used-0.096
eyed-0.080
uts-0.072
but-0.070
ucked-0.068
Token their
Token ID511
Feature activation-1.45e+01

Pos logit contributions
used+0.192
eyed+0.173
uts+0.167
 cheering+0.158
but+0.157
Neg logit contributions
 fort-0.220
 attic-0.215
 folder-0.209
 sheet-0.208
 garage-0.203
Token mom
Token ID1995
Feature activation-1.49e+00

Pos logit contributions
 attic+0.449
 folder+0.447
 fort+0.442
 sheet+0.433
 journal+0.421
Neg logit contributions
used-0.445
eyed-0.431
uts-0.415
 cheering-0.396
ucked-0.373
Token said
Token ID531
Feature activation+3.79e+00

Pos logit contributions
 sheet+0.038
 closet+0.038
 paper+0.037
 drawer+0.037
 folder+0.036
Neg logit contributions
used-0.057
overed-0.049
uts-0.049
eyed-0.048
unch-0.047
Token.
Token ID13
Feature activation+4.49e+00

Pos logit contributions
used+0.118
Nice+0.098
seven+0.096
all+0.095
aro+0.092
Neg logit contributions
 folder-0.125
 drawer-0.124
 sheet-0.121
 fort-0.120
 paper-0.118
Token
Token ID198
Feature activation+3.39e+00

Pos logit contributions
used+0.145
eyed+0.110
ting+0.102
all+0.101
fed+0.101
Neg logit contributions
 folder-0.169
 attic-0.169
 closet-0.164
 garage-0.164
 sheet-0.159
Token
Token ID198
Feature activation+2.95e+00

Pos logit contributions
used+0.105
ting+0.079
overed+0.072
ucked+0.072
icked+0.069
Neg logit contributions
 attic-0.135
 garage-0.132
 fort-0.132
 closet-0.125
 sheet-0.123
Token up
Token ID510
Feature activation-2.51e-01

Pos logit contributions
used+0.090
eyed+0.080
uts+0.077
all+0.073
unch+0.072
Neg logit contributions
 fort-0.104
 attic-0.102
 folder-0.100
 sheet-0.099
 garage-0.098
Token with
Token ID351
Feature activation+2.13e-01

Pos logit contributions
 fort+0.010
 attic+0.010
 folder+0.009
 sheet+0.009
 garage+0.009
Neg logit contributions
used-0.007
eyed-0.006
uts-0.006
unch-0.005
but-0.005
Token each
Token ID1123
Feature activation-7.49e+00

Pos logit contributions
used+0.006
 cheering+0.005
eyed+0.005
all+0.005
ting+0.005
Neg logit contributions
 fort-0.008
 attic-0.007
 garage-0.007
 folder-0.007
 journal-0.007
Token other
Token ID584
Feature activation-3.11e+00

Pos logit contributions
 fort+0.220
 attic+0.218
 folder+0.204
 garage+0.203
 closet+0.201
Neg logit contributions
used-0.267
eyed-0.254
 cheering-0.235
ting-0.231
uts-0.231
Token,"
Token ID553
Feature activation+5.57e+00

Pos logit contributions
 attic+0.149
 fort+0.149
 folder+0.145
 sheet+0.144
 garage+0.143
Neg logit contributions
used-0.056
eyed-0.047
uts-0.044
 cheering-0.038
all-0.038
Token their
Token ID511
Feature activation-1.45e+01

Pos logit contributions
used+0.200
eyed+0.179
uts+0.176
unch+0.169
all+0.165
Neg logit contributions
 attic-0.168
 fort-0.167
 folder-0.166
 sheet-0.164
 garage-0.161
Token mom
Token ID1995
Feature activation-1.58e+00

Pos logit contributions
 attic+0.412
 fort+0.409
 folder+0.401
 sheet+0.398
 garage+0.379
Neg logit contributions
used-0.514
eyed-0.484
uts-0.466
 cheering-0.454
but-0.428
Token said
Token ID531
Feature activation+3.75e+00

Pos logit contributions
 girl+0.035
 paper+0.035
 envelope+0.035
 drawer+0.034
 closet+0.034
Neg logit contributions
used-0.066
uts-0.058
unch-0.055
overed-0.055
eyed-0.054
Token.
Token ID13
Feature activation+4.33e+00

Pos logit contributions
used+0.116
unch+0.092
all+0.089
aro+0.089
seven+0.087
Neg logit contributions
 folder-0.126
 drawer-0.124
 journal-0.120
 paper-0.120
 fort-0.119
Token "
Token ID366
Feature activation+1.97e+00

Pos logit contributions
used+0.149
eyed+0.120
unch+0.117
fed+0.115
qu+0.114
Neg logit contributions
 folder-0.153
 attic-0.152
 closet-0.149
 garage-0.148
 sheet-0.144
TokenAnd
Token ID1870
Feature activation-8.29e-01

Pos logit contributions
used+0.050
eyed+0.039
uts+0.039
unch+0.037
 cheering+0.035
Neg logit contributions
 attic-0.082
 fort-0.081
 folder-0.079
 sheet-0.079
 garage-0.079
Token.
Token ID13
Feature activation+4.42e+00

Pos logit contributions
 fort+0.064
 folder+0.063
 sheet+0.062
 attic+0.062
 closet+0.061
Neg logit contributions
used-0.058
uts-0.050
ting-0.049
eyed-0.049
 cheering-0.048
Token
Token ID198
Feature activation+2.45e+00

Pos logit contributions
used+0.117
eyed+0.103
uts+0.096
unch+0.092
ting+0.091
Neg logit contributions
 attic-0.173
 fort-0.169
 folder-0.167
 garage-0.167
 closet-0.165
Token
Token ID198
Feature activation+2.26e+00

Pos logit contributions
used+0.054
ting+0.041
uts+0.041
eyed+0.039
but+0.036
Neg logit contributions
 attic-0.112
 fort-0.109
 garage-0.108
 sheet-0.107
 folder-0.106
TokenSuddenly
Token ID38582
Feature activation+3.64e+00

Pos logit contributions
used+0.065
eyed+0.055
uts+0.055
ting+0.051
unch+0.051
Neg logit contributions
 attic-0.084
 fort-0.083
 folder-0.081
 sheet-0.080
 garage-0.080
Token,
Token ID11
Feature activation-2.24e+00

Pos logit contributions
used+0.058
uts+0.035
wards+0.034
:+0.033
 cheering+0.033
Neg logit contributions
 attic-0.181
 proper-0.179
 fort-0.178
 garage-0.175
 folder-0.172
Token their
Token ID511
Feature activation-1.45e+01

Pos logit contributions
 fort+0.088
 proper+0.086
 attic+0.085
 garage+0.082
 folder+0.082
Neg logit contributions
used-0.058
wards-0.047
 cheering-0.046
ting-0.045
't-0.044
Token mom
Token ID1995
Feature activation-9.31e-01

Pos logit contributions
 attic+0.410
 fort+0.398
 folder+0.387
 garage+0.386
pron+0.382
Neg logit contributions
used-0.489
 cheering-0.434
eyed-0.433
uts-0.417
 joining-0.394
Token came
Token ID1625
Feature activation+5.23e+00

Pos logit contributions
 fort+0.031
 attic+0.030
 sheet+0.030
 folder+0.030
 closet+0.029
Neg logit contributions
used-0.031
eyed-0.028
uts-0.027
but-0.026
unch-0.026
Token outside
Token ID2354
Feature activation+5.08e+00

Pos logit contributions
used+0.165
uts+0.159
unch+0.153
qu+0.143
but+0.140
Neg logit contributions
 fort-0.177
 closet-0.176
 attic-0.176
 sheet-0.173
 garage-0.171
Token and
Token ID290
Feature activation-5.38e+00

Pos logit contributions
used+0.171
uts+0.153
unch+0.147
eyed+0.147
qu+0.146
Neg logit contributions
 fort-0.163
 sheet-0.161
 attic-0.158
 closet-0.158
 folder-0.157
Token saw
Token ID2497
Feature activation+3.04e+00

Pos logit contributions
 fort+0.156
 attic+0.154
 folder+0.151
 sheet+0.150
 garage+0.146
Neg logit contributions
used-0.201
eyed-0.181
uts-0.178
unch-0.170
all-0.167
Token."
Token ID526
Feature activation+7.18e+00

Pos logit contributions
 fort+0.022
 folder+0.022
 attic+0.021
 sheet+0.021
 puzzle+0.021
Neg logit contributions
used-0.016
unch-0.013
eyed-0.012
ucked-0.012
overed-0.012
Token
Token ID198
Feature activation+3.09e+00

Pos logit contributions
used+0.199
eyed+0.160
ting+0.144
uts+0.143
unch+0.140
Neg logit contributions
 attic-0.291
 fort-0.290
 folder-0.285
 closet-0.284
 garage-0.282
Token
Token ID198
Feature activation+2.65e+00

Pos logit contributions
used+0.085
ting+0.060
eyed+0.055
ucked+0.054
uts+0.053
Neg logit contributions
 attic-0.133
 fort-0.133
 sheet-0.128
 garage-0.128
 closet-0.126
TokenSoon
Token ID28093
Feature activation-4.67e-01

Pos logit contributions
used+0.097
eyed+0.089
uts+0.086
but+0.082
unch+0.082
Neg logit contributions
 fort-0.079
 attic-0.078
 folder-0.076
 sheet-0.076
 closet-0.074
Token,
Token ID11
Feature activation-2.24e+00

Pos logit contributions
 fort+0.019
 attic+0.019
 folder+0.019
 sheet+0.019
 garage+0.018
Neg logit contributions
used-0.012
eyed-0.010
uts-0.010
unch-0.009
all-0.009
Token their
Token ID511
Feature activation-1.45e+01

Pos logit contributions
 fort+0.086
 attic+0.084
 proper+0.083
 garage+0.082
 folder+0.082
Neg logit contributions
used-0.054
 cheering-0.049
wards-0.047
ting-0.047
 joining-0.046
Token mom
Token ID1995
Feature activation-1.19e+00

Pos logit contributions
 attic+0.423
pron+0.391
 sheet+0.386
 garage+0.384
 drawer+0.380
Neg logit contributions
 cheering-0.429
used-0.420
uts-0.399
eyed-0.384
 joining-0.384
Token came
Token ID1625
Feature activation+4.88e+00

Pos logit contributions
 fort+0.042
 attic+0.042
 folder+0.041
 sheet+0.041
 garage+0.040
Neg logit contributions
used-0.037
eyed-0.033
uts-0.032
unch-0.030
but-0.030
Token to
Token ID284
Feature activation-2.97e+00

Pos logit contributions
used+0.165
uts+0.145
unch+0.140
eyed+0.136
but+0.135
Neg logit contributions
 fort-0.166
 attic-0.160
 sheet-0.159
 closet-0.159
 folder-0.159
Token check
Token ID2198
Feature activation+1.16e+01

Pos logit contributions
 fort+0.106
 attic+0.105
 folder+0.101
 sheet+0.100
 journal+0.098
Neg logit contributions
used-0.087
eyed-0.079
 cheering-0.077
all-0.073
 joining-0.071
Token on
Token ID319
Feature activation+3.03e-01

Pos logit contributions
used+0.303
eyed+0.259
uts+0.249
unch+0.233
but+0.230
Neg logit contributions
 fort-0.469
 attic-0.466
 folder-0.456
 sheet-0.455
 garage-0.446
Token,
Token ID11
Feature activation-1.20e+00

Pos logit contributions
used+0.076
eyed+0.064
uts+0.062
unch+0.057
all+0.057
Neg logit contributions
 fort-0.127
 attic-0.126
 folder-0.124
 sheet-0.123
 garage-0.121
Token it
Token ID340
Feature activation-2.35e+00

Pos logit contributions
 journal+0.041
 proper+0.041
 drawer+0.041
 folder+0.041
pron+0.040
Neg logit contributions
ting-0.030
 cheering-0.030
used-0.030
 joining-0.029
uts-0.028
Token's
Token ID338
Feature activation-8.65e+00

Pos logit contributions
 fort+0.067
 attic+0.065
 sheet+0.063
 folder+0.063
 garage+0.061
Neg logit contributions
used-0.090
eyed-0.081
uts-0.079
unch-0.077
but-0.076
Token okay
Token ID8788
Feature activation+2.53e+00

Pos logit contributions
 fort+0.353
 folder+0.351
 attic+0.349
 sheet+0.346
 garage+0.340
Neg logit contributions
used-0.210
eyed-0.187
all-0.175
uts-0.168
 cheering-0.164
Token,"
Token ID553
Feature activation+5.63e+00

Pos logit contributions
used+0.059
eyed+0.049
uts+0.045
unch+0.043
but+0.043
Neg logit contributions
 fort-0.113
 attic-0.110
 sheet-0.108
 folder-0.107
 closet-0.105
Token their
Token ID511
Feature activation-1.45e+01

Pos logit contributions
used+0.137
 cheering+0.134
eyed+0.127
+0.125
uts+0.123
Neg logit contributions
 fort-0.215
 attic-0.213
 sheet-0.202
 folder-0.201
 journal-0.199
Token mom
Token ID1995
Feature activation-1.56e+00

Pos logit contributions
 fort+0.350
 attic+0.347
 folder+0.340
 sheet+0.333
 garage+0.321
Neg logit contributions
used-0.602
eyed-0.558
uts-0.544
 cheering-0.522
unch-0.515
Token said
Token ID531
Feature activation+3.73e+00

Pos logit contributions
 sheet+0.039
 closet+0.038
 scarf+0.038
 pen+0.038
 paper+0.037
Neg logit contributions
used-0.067
overed-0.055
eyed-0.053
aro-0.053
ucked-0.052
Token.
Token ID13
Feature activation+4.57e+00

Pos logit contributions
used+0.119
all+0.098
aro+0.094
seven+0.093
Nice+0.091
Neg logit contributions
 fort-0.122
 drawer-0.121
 folder-0.121
 sheet-0.120
 closet-0.117
Token "
Token ID366
Feature activation+2.16e+00

Pos logit contributions
used+0.186
unch+0.151
all+0.151
eyed+0.150
aro+0.144
Neg logit contributions
 attic-0.141
 folder-0.139
 closet-0.139
 fort-0.134
 garage-0.131
TokenYou
Token ID1639
Feature activation-7.13e-01

Pos logit contributions
used+0.055
eyed+0.045
uts+0.044
unch+0.041
 cheering+0.040
Neg logit contributions
 attic-0.088
 fort-0.088
 folder-0.085
 sheet-0.085
 garage-0.084
Token
Token ID198
Feature activation+3.84e+00

Pos logit contributions
used+0.187
ting+0.149
eyed+0.145
 cheering+0.137
 joining+0.134
Neg logit contributions
 garage-0.261
 attic-0.258
 fort-0.244
 closet-0.239
 shelf-0.234
Token
Token ID198
Feature activation+3.39e+00

Pos logit contributions
used+0.127
icked+0.101
ting+0.095
overed+0.094
aro+0.089
Neg logit contributions
 garage-0.132
 shelf-0.128
 attic-0.125
 proper-0.118
 closet-0.115
TokenTom
Token ID13787
Feature activation-3.78e+00

Pos logit contributions
used+0.114
icked+0.091
eyed+0.091
uts+0.090
overed+0.088
Neg logit contributions
 attic-0.113
 garage-0.112
 fort-0.107
 folder-0.106
 shelf-0.104
Tokenmy
Token ID1820
Feature activation-4.79e+00

Pos logit contributions
 attic+0.128
 fort+0.124
 folder+0.124
 sheet+0.121
 journal+0.119
Neg logit contributions
used-0.122
eyed-0.112
all-0.106
ucked-0.099
uts-0.097
Token and
Token ID290
Feature activation-7.98e+00

Pos logit contributions
 fort+0.169
 attic+0.169
 folder+0.164
 sheet+0.161
 garage+0.160
Neg logit contributions
used-0.148
eyed-0.132
uts-0.126
all-0.119
but-0.118
Token his
Token ID465
Feature activation-1.45e+01

Pos logit contributions
 fort+0.290
 attic+0.281
 sheet+0.279
 folder+0.271
 closet+0.266
Neg logit contributions
used-0.238
eyed-0.215
all-0.206
 cheering-0.201
ting-0.195
Token friends
Token ID2460
Feature activation+2.25e+00

Pos logit contributions
 attic+0.509
 fort+0.498
 folder+0.497
 sheet+0.494
 garage+0.491
Neg logit contributions
used-0.420
eyed-0.369
uts-0.357
 cheering-0.348
unch-0.339
Token all
Token ID477
Feature activation-2.72e+00

Pos logit contributions
used+0.076
eyed+0.068
uts+0.064
 cheering+0.062
all+0.061
Neg logit contributions
 fort-0.073
 attic-0.073
 folder-0.070
 sheet-0.070
 garage-0.068
Token worked
Token ID3111
Feature activation+6.69e+00

Pos logit contributions
 fort+0.093
 attic+0.092
 folder+0.089
 sheet+0.087
 garage+0.087
Neg logit contributions
used-0.089
eyed-0.082
 cheering-0.074
uts-0.072
all-0.072
Token together
Token ID1978
Feature activation+2.00e+00

Pos logit contributions
used+0.108
eyed+0.086
uts+0.080
all+0.070
but+0.070
Neg logit contributions
 fort-0.334
 attic-0.333
 folder-0.326
 sheet-0.325
 garage-0.322
Token to
Token ID284
Feature activation-2.62e+00

Pos logit contributions
used+0.056
eyed+0.048
uts+0.046
unch+0.044
qu+0.042
Neg logit contributions
 fort-0.077
 sheet-0.076
 attic-0.076
 folder-0.076
 drawer-0.073
Token the
Token ID262
Feature activation-1.16e+01

Pos logit contributions
used+0.103
 cheering+0.092
eyed+0.091
all+0.090
uts+0.087
Neg logit contributions
 attic-0.160
 fort-0.158
 folder-0.157
 closet-0.154
 sheet-0.152
Token day
Token ID1110
Feature activation+5.56e+00

Pos logit contributions
 proper+0.259
iper+0.238
pron+0.230
 attic+0.217
 closet+0.214
Neg logit contributions
used-0.419
ting-0.408
 below-0.406
 cheering-0.402
 again-0.399
Token so
Token ID523
Feature activation-1.50e+00

Pos logit contributions
used+0.127
eyed+0.111
uts+0.107
 cheering+0.101
all+0.097
Neg logit contributions
 attic-0.238
 fort-0.237
 folder-0.231
 sheet-0.229
 closet-0.226
Token cool
Token ID3608
Feature activation+1.90e+00

Pos logit contributions
 fort+0.050
 attic+0.050
 folder+0.048
 sheet+0.048
 closet+0.047
Neg logit contributions
used-0.051
eyed-0.046
uts-0.044
unch-0.042
 cheering-0.042
Token!
Token ID0
Feature activation+3.83e+00

Pos logit contributions
used+0.040
eyed+0.033
uts+0.032
all+0.029
but+0.029
Neg logit contributions
 attic-0.085
 fort-0.085
 folder-0.083
 sheet-0.082
 garage-0.082
Token<|endoftext|>
Token ID50256
Feature activation+1.37e+01

Pos logit contributions
used+0.066
eyed+0.049
uts+0.045
but+0.040
unch+0.039
Neg logit contributions
 attic-0.191
 fort-0.191
 folder-0.186
 sheet-0.186
 garage-0.185
TokenOnce
Token ID7454
Feature activation+7.11e+00

Pos logit contributions
used+0.388
eyed+0.334
aring+0.325
uts+0.321
shadow+0.298
Neg logit contributions
 attic-0.554
 fort-0.534
 garage-0.527
 sheet-0.523
 closet-0.519
Token upon
Token ID2402
Feature activation-1.36e+00

Pos logit contributions
used+0.267
eyed+0.239
uts+0.235
all+0.227
unch+0.225
Neg logit contributions
 fort-0.200
 attic-0.197
 folder-0.194
 sheet-0.192
 garage-0.186
Token a
Token ID257
Feature activation-7.98e+00

Pos logit contributions
 fort+0.048
 attic+0.047
 folder+0.047
 sheet+0.046
 closet+0.045
Neg logit contributions
used-0.041
eyed-0.037
uts-0.036
unch-0.034
all-0.034
Token time
Token ID640
Feature activation+4.54e+00

Pos logit contributions
 folder+0.166
 attic+0.166
 fort+0.159
 drawer+0.158
 closet+0.157
Neg logit contributions
used-0.335
eyed-0.319
unch-0.316
all-0.310
uts-0.301
Token,
Token ID11
Feature activation-1.05e+00

Pos logit contributions
used+0.129
eyed+0.120
uts+0.109
ting+0.105
but+0.104
Neg logit contributions
 fort-0.169
 attic-0.168
 folder-0.163
 sheet-0.163
 garage-0.160
Token sister
Token ID6621
Feature activation+2.27e+00

Pos logit contributions
used+0.045
 cheering+0.041
uts+0.041
all+0.041
 joining+0.040
Neg logit contributions
 attic-0.027
 fort-0.026
 seas-0.025
 garage-0.024
 envelope-0.024
Token kept
Token ID4030
Feature activation+5.38e+00

Pos logit contributions
used+0.077
eyed+0.068
uts+0.067
all+0.063
but+0.063
Neg logit contributions
 fort-0.073
 attic-0.073
 folder-0.070
 sheet-0.070
 garage-0.069
Token her
Token ID607
Feature activation-1.11e+01

Pos logit contributions
used+0.180
eyed+0.160
uts+0.153
 cheering+0.151
all+0.149
Neg logit contributions
 fort-0.179
 attic-0.176
 sheet-0.169
 folder-0.169
 garage-0.166
Token promise
Token ID6991
Feature activation+8.69e+00

Pos logit contributions
 attic+0.321
 fort+0.308
 seas+0.296
 sheet+0.292
 garage+0.288
Neg logit contributions
used-0.380
 cheering-0.344
eyed-0.339
uts-0.325
 joining-0.318
Token.
Token ID13
Feature activation+4.57e+00

Pos logit contributions
used+0.240
eyed+0.191
unch+0.189
qu+0.184
aro+0.177
Neg logit contributions
 folder-0.346
 attic-0.342
 fort-0.339
 sheet-0.337
 drawer-0.334
Token<|endoftext|>
Token ID50256
Feature activation+1.38e+01

Pos logit contributions
used+0.148
eyed+0.130
 cheering+0.118
qu+0.118
ting+0.114
Neg logit contributions
 attic-0.161
 folder-0.153
 closet-0.151
 drawer-0.150
 garage-0.149
TokenOnce
Token ID7454
Feature activation+7.31e+00

Pos logit contributions
used+0.442
eyed+0.398
aring+0.384
uts+0.380
shadow+0.371
Neg logit contributions
 attic-0.508
 fort-0.479
 sheet-0.476
 garage-0.475
 closet-0.473
Token there
Token ID612
Feature activation+4.96e+00

Pos logit contributions
used+0.274
eyed+0.242
uts+0.240
all+0.234
but+0.230
Neg logit contributions
 fort-0.207
 attic-0.203
 folder-0.197
 sheet-0.196
 garage-0.193
Token was
Token ID373
Feature activation-2.38e+00

Pos logit contributions
used+0.148
eyed+0.126
uts+0.121
all+0.118
 cheering+0.117
Neg logit contributions
 fort-0.183
 attic-0.179
 folder-0.178
 sheet-0.175
 garage-0.170
Token a
Token ID257
Feature activation-8.07e+00

Pos logit contributions
 fort+0.100
 folder+0.097
 drawer+0.096
 garage+0.095
 closet+0.095
Neg logit contributions
used-0.053
eyed-0.047
all-0.047
unch-0.046
uts-0.043
Token big
Token ID1263
Feature activation-5.90e+00

Pos logit contributions
 attic+0.203
 fort+0.193
 folder+0.189
 sheet+0.184
 closet+0.182
Neg logit contributions
used-0.278
unch-0.271
 cheering-0.270
-0.268
eyed-0.268
Token
Token ID198
Feature activation+3.30e+00

Pos logit contributions
used+0.128
eyed+0.107
uts+0.099
all+0.096
unch+0.095
Neg logit contributions
 fort-0.199
 attic-0.199
 folder-0.194
 sheet-0.192
 garage-0.192
Token
Token ID198
Feature activation+2.82e+00

Pos logit contributions
used+0.099
ting+0.078
ucked+0.066
eyed+0.064
 joining+0.063
Neg logit contributions
 fort-0.136
 attic-0.134
 garage-0.132
 sheet-0.128
 closet-0.127
TokenTom
Token ID13787
Feature activation-3.87e+00

Pos logit contributions
used+0.092
eyed+0.082
uts+0.079
unch+0.075
but+0.074
Neg logit contributions
 fort-0.096
 attic-0.095
 folder-0.093
 sheet-0.093
 garage-0.090
Token and
Token ID290
Feature activation-8.02e+00

Pos logit contributions
 attic+0.159
 fort+0.156
 folder+0.154
 sheet+0.151
 journal+0.150
Neg logit contributions
used-0.096
eyed-0.086
uts-0.082
but-0.075
ting-0.075
Token Lily
Token ID20037
Feature activation-6.92e+00

Pos logit contributions
 attic+0.298
 fort+0.286
 sheet+0.277
 journal+0.276
 seas+0.275
Neg logit contributions
-0.200
eyed-0.194
used-0.193
smart-0.193
 cheering-0.181
Token wished
Token ID16555
Feature activation+1.01e+01

Pos logit contributions
 fort+0.208
 attic+0.204
 sheet+0.202
 folder+0.198
 garage+0.198
Neg logit contributions
used-0.251
eyed-0.222
uts-0.219
all-0.215
unch-0.211
Token they
Token ID484
Feature activation-4.88e+00

Pos logit contributions
used+0.295
eyed+0.258
uts+0.254
 cheering+0.239
but+0.238
Neg logit contributions
 fort-0.370
 attic-0.368
 folder-0.361
 sheet-0.357
 garage-0.352
Token had
Token ID550
Feature activation+9.48e-01

Pos logit contributions
 fort+0.140
 attic+0.138
 folder+0.136
 sheet+0.134
 closet+0.129
Neg logit contributions
used-0.180
eyed-0.166
uts-0.163
but-0.152
ting-0.150
Token not
Token ID407
Feature activation-1.36e+00

Pos logit contributions
used+0.032
eyed+0.028
uts+0.027
unch+0.026
but+0.026
Neg logit contributions
 fort-0.032
 attic-0.031
 folder-0.031
 sheet-0.030
 garage-0.030
Token fought
Token ID8350
Feature activation+1.25e+01

Pos logit contributions
 fort+0.039
 folder+0.038
 garage+0.037
 attic+0.037
 sheet+0.037
Neg logit contributions
used-0.052
but-0.045
eyed-0.044
uts-0.044
unch-0.043
Token.
Token ID13
Feature activation+4.34e+00

Pos logit contributions
used+0.331
unch+0.267
eyed+0.260
uts+0.257
but+0.238
Neg logit contributions
 fort-0.501
 folder-0.490
 attic-0.489
 sheet-0.481
 garage-0.476
Token happy
Token ID3772
Feature activation+2.74e+00

Pos logit contributions
 attic+0.094
 folder+0.091
 closet+0.090
 garage+0.088
 drawer+0.088
Neg logit contributions
used-0.096
uts-0.090
eyed-0.083
unch-0.082
ting-0.080
Token to
Token ID284
Feature activation-2.75e+00

Pos logit contributions
used+0.084
eyed+0.074
uts+0.068
unch+0.068
qu+0.066
Neg logit contributions
 fort-0.096
 folder-0.096
 attic-0.095
 sheet-0.094
 drawer-0.094
Token see
Token ID766
Feature activation+3.98e+00

Pos logit contributions
 folder+0.064
 garage+0.062
 sheet+0.061
 shelf+0.061
 closet+0.060
Neg logit contributions
used-0.111
unch-0.102
uts-0.102
overed-0.101
ting-0.099
Token how
Token ID703
Feature activation+3.26e+00

Pos logit contributions
used+0.136
eyed+0.120
uts+0.117
all+0.113
but+0.113
Neg logit contributions
 fort-0.130
 attic-0.126
 folder-0.122
 sheet-0.122
 garage-0.121
Token much
Token ID881
Feature activation+4.00e-01

Pos logit contributions
used+0.125
eyed+0.114
uts+0.110
all+0.106
 cheering+0.105
Neg logit contributions
 fort-0.093
 attic-0.091
 sheet-0.089
 folder-0.088
 garage-0.087
Token joy
Token ID8716
Feature activation+1.00e+01

Pos logit contributions
used+0.009
 cheering+0.009
uts+0.009
all+0.008
 joining+0.008
Neg logit contributions
 fort-0.016
 garage-0.016
 attic-0.015
 closet-0.015
 sheet-0.015
Token the
Token ID262
Feature activation-1.20e+01

Pos logit contributions
used+0.273
uts+0.241
eyed+0.233
all+0.227
ting+0.226
Neg logit contributions
 fort-0.388
 attic-0.381
 garage-0.372
 folder-0.369
 sheet-0.367
Token picture
Token ID4286
Feature activation+1.02e+01

Pos logit contributions
 fort+0.286
 attic+0.264
 sheet+0.256
 proper+0.254
 garage+0.252
Neg logit contributions
used-0.462
eyed-0.425
 cheering-0.423
all-0.403
qu-0.398
Token was
Token ID373
Feature activation-1.25e+00

Pos logit contributions
used+0.297
uts+0.267
eyed+0.261
all+0.249
 cheering+0.249
Neg logit contributions
 fort-0.359
 attic-0.358
 folder-0.346
 garage-0.343
 closet-0.340
Token bringing
Token ID6079
Feature activation+7.79e+00

Pos logit contributions
 fort+0.052
 attic+0.051
 folder+0.051
 sheet+0.050
 garage+0.050
Neg logit contributions
used-0.031
eyed-0.026
uts-0.025
unch-0.024
ucked-0.023
Token her
Token ID607
Feature activation-1.05e+01

Pos logit contributions
used+0.183
eyed+0.159
uts+0.150
all+0.142
 cheering+0.142
Neg logit contributions
 fort-0.330
 attic-0.326
 sheet-0.317
 garage-0.316
 folder-0.314
Token start
Token ID923
Feature activation+4.35e+00

Pos logit contributions
 attic+0.083
 garage+0.081
 fort+0.081
 folder+0.080
 sheet+0.080
Neg logit contributions
used-0.096
uts-0.086
unch-0.085
eyed-0.084
but-0.083
Token exploring
Token ID13504
Feature activation+6.90e+00

Pos logit contributions
used+0.133
eyed+0.118
uts+0.115
unch+0.111
but+0.107
Neg logit contributions
 attic-0.154
 fort-0.154
 folder-0.151
 sheet-0.151
 closet-0.148
Token.
Token ID13
Feature activation+3.91e+00

Pos logit contributions
used+0.161
 cheering+0.132
eyed+0.131
but+0.129
ting+0.127
Neg logit contributions
 fort-0.279
 folder-0.273
 attic-0.266
 garage-0.264
 sheet-0.264
Token Her
Token ID2332
Feature activation-1.04e+01

Pos logit contributions
used+0.116
eyed+0.105
uts+0.097
unch+0.096
ting+0.096
Neg logit contributions
 attic-0.144
 closet-0.137
 drawer-0.135
 fort-0.135
 garage-0.135
Token mom
Token ID1995
Feature activation-4.94e-01

Pos logit contributions
 fort+0.304
 attic+0.297
 sheet+0.291
 folder+0.288
 garage+0.282
Neg logit contributions
used-0.376
eyed-0.321
uts-0.321
 cheering-0.305
all-0.298
Token handed
Token ID10158
Feature activation+9.45e+00

Pos logit contributions
 fort+0.018
 attic+0.017
 folder+0.017
 sheet+0.017
 closet+0.017
Neg logit contributions
used-0.015
eyed-0.013
uts-0.013
unch-0.012
but-0.012
Token her
Token ID607
Feature activation-1.04e+01

Pos logit contributions
used+0.313
eyed+0.278
all+0.260
but+0.255
ting+0.254
Neg logit contributions
 fort-0.329
 attic-0.313
 folder-0.307
 sheet-0.307
 garage-0.305
Token a
Token ID257
Feature activation-8.12e+00

Pos logit contributions
 fort+0.418
 attic+0.393
 garage+0.382
 sheet+0.377
 folder+0.373
Neg logit contributions
used-0.277
eyed-0.241
uts-0.218
all-0.214
but-0.213
Token pretty
Token ID2495
Feature activation-2.23e+00

Pos logit contributions
 fort+0.190
 attic+0.185
 garage+0.160
 closet+0.150
 guy+0.148
Neg logit contributions
used-0.322
eyed-0.297
 cheering-0.293
uts-0.276
unch-0.275
Token notebook
Token ID20922
Feature activation+6.49e+00

Pos logit contributions
 fort+0.054
 attic+0.051
 garage+0.049
 proper+0.046
 guy+0.043
Neg logit contributions
used-0.087
 cheering-0.077
eyed-0.074
:-0.074
wards-0.073
Token so
Token ID523
Feature activation-1.50e+00

Pos logit contributions
used+0.163
eyed+0.139
uts+0.131
unch+0.124
all+0.122
Neg logit contributions
 fort-0.267
 attic-0.265
 folder-0.261
 sheet-0.260
 garage-0.255
Token a
Token ID257
Feature activation-8.92e+00

Pos logit contributions
used+0.084
eyed+0.072
uts+0.068
but+0.066
all+0.066
Neg logit contributions
 fort-0.111
 attic-0.108
 folder-0.106
 sheet-0.105
 garage-0.104
Token piece
Token ID3704
Feature activation+6.46e+00

Pos logit contributions
 attic+0.198
 fort+0.195
iper+0.173
 guys+0.172
 guy+0.164
Neg logit contributions
 cheering-0.327
used-0.323
 deciding-0.295
 joining-0.295
eyed-0.295
Token of
Token ID286
Feature activation-5.68e+00

Pos logit contributions
used+0.099
eyed+0.082
uts+0.080
but+0.070
all+0.068
Neg logit contributions
 attic-0.317
 fort-0.316
 folder-0.308
 garage-0.306
 closet-0.305
Token cloth
Token ID16270
Feature activation+6.58e+00

Pos logit contributions
 fort+0.107
 attic+0.098
 garage+0.088
 closet+0.084
 folder+0.082
Neg logit contributions
used-0.244
 cheering-0.234
wards-0.222
eyed-0.222
 joining-0.218
Token and
Token ID290
Feature activation-5.94e+00

Pos logit contributions
used+0.149
uts+0.127
eyed+0.127
but+0.114
all+0.112
Neg logit contributions
 fort-0.281
 attic-0.280
 folder-0.273
 sheet-0.270
 garage-0.267
Token put
Token ID1234
Feature activation+5.43e+00

Pos logit contributions
 fort+0.219
 attic+0.214
 folder+0.206
 garage+0.200
 closet+0.199
Neg logit contributions
used-0.167
eyed-0.147
uts-0.146
 cheering-0.140
but-0.140
Token it
Token ID340
Feature activation-1.41e+00

Pos logit contributions
used+0.111
eyed+0.091
all+0.083
ting+0.082
uts+0.081
Neg logit contributions
 fort-0.249
 attic-0.245
 folder-0.237
 garage-0.236
 closet-0.234
Token around
Token ID1088
Feature activation+3.57e+00

Pos logit contributions
 fort+0.058
 attic+0.058
 folder+0.057
 sheet+0.057
 garage+0.056
Neg logit contributions
used-0.034
eyed-0.029
uts-0.028
but-0.026
all-0.026
Token the
Token ID262
Feature activation-1.22e+01

Pos logit contributions
used+0.108
eyed+0.094
uts+0.088
 cheering+0.088
all+0.088
Neg logit contributions
 fort-0.128
 attic-0.126
 sheet-0.122
 folder-0.122
 garage-0.120
Token triangle
Token ID22950
Feature activation+2.50e+00

Pos logit contributions
 fort+0.246
iper+0.241
 guy+0.227
 germ+0.224
 attic+0.222
Neg logit contributions
used-0.459
eyed-0.440
 cheering-0.424
-0.415
but-0.413
Token.
Token ID13
Feature activation+4.58e+00

Pos logit contributions
used+0.044
eyed+0.037
uts+0.037
but+0.032
all+0.030
Neg logit contributions
 attic-0.115
 fort-0.114
 folder-0.111
 garage-0.111
 closet-0.110
Token up
Token ID510
Feature activation+2.52e-01

Pos logit contributions
used+0.188
eyed+0.162
uts+0.155
all+0.146
unch+0.145
Neg logit contributions
 fort-0.271
 attic-0.269
 folder-0.262
 sheet-0.261
 garage-0.258
Token a
Token ID257
Feature activation-8.62e+00

Pos logit contributions
used+0.007
eyed+0.006
uts+0.006
all+0.006
 cheering+0.006
Neg logit contributions
 attic-0.009
 fort-0.009
 garage-0.009
 folder-0.009
 sheet-0.009
Token pe
Token ID613
Feature activation-4.51e-01

Pos logit contributions
 fort+0.194
 attic+0.190
 folder+0.184
 sheet+0.183
 garage+0.177
Neg logit contributions
used-0.379
eyed-0.347
uts-0.340
unch-0.326
but-0.325
Tokenb
Token ID65
Feature activation+2.18e+00

Pos logit contributions
 fort+0.015
 sheet+0.015
 attic+0.015
 folder+0.015
 drawer+0.014
Neg logit contributions
used-0.014
uts-0.013
eyed-0.012
 cheering-0.012
unch-0.012
Tokenble
Token ID903
Feature activation+1.58e+00

Pos logit contributions
used+0.067
 cheering+0.061
wards+0.060
eyed+0.059
ting+0.058
Neg logit contributions
 sheet-0.069
 fort-0.068
 drawer-0.067
 necklace-0.066
 scarf-0.066
Token too
Token ID1165
Feature activation+1.16e+00

Pos logit contributions
used+0.042
eyed+0.035
uts+0.033
qu+0.031
unch+0.031
Neg logit contributions
 fort-0.064
 sheet-0.064
 folder-0.062
 drawer-0.061
 shelf-0.060
Token.
Token ID13
Feature activation+4.57e+00

Pos logit contributions
used+0.042
ting+0.035
alled+0.035
eyed+0.035
oned+0.035
Neg logit contributions
 fort-0.034
 paper-0.034
 sheet-0.033
 shelf-0.033
 boring-0.031
Token He
Token ID679
Feature activation-6.96e+00

Pos logit contributions
used+0.152
eyed+0.126
qu+0.120
ting+0.119
pr+0.117
Neg logit contributions
 fort-0.159
 attic-0.156
 drawer-0.154
 closet-0.152
 folder-0.152
Token threw
Token ID9617
Feature activation+6.79e+00

Pos logit contributions
 attic+0.253
 fort+0.248
 folder+0.243
 garage+0.235
 journal+0.235
Neg logit contributions
used-0.205
eyed-0.187
uts-0.181
all-0.173
 cheering-0.169
Token it
Token ID340
Feature activation-1.35e+00

Pos logit contributions
used+0.152
eyed+0.129
 cheering+0.120
all+0.119
ting+0.118
Neg logit contributions
 attic-0.298
 fort-0.295
 garage-0.286
 folder-0.285
 closet-0.282
Token as
Token ID355
Feature activation+2.49e+00

Pos logit contributions
 fort+0.053
 attic+0.052
 folder+0.051
 sheet+0.051
 garage+0.050
Neg logit contributions
used-0.037
eyed-0.031
uts-0.030
unch-0.029
 cheering-0.028
Token back
Token ID736
Feature activation-8.27e-01

Pos logit contributions
used+0.186
unch+0.161
uts+0.154
eyed+0.145
qu+0.145
Neg logit contributions
 fort-0.222
 folder-0.221
 sheet-0.214
 attic-0.213
 closet-0.211
Token out
Token ID503
Feature activation+2.37e-01

Pos logit contributions
 fort+0.027
 attic+0.027
 closet+0.026
 folder+0.026
 sheet+0.026
Neg logit contributions
used-0.027
unch-0.026
uts-0.024
qu-0.024
ucked-0.023
Token of
Token ID286
Feature activation-4.16e+00

Pos logit contributions
used+0.007
unch+0.006
uts+0.006
eyed+0.006
qu+0.005
Neg logit contributions
 fort-0.008
 sheet-0.008
 folder-0.008
 closet-0.008
 drawer-0.008
Token the
Token ID262
Feature activation-1.26e+01

Pos logit contributions
 fort+0.127
 attic+0.126
 folder+0.122
 sheet+0.121
 garage+0.119
Neg logit contributions
used-0.149
eyed-0.134
uts-0.130
unch-0.124
all-0.124
Token office
Token ID2607
Feature activation+4.68e+00

Pos logit contributions
 fort+0.265
kin+0.264
 sheet+0.261
 folder+0.258
 journal+0.248
Neg logit contributions
used-0.486
eyed-0.452
 cheering-0.444
wards-0.414
ting-0.413
Token.
Token ID13
Feature activation+3.89e+00

Pos logit contributions
used+0.100
eyed+0.080
uts+0.075
unch+0.073
all+0.069
Neg logit contributions
 fort-0.211
 attic-0.210
 folder-0.206
 sheet-0.205
 closet-0.204
Token She
Token ID1375
Feature activation-8.06e+00

Pos logit contributions
used+0.102
eyed+0.091
qu+0.085
ting+0.083
uts+0.082
Neg logit contributions
 attic-0.152
 folder-0.146
 closet-0.146
 garage-0.143
 drawer-0.142
Token was
Token ID373
Feature activation-2.96e+00

Pos logit contributions
 fort+0.296
 attic+0.292
 folder+0.287
 sheet+0.281
 garage+0.276
Neg logit contributions
used-0.242
eyed-0.214
uts-0.201
all-0.194
 cheering-0.193
Token so
Token ID523
Feature activation-1.85e+00

Pos logit contributions
 attic+0.105
 fort+0.103
 closet+0.103
 folder+0.102
 sheet+0.102
Neg logit contributions
used-0.086
uts-0.077
eyed-0.074
unch-0.071
qu-0.070
Token excited
Token ID6568
Feature activation+5.38e+00

Pos logit contributions
 attic+0.050
 folder+0.048
 closet+0.047
 proper+0.046
 drawer+0.046
Neg logit contributions
used-0.065
ting-0.061
uts-0.059
but-0.057
oned-0.057
Token to
Token ID284
Feature activation-2.79e+00

Pos logit contributions
used+0.132
eyed+0.113
uts+0.103
unch+0.101
ucked+0.096
Neg logit contributions
 fort-0.224
 attic-0.221
 folder-0.218
 sheet-0.217
 closet-0.212
Token so
Token ID523
Feature activation-1.73e+00

Pos logit contributions
used+0.018
eyed+0.016
uts+0.015
all+0.015
 cheering+0.015
Neg logit contributions
 fort-0.024
 attic-0.023
 sheet-0.023
 folder-0.023
 garage-0.022
Token much
Token ID881
Feature activation-2.46e-01

Pos logit contributions
 fort+0.062
 attic+0.061
 folder+0.060
 sheet+0.060
 garage+0.059
Neg logit contributions
used-0.053
eyed-0.047
uts-0.045
unch-0.043
all-0.042
Token fun
Token ID1257
Feature activation+5.90e+00

Pos logit contributions
 fort+0.005
 attic+0.005
 folder+0.005
 sheet+0.005
 garage+0.005
Neg logit contributions
used-0.011
eyed-0.010
uts-0.010
unch-0.010
but-0.010
Token,
Token ID11
Feature activation-1.83e+00

Pos logit contributions
used+0.126
eyed+0.104
uts+0.096
unch+0.090
but+0.087
Neg logit contributions
 fort-0.267
 attic-0.264
 folder-0.260
 sheet-0.259
 garage-0.254
Token they
Token ID484
Feature activation-5.94e+00

Pos logit contributions
 attic+0.075
 fort+0.074
 folder+0.072
 sheet+0.071
 garage+0.071
Neg logit contributions
used-0.041
 cheering-0.040
all-0.037
uts-0.037
but-0.036
Token didn
Token ID1422
Feature activation+7.86e+00

Pos logit contributions
 fort+0.218
 attic+0.214
 sheet+0.206
 folder+0.205
 garage+0.203
Neg logit contributions
used-0.178
eyed-0.159
uts-0.149
 cheering-0.144
all-0.142
Token't
Token ID470
Feature activation-6.48e-01

Pos logit contributions
used+0.156
eyed+0.123
ting+0.113
unch+0.102
but+0.101
Neg logit contributions
 attic-0.351
 sheet-0.351
 fort-0.347
 puzzle-0.345
 garage-0.344
Token want
Token ID765
Feature activation+1.02e+01

Pos logit contributions
 fort+0.022
 attic+0.022
 folder+0.022
 sheet+0.021
 garage+0.021
Neg logit contributions
used-0.021
eyed-0.018
uts-0.018
unch-0.017
but-0.017
Token the
Token ID262
Feature activation-1.15e+01

Pos logit contributions
used+0.212
eyed+0.176
uts+0.170
all+0.155
but+0.152
Neg logit contributions
 fort-0.464
 attic-0.461
 folder-0.451
 sheet-0.449
 garage-0.444
Token day
Token ID1110
Feature activation+5.62e+00

Pos logit contributions
 attic+0.268
 fort+0.267
 folder+0.250
 proper+0.244
 closet+0.243
Neg logit contributions
used-0.434
 cheering-0.420
 joining-0.393
all-0.384
wards-0.379
Token to
Token ID284
Feature activation-1.76e+00

Pos logit contributions
used+0.123
eyed+0.102
uts+0.097
unch+0.089
but+0.088
Neg logit contributions
 fort-0.250
 attic-0.248
 folder-0.243
 sheet-0.242
 garage-0.239
Token little
Token ID1310
Feature activation-5.83e+00

Pos logit contributions
 fort+0.283
 attic+0.262
 folder+0.248
 proper+0.242
 sheet+0.234
Neg logit contributions
used-0.439
 cheering-0.395
eyed-0.383
all-0.379
 joining-0.369
Token girl
Token ID2576
Feature activation-3.63e+00

Pos logit contributions
 fort+0.129
 attic+0.122
 proper+0.121
 garage+0.107
 journal+0.106
Neg logit contributions
used-0.244
ting-0.216
 cheering-0.212
eyed-0.202
all-0.202
Token had
Token ID550
Feature activation-9.40e-01

Pos logit contributions
 fort+0.127
 attic+0.126
 folder+0.123
 sheet+0.121
 closet+0.118
Neg logit contributions
used-0.115
eyed-0.102
uts-0.097
all-0.092
but-0.092
Token so
Token ID523
Feature activation-2.35e+00

Pos logit contributions
 fort+0.034
 attic+0.034
 folder+0.033
 sheet+0.033
 garage+0.033
Neg logit contributions
used-0.028
eyed-0.025
uts-0.024
unch-0.023
but-0.022
Token much
Token ID881
Feature activation-5.21e-01

Pos logit contributions
 fort+0.083
 attic+0.082
 folder+0.080
 sheet+0.080
 garage+0.078
Neg logit contributions
used-0.074
eyed-0.065
uts-0.063
unch-0.059
but-0.059
Token fun
Token ID1257
Feature activation+5.12e+00

Pos logit contributions
 fort+0.009
 attic+0.009
 folder+0.009
 sheet+0.009
 closet+0.008
Neg logit contributions
used-0.026
eyed-0.024
uts-0.023
but-0.023
ting-0.022
Token!
Token ID0
Feature activation+3.44e+00

Pos logit contributions
used+0.146
eyed+0.129
uts+0.118
unch+0.113
icked+0.113
Neg logit contributions
 fort-0.194
 attic-0.194
 sheet-0.189
 folder-0.188
 garage-0.185
Token She
Token ID1375
Feature activation-8.02e+00

Pos logit contributions
used+0.092
eyed+0.080
uts+0.077
unch+0.072
 cheering+0.071
Neg logit contributions
 attic-0.135
 fort-0.135
 folder-0.132
 sheet-0.131
 garage-0.129
Token laughed
Token ID13818
Feature activation+5.07e+00

Pos logit contributions
 fort+0.292
 attic+0.288
 folder+0.282
 sheet+0.277
 journal+0.270
Neg logit contributions
used-0.243
eyed-0.220
all-0.201
uts-0.200
 cheering-0.199
Token as
Token ID355
Feature activation+1.76e+00

Pos logit contributions
used+0.154
eyed+0.135
uts+0.129
unch+0.124
qu+0.118
Neg logit contributions
 fort-0.184
 attic-0.182
 sheet-0.179
 folder-0.179
 closet-0.173
Token the
Token ID262
Feature activation-1.17e+01

Pos logit contributions
used+0.040
all+0.035
but+0.034
uts+0.033
 cheering+0.031
Neg logit contributions
 fort-0.079
 closet-0.074
 garage-0.074
 folder-0.073
 attic-0.072
Token the
Token ID262
Feature activation-1.12e+01

Pos logit contributions
 fort+0.062
 attic+0.060
 folder+0.060
 sheet+0.058
 journal+0.058
Neg logit contributions
used-0.049
eyed-0.044
 cheering-0.043
uts-0.042
all-0.042
Token park
Token ID3952
Feature activation+6.39e+00

Pos logit contributions
 fort+0.273
 sheet+0.268
 folder+0.262
 journal+0.257
 attic+0.250
Neg logit contributions
used-0.406
eyed-0.369
all-0.368
 cheering-0.365
't-0.364
Token to
Token ID284
Feature activation-1.85e+00

Pos logit contributions
used+0.175
eyed+0.142
uts+0.135
unch+0.131
ucked+0.123
Neg logit contributions
 fort-0.268
 attic-0.260
 garage-0.256
 folder-0.253
 sheet-0.250
Token play
Token ID711
Feature activation+7.96e+00

Pos logit contributions
 fort+0.062
 attic+0.061
 folder+0.060
 sheet+0.060
 garage+0.058
Neg logit contributions
used-0.060
eyed-0.054
uts-0.052
unch-0.049
all-0.049
Token with
Token ID351
Feature activation+8.73e-01

Pos logit contributions
used+0.213
eyed+0.172
uts+0.164
all+0.148
aring+0.145
Neg logit contributions
 fort-0.348
 sheet-0.336
 folder-0.335
 attic-0.335
 garage-0.325
Token other
Token ID584
Feature activation-3.09e+00

Pos logit contributions
used+0.028
eyed+0.026
uts+0.025
all+0.024
unch+0.024
Neg logit contributions
 fort-0.028
 attic-0.028
 folder-0.028
 sheet-0.027
 drawer-0.026
Token kids
Token ID3988
Feature activation+5.31e+00

Pos logit contributions
 fort+0.079
 folder+0.079
 attic+0.078
 sheet+0.077
 journal+0.074
Neg logit contributions
used-0.121
eyed-0.109
uts-0.108
 cheering-0.105
all-0.105
Token in
Token ID287
Feature activation-3.69e+00

Pos logit contributions
used+0.127
eyed+0.100
unch+0.096
uts+0.094
overed+0.093
Neg logit contributions
 fort-0.232
 attic-0.228
 folder-0.225
 closet-0.223
 garage-0.222
Token a
Token ID257
Feature activation-8.65e+00

Pos logit contributions
 fort+0.117
 attic+0.116
 folder+0.113
 sheet+0.112
 garage+0.110
Neg logit contributions
used-0.129
eyed-0.116
uts-0.113
unch-0.107
but-0.107
Token club
Token ID3430
Feature activation+5.86e+00

Pos logit contributions
 attic+0.205
 sheet+0.205
 folder+0.199
 journal+0.197
kin+0.194
Neg logit contributions
used-0.293
 cheering-0.286
eyed-0.276
-0.275
unch-0.256
Token.
Token ID13
Feature activation+5.11e+00

Pos logit contributions
used+0.123
eyed+0.098
uts+0.093
unch+0.086
all+0.084
Neg logit contributions
 fort-0.274
 attic-0.274
 folder-0.270
 sheet-0.266
 garage-0.265
Token the
Token ID262
Feature activation-1.19e+01

Pos logit contributions
used+0.011
eyed+0.010
uts+0.010
 cheering+0.009
all+0.009
Neg logit contributions
 fort-0.013
 attic-0.013
 folder-0.013
 closet-0.013
 garage-0.013
Token eggs
Token ID9653
Feature activation+2.20e+00

Pos logit contributions
 attic+0.336
 folder+0.335
 journal+0.330
 fort+0.328
 garage+0.320
Neg logit contributions
used-0.417
 cheering-0.368
unch-0.364
uts-0.362
but-0.353
Token away
Token ID1497
Feature activation+4.14e-03

Pos logit contributions
used+0.063
eyed+0.051
all+0.049
 Well+0.049
unch+0.048
Neg logit contributions
 sheet-0.083
 fort-0.083
 folder-0.078
 attic-0.078
 garage-0.077
Token.
Token ID13
Feature activation+4.96e+00

Pos logit contributions
used+0.000
eyed+0.000
unch+0.000
uts+0.000
 cheering+0.000
Neg logit contributions
 fort-0.000
 attic-0.000
 sheet-0.000
 folder-0.000
 garage-0.000
Token The
Token ID383
Feature activation-1.04e+01

Pos logit contributions
used+0.141
eyed+0.124
uts+0.118
unch+0.112
all+0.110
Neg logit contributions
 fort-0.188
 attic-0.187
 sheet-0.182
 folder-0.182
 garage-0.179
Token bird
Token ID6512
Feature activation-6.42e-01

Pos logit contributions
 folder+0.294
 attic+0.285
 fort+0.284
 journal+0.281
 proper+0.279
Neg logit contributions
used-0.367
 cheering-0.327
unch-0.325
eyed-0.324
uts-0.318
Token was
Token ID373
Feature activation-3.39e+00

Pos logit contributions
 attic+0.023
 fort+0.022
 folder+0.022
 closet+0.022
 garage+0.022
Neg logit contributions
used-0.020
eyed-0.018
uts-0.016
but-0.016
all-0.016
Token very
Token ID845
Feature activation-2.67e+00

Pos logit contributions
 fort+0.081
 attic+0.078
 sheet+0.076
 closet+0.075
 folder+0.074
Neg logit contributions
used-0.143
uts-0.125
eyed-0.125
but-0.122
qu-0.122
Token upset
Token ID9247
Feature activation+6.39e+00

Pos logit contributions
 fort+0.066
 attic+0.064
 folder+0.062
 sheet+0.061
 garage+0.060
Neg logit contributions
used-0.111
eyed-0.098
uts-0.097
ting-0.096
but-0.095
Token because
Token ID780
Feature activation-7.94e-01

Pos logit contributions
used+0.228
eyed+0.205
qu+0.184
ucked+0.178
unch+0.178
Neg logit contributions
 fort-0.211
 attic-0.205
 sheet-0.204
 folder-0.192
 closet-0.192
Token it
Token ID340
Feature activation-1.85e+00

Pos logit contributions
 fort+0.032
 attic+0.031
 folder+0.031
 sheet+0.030
 closet+0.030
Neg logit contributions
used-0.021
eyed-0.018
uts-0.018
all-0.017
but-0.017
Token can
Token ID460
Feature activation+6.46e+00

Pos logit contributions
 fort+0.101
 attic+0.100
 folder+0.097
 sheet+0.096
 garage+0.093
Neg logit contributions
used-0.170
eyed-0.155
uts-0.151
unch-0.145
but-0.144
Token.
Token ID13
Feature activation+5.14e+00

Pos logit contributions
used+0.227
overed+0.220
year+0.219
ting+0.214
alled+0.213
Neg logit contributions
 fort-0.188
 doll-0.174
 sheet-0.174
 garage-0.174
 needle-0.173
Token Hold
Token ID9340
Feature activation+3.07e+00

Pos logit contributions
used+0.156
eyed+0.140
uts+0.138
all+0.130
unch+0.130
Neg logit contributions
 fort-0.178
 attic-0.178
 garage-0.173
 sheet-0.171
 closet-0.170
Token his
Token ID465
Feature activation-1.20e+01

Pos logit contributions
used+0.102
eyed+0.090
uts+0.087
unch+0.083
but+0.082
Neg logit contributions
 fort-0.103
 attic-0.102
 folder-0.099
 sheet-0.099
 garage-0.097
Token paw
Token ID41827
Feature activation+6.23e+00

Pos logit contributions
 attic+0.345
 fort+0.340
 folder+0.331
 sheet+0.322
 garage+0.320
Neg logit contributions
used-0.451
eyed-0.407
uts-0.396
unch-0.374
but-0.373
Token and
Token ID290
Feature activation-4.88e+00

Pos logit contributions
used+0.155
eyed+0.133
uts+0.127
but+0.117
unch+0.116
Neg logit contributions
 fort-0.257
 attic-0.255
 folder-0.250
 sheet-0.248
 garage-0.245
Token say
Token ID910
Feature activation+5.48e+00

Pos logit contributions
 fort+0.160
 attic+0.159
 folder+0.155
 sheet+0.152
 garage+0.150
Neg logit contributions
used-0.165
eyed-0.146
uts-0.141
 cheering-0.134
unch-0.133
Token '
Token ID705
Feature activation+3.26e+00

Pos logit contributions
used+0.184
eyed+0.173
unch+0.156
but+0.155
uts+0.154
Neg logit contributions
 fort-0.173
 sheet-0.172
 scarf-0.166
 folder-0.165
 attic-0.164
Tokenshake
Token ID32431
Feature activation+6.44e+00

Pos logit contributions
used+0.099
uts+0.089
eyed+0.084
overed+0.084
 cheering+0.083
Neg logit contributions
 folder-0.099
 sheet-0.097
 needle-0.097
pron-0.097
 proper-0.097
Token'.
Token ID4458
Feature activation+6.25e+00

Pos logit contributions
used+0.132
eyed+0.108
uts+0.106
unch+0.097
but+0.095
Neg logit contributions
 attic-0.286
 folder-0.286
 fort-0.285
 sheet-0.285
 garage-0.276
Token He
Token ID679
Feature activation-6.24e+00

Pos logit contributions
used+0.184
eyed+0.161
uts+0.153
 cheering+0.148
but+0.145
Neg logit contributions
 attic-0.226
 fort-0.221
 garage-0.218
 folder-0.216
 sheet-0.215
Token the
Token ID262
Feature activation-1.26e+01

Pos logit contributions
 fort+0.106
 attic+0.105
 folder+0.103
 sheet+0.102
 garage+0.100
Neg logit contributions
used-0.099
eyed-0.087
uts-0.085
unch-0.080
all-0.080
Token dry
Token ID5894
Feature activation+2.15e+00

Pos logit contributions
 fort+0.270
 attic+0.270
 proper+0.265
 folder+0.256
 sheet+0.247
Neg logit contributions
used-0.477
 cheering-0.457
eyed-0.429
ting-0.417
 joining-0.413
Token letter
Token ID3850
Feature activation+8.35e+00

Pos logit contributions
used+0.077
eyed+0.071
ting+0.071
 cheering+0.068
uts+0.068
Neg logit contributions
 journal-0.052
 proper-0.051
 fort-0.051
 folder-0.049
 attic-0.049
Token and
Token ID290
Feature activation-5.98e+00

Pos logit contributions
used+0.152
eyed+0.124
uts+0.123
but+0.107
 cheering+0.105
Neg logit contributions
 attic-0.392
 fort-0.389
 folder-0.380
 sheet-0.378
 garage-0.373
Token he
Token ID339
Feature activation-7.09e+00

Pos logit contributions
 fort+0.236
 attic+0.231
 sheet+0.228
 folder+0.225
 puzzle+0.216
Neg logit contributions
used-0.155
eyed-0.134
 cheering-0.130
uts-0.126
but-0.117
Token never
Token ID1239
Feature activation-5.07e-01

Pos logit contributions
 fort+0.263
 attic+0.263
 folder+0.254
 sheet+0.253
 garage+0.246
Neg logit contributions
used-0.208
eyed-0.187
uts-0.175
 cheering-0.166
all-0.165
Token got
Token ID1392
Feature activation+4.50e+00

Pos logit contributions
 fort+0.016
 attic+0.016
 folder+0.016
 sheet+0.015
 garage+0.015
Neg logit contributions
used-0.018
eyed-0.016
uts-0.015
unch-0.014
all-0.014
Token his
Token ID465
Feature activation-1.20e+01

Pos logit contributions
used+0.131
eyed+0.115
uts+0.109
all+0.103
but+0.102
Neg logit contributions
 fort-0.169
 attic-0.168
 sheet-0.163
 folder-0.163
 garage-0.160
Token magical
Token ID10883
Feature activation-1.08e-01

Pos logit contributions
 fort+0.281
 attic+0.279
 sheet+0.258
 folder+0.254
 garage+0.253
Neg logit contributions
used-0.490
eyed-0.452
uts-0.427
 cheering-0.425
all-0.411
Token castle
Token ID16669
Feature activation+6.49e+00

Pos logit contributions
 attic+0.002
 fort+0.002
 garage+0.002
 closet+0.002
 folder+0.002
Neg logit contributions
used-0.005
eyed-0.004
 cheering-0.004
uts-0.004
all-0.004
Token.
Token ID13
Feature activation+3.77e+00

Pos logit contributions
used+0.119
eyed+0.097
uts+0.095
but+0.085
all+0.083
Neg logit contributions
 attic-0.304
 fort-0.303
 sheet-0.297
 folder-0.297
 garage-0.292
Token brown
Token ID7586
Feature activation-2.25e-01

Pos logit contributions
 fort+0.084
 folder+0.082
 attic+0.080
 journal+0.080
 sheet+0.079
Neg logit contributions
used-0.086
eyed-0.078
uts-0.078
 cheering-0.075
all-0.074
Token and
Token ID290
Feature activation-6.94e+00

Pos logit contributions
 fort+0.008
 attic+0.008
 sheet+0.008
 folder+0.008
 closet+0.008
Neg logit contributions
used-0.007
eyed-0.006
uts-0.006
unch-0.006
but-0.006
Token tall
Token ID7331
Feature activation-1.59e+00

Pos logit contributions
 fort+0.184
 folder+0.178
 attic+0.175
 garage+0.170
 journal+0.169
Neg logit contributions
used-0.239
uts-0.225
 cheering-0.222
-0.214
't-0.213
Token.
Token ID13
Feature activation+4.88e+00

Pos logit contributions
 fort+0.069
 attic+0.068
 sheet+0.066
 folder+0.065
 closet+0.065
Neg logit contributions
used-0.040
eyed-0.034
unch-0.031
uts-0.030
but-0.029
Token It
Token ID632
Feature activation-2.66e+00

Pos logit contributions
used+0.187
eyed+0.172
 believing+0.165
seek+0.155
ting+0.151
Neg logit contributions
 garage-0.138
 attic-0.135
 fort-0.134
 trash-0.132
 sheet-0.128
Token has
Token ID468
Feature activation-1.07e+00

Pos logit contributions
 folder+0.081
 attic+0.079
 fort+0.076
 journal+0.076
pron+0.076
Neg logit contributions
used-0.088
:-0.076
't-0.076
eyed-0.076
uts-0.074
Token a
Token ID257
Feature activation-9.07e+00

Pos logit contributions
 fort+0.042
 folder+0.041
 journal+0.039
pron+0.039
 attic+0.039
Neg logit contributions
used-0.024
uts-0.024
eyed-0.023
seven-0.021
ucked-0.021
Token sign
Token ID1051
Feature activation+6.15e+00

Pos logit contributions
 fort+0.203
 attic+0.202
 folder+0.190
 sheet+0.185
 garage+0.184
Neg logit contributions
used-0.382
eyed-0.350
uts-0.347
 cheering-0.335
unch-0.331
Token on
Token ID319
Feature activation-1.49e-01

Pos logit contributions
used+0.163
eyed+0.133
unch+0.131
all+0.127
uts+0.121
Neg logit contributions
 attic-0.253
 folder-0.250
 sheet-0.245
 fort-0.245
 closet-0.244
Token it
Token ID340
Feature activation-1.54e+00

Pos logit contributions
 fort+0.006
 attic+0.006
 folder+0.006
 sheet+0.006
 garage+0.006
Neg logit contributions
used-0.004
eyed-0.003
uts-0.003
all-0.003
but-0.003
Token.
Token ID13
Feature activation+5.12e+00

Pos logit contributions
 attic+0.063
 fort+0.062
 closet+0.061
 sheet+0.061
 folder+0.061
Neg logit contributions
used-0.041
eyed-0.036
unch-0.034
 cheering-0.032
ucking-0.032
Token
Token ID198
Feature activation+3.46e+00

Pos logit contributions
used+0.170
ting+0.132
eyed+0.127
 cheering+0.115
 joining+0.111
Neg logit contributions
 attic-0.284
 fort-0.274
 garage-0.270
 folder-0.269
 closet-0.269
Token
Token ID198
Feature activation+3.04e+00

Pos logit contributions
used+0.106
ting+0.078
eyed+0.069
aring+0.068
ucked+0.068
Neg logit contributions
 attic-0.132
 folder-0.126
 sheet-0.125
 fort-0.124
 garage-0.123
TokenEventually
Token ID28724
Feature activation+4.17e+00

Pos logit contributions
used+0.116
eyed+0.102
uts+0.100
unch+0.095
 cheering+0.094
Neg logit contributions
 attic-0.088
 fort-0.086
 folder-0.085
 sheet-0.084
 garage-0.082
Token,
Token ID11
Feature activation-1.67e+00

Pos logit contributions
used+0.079
wards+0.060
 cheering+0.058
all+0.056
ting+0.056
Neg logit contributions
 fort-0.194
 attic-0.192
 garage-0.191
 journal-0.186
 folder-0.185
Token Lily
Token ID20037
Feature activation-6.61e+00

Pos logit contributions
 fort+0.062
 attic+0.060
 garage+0.059
 journal+0.058
 sheet+0.056
Neg logit contributions
used-0.044
but-0.041
wards-0.040
all-0.040
ting-0.040
Token and
Token ID290
Feature activation-7.78e+00

Pos logit contributions
 attic+0.236
 fort+0.231
 folder+0.224
 sheet+0.220
 journal+0.218
Neg logit contributions
used-0.201
eyed-0.181
uts-0.177
all-0.167
 cheering-0.164
Token her
Token ID607
Feature activation-1.31e+01

Pos logit contributions
 fort+0.285
 attic+0.268
 garage+0.266
 sheet+0.263
 folder+0.255
Neg logit contributions
used-0.233
eyed-0.201
all-0.198
 cheering-0.191
uts-0.190
Token mom
Token ID1995
Feature activation-1.38e+00

Pos logit contributions
 attic+0.389
 garage+0.373
 fort+0.372
 folder+0.371
 sheet+0.369
Neg logit contributions
used-0.399
 cheering-0.384
uts-0.363
eyed-0.359
 joining-0.341
Token had
Token ID550
Feature activation-5.07e-01

Pos logit contributions
 fort+0.042
 attic+0.042
 folder+0.040
 sheet+0.040
 garage+0.039
Neg logit contributions
used-0.049
eyed-0.045
uts-0.043
unch-0.041
all-0.041
Token to
Token ID284
Feature activation-3.31e+00

Pos logit contributions
 fort+0.017
 attic+0.017
 folder+0.017
 sheet+0.017
 garage+0.016
Neg logit contributions
used-0.016
eyed-0.014
uts-0.014
all-0.013
unch-0.013
Token leave
Token ID2666
Feature activation+5.07e+00

Pos logit contributions
 fort+0.105
 attic+0.103
 folder+0.101
 sheet+0.099
 closet+0.096
Neg logit contributions
used-0.114
eyed-0.103
uts-0.096
all-0.095
 cheering-0.095
Token the
Token ID262
Feature activation-1.13e+01

Pos logit contributions
 fort+0.109
 folder+0.107
 attic+0.103
 sheet+0.102
 proper+0.101
Neg logit contributions
used-0.084
eyed-0.073
 cheering-0.071
all-0.069
uts-0.067
Token cold
Token ID4692
Feature activation+8.56e-01

Pos logit contributions
kin+0.301
 folder+0.296
pron+0.291
 sheet+0.282
 shelf+0.280
Neg logit contributions
-0.361
used-0.357
eyed-0.353
ting-0.344
 cheering-0.332
Token.
Token ID13
Feature activation+4.73e+00

Pos logit contributions
used+0.034
eyed+0.031
uts+0.030
unch+0.029
all+0.029
Neg logit contributions
 fort-0.022
 attic-0.022
 folder-0.022
 sheet-0.021
 garage-0.021
Token One
Token ID1881
Feature activation-6.96e-02

Pos logit contributions
used+0.148
eyed+0.131
uts+0.127
 cheering+0.118
qu+0.117
Neg logit contributions
 attic-0.174
 fort-0.170
 garage-0.165
 closet-0.165
 sheet-0.164
Token day
Token ID1110
Feature activation+5.81e+00

Pos logit contributions
 folder+0.002
 fort+0.002
 journal+0.002
 drawer+0.002
 attic+0.002
Neg logit contributions
used-0.002
all-0.002
eyed-0.002
-0.002
unch-0.002
Token she
Token ID673
Feature activation-7.45e+00

Pos logit contributions
used+0.135
eyed+0.108
uts+0.107
but+0.098
unch+0.098
Neg logit contributions
 fort-0.255
 attic-0.251
 folder-0.244
 garage-0.243
 sheet-0.243
Token put
Token ID1234
Feature activation+5.37e+00

Pos logit contributions
 folder+0.296
 puzzle+0.294
 fort+0.292
 journal+0.289
 attic+0.287
Neg logit contributions
used-0.146
eyed-0.143
-0.140
ed-0.131
all-0.128
Token her
Token ID607
Feature activation-1.16e+01

Pos logit contributions
used+0.044
ting+0.040
icked+0.030
but+0.028
all+0.027
Neg logit contributions
 fort-0.295
 garage-0.290
 attic-0.284
 spaceship-0.280
 folder-0.277
Token warm
Token ID5814
Feature activation-1.48e-01

Pos logit contributions
 fort+0.270
 attic+0.258
 folder+0.252
 garage+0.252
 guy+0.248
Neg logit contributions
used-0.385
uts-0.339
eyed-0.327
aro-0.327
but-0.326
Token m
Token ID285
Feature activation-3.45e-01

Pos logit contributions
 attic+0.003
 fort+0.003
 folder+0.003
 garage+0.003
 puzzle+0.003
Neg logit contributions
used-0.006
eyed-0.005
uts-0.005
ting-0.005
but-0.005
Tokenittens
Token ID34978
Feature activation-2.42e-01

Pos logit contributions
 fort+0.009
 sheet+0.009
 attic+0.009
 closet+0.009
 drawer+0.008
Neg logit contributions
used-0.013
 cheering-0.012
eyed-0.012
ting-0.012
 joining-0.012
Token
Token ID198
Feature activation+3.86e+00

Pos logit contributions
used+0.194
ting+0.162
eyed+0.152
 cheering+0.148
 joining+0.141
Neg logit contributions
 attic-0.298
 fort-0.293
 garage-0.292
 drawer-0.283
 closet-0.282
Token
Token ID198
Feature activation+3.36e+00

Pos logit contributions
used+0.092
uts+0.071
ting+0.068
icked+0.067
overed+0.065
Neg logit contributions
 garage-0.153
 sheet-0.152
 shelf-0.152
 folder-0.152
 attic-0.151
TokenThe
Token ID464
Feature activation-1.15e+01

Pos logit contributions
used+0.115
uts+0.099
eyed+0.097
icked+0.094
unch+0.093
Neg logit contributions
 attic-0.102
 folder-0.102
 fort-0.101
 garage-0.101
 sheet-0.100
Token squirrel
Token ID33039
Feature activation+1.79e+00

Pos logit contributions
 fort+0.354
 attic+0.351
 folder+0.341
 sheet+0.339
 garage+0.332
Neg logit contributions
used-0.409
eyed-0.367
uts-0.358
unch-0.340
but-0.339
Token shook
Token ID14682
Feature activation+6.41e+00

Pos logit contributions
used+0.053
eyed+0.047
uts+0.046
unch+0.043
but+0.043
Neg logit contributions
 fort-0.064
 attic-0.063
 folder-0.062
 sheet-0.062
 garage-0.060
Token his
Token ID465
Feature activation-1.25e+01

Pos logit contributions
used+0.241
eyed+0.217
uts+0.214
unch+0.205
but+0.202
Neg logit contributions
 fort-0.184
 attic-0.182
 folder-0.178
 sheet-0.177
 garage-0.172
Token head
Token ID1182
Feature activation+6.49e+00

Pos logit contributions
 attic+0.273
 folder+0.262
 fort+0.261
 sheet+0.253
 garage+0.247
Neg logit contributions
used-0.559
eyed-0.510
uts-0.490
unch-0.473
all-0.467
Token and
Token ID290
Feature activation-5.05e+00

Pos logit contributions
used+0.199
eyed+0.170
qu+0.168
uts+0.166
unch+0.166
Neg logit contributions
 sheet-0.211
 folder-0.211
 fort-0.209
 drawer-0.206
 closet-0.205
Token replied
Token ID8712
Feature activation+5.95e+00

Pos logit contributions
 attic+0.206
 fort+0.205
 folder+0.200
 sheet+0.197
 garage+0.196
Neg logit contributions
used-0.132
eyed-0.109
uts-0.106
all-0.100
 cheering-0.099
Token:
Token ID25
Feature activation-5.58e-01

Pos logit contributions
used+0.170
eyed+0.155
 cheering+0.140
unch+0.140
uts+0.136
Neg logit contributions
 folder-0.213
 journal-0.203
 sheet-0.200
 fort-0.199
 drawer-0.199
Token "
Token ID366
Feature activation+3.91e+00

Pos logit contributions
pron+0.015
 folder+0.015
 drawer+0.014
 garage+0.014
 attic+0.014
Neg logit contributions
used-0.020
eyed-0.019
uts-0.018
beh-0.018
qu-0.017
Token out
Token ID503
Feature activation+1.32e+00

Pos logit contributions
used+0.182
eyed+0.168
uts+0.155
 cheering+0.154
ting+0.152
Neg logit contributions
 attic-0.247
 fort-0.245
 garage-0.237
 sheet-0.231
 closet-0.231
Token his
Token ID465
Feature activation-1.17e+01

Pos logit contributions
used+0.033
eyed+0.030
uts+0.029
 cheering+0.029
all+0.029
Neg logit contributions
 attic-0.052
 fort-0.051
 garage-0.050
 closet-0.048
 folder-0.048
Token camera
Token ID4676
Feature activation+7.91e+00

Pos logit contributions
 attic+0.288
 garage+0.264
 fort+0.249
 closet+0.239
 folder+0.231
Neg logit contributions
used-0.437
 cheering-0.383
eyed-0.375
uts-0.374
:-0.369
Token to
Token ID284
Feature activation-1.90e+00

Pos logit contributions
used+0.156
uts+0.136
eyed+0.132
all+0.119
but+0.118
Neg logit contributions
 attic-0.355
 fort-0.351
 folder-0.342
 sheet-0.341
 garage-0.340
Token take
Token ID1011
Feature activation+6.07e+00

Pos logit contributions
 fort+0.063
 attic+0.062
 folder+0.061
 sheet+0.060
 garage+0.059
Neg logit contributions
used-0.063
eyed-0.056
uts-0.054
unch-0.051
all-0.051
Token a
Token ID257
Feature activation-7.45e+00

Pos logit contributions
used+0.163
eyed+0.147
uts+0.137
all+0.132
 cheering+0.130
Neg logit contributions
 fort-0.238
 attic-0.237
 garage-0.226
 sheet-0.226
 folder-0.226
Token photo
Token ID4590
Feature activation+9.59e+00

Pos logit contributions
 attic+0.192
 fort+0.177
 garage+0.169
 closet+0.167
 folder+0.161
Neg logit contributions
used-0.295
eyed-0.276
uts-0.267
 cheering-0.267
unch-0.265
Token of
Token ID286
Feature activation-3.71e+00

Pos logit contributions
used+0.175
eyed+0.145
uts+0.143
but+0.126
all+0.126
Neg logit contributions
 attic-0.450
 fort-0.447
 folder-0.436
 sheet-0.435
 garage-0.432
Token the
Token ID262
Feature activation-1.20e+01

Pos logit contributions
 fort+0.122
 attic+0.120
 folder+0.117
 sheet+0.116
 garage+0.114
Neg logit contributions
used-0.123
eyed-0.110
uts-0.107
all-0.102
 cheering-0.102
Token tasty
Token ID25103
Feature activation+2.36e-01

Pos logit contributions
 attic+0.270
 fort+0.264
 proper+0.251
 garage+0.248
 sheet+0.243
Neg logit contributions
used-0.455
 cheering-0.439
ting-0.415
eyed-0.406
 joining-0.406
Token dish
Token ID9433
Feature activation+5.17e+00

Pos logit contributions
used+0.008
 cheering+0.008
uts+0.008
ting+0.007
all+0.007
Neg logit contributions
 attic-0.006
 garage-0.006
 fort-0.006
 closet-0.005
 proper-0.005
Token her
Token ID607
Feature activation-1.09e+01

Pos logit contributions
 fort+0.075
 attic+0.071
 sheet+0.070
 folder+0.069
 garage+0.069
Neg logit contributions
used-0.081
eyed-0.073
uts-0.071
 cheering-0.068
all-0.068
Token friends
Token ID2460
Feature activation+4.83e+00

Pos logit contributions
 fort+0.292
 attic+0.286
 sheet+0.278
 folder+0.276
 garage+0.271
Neg logit contributions
used-0.427
eyed-0.386
uts-0.378
but-0.361
all-0.359
Token.
Token ID13
Feature activation+4.86e+00

Pos logit contributions
used+0.117
eyed+0.096
unch+0.094
uts+0.091
overed+0.087
Neg logit contributions
 folder-0.210
 attic-0.208
 fort-0.205
 closet-0.203
 garage-0.202
Token She
Token ID1375
Feature activation-7.89e+00

Pos logit contributions
used+0.108
eyed+0.094
uts+0.091
but+0.086
all+0.086
Neg logit contributions
 fort-0.208
 sheet-0.202
 attic-0.200
 folder-0.196
 journal-0.193
Token had
Token ID550
Feature activation-8.40e-01

Pos logit contributions
 fort+0.290
 attic+0.275
 folder+0.272
 sheet+0.269
 journal+0.261
Neg logit contributions
used-0.206
eyed-0.203
uts-0.196
all-0.192
:-0.191
Token a
Token ID257
Feature activation-9.30e+00

Pos logit contributions
 fort+0.036
 sheet+0.033
 folder+0.032
 garage+0.032
 attic+0.032
Neg logit contributions
used-0.018
eyed-0.017
uts-0.017
 cheering-0.016
-0.015
Token large
Token ID1588
Feature activation-5.56e+00

Pos logit contributions
 fort+0.243
 attic+0.232
icer+0.224
wy+0.224
issy+0.216
Neg logit contributions
 cheering-0.270
used-0.261
-0.256
:-0.252
 deciding-0.250
Token family
Token ID1641
Feature activation+2.37e+00

Pos logit contributions
 proper+0.159
 fort+0.148
issy+0.143
wy+0.142
eeks+0.141
Neg logit contributions
used-0.170
:-0.157
 cheering-0.153
 --0.142
 hugging-0.142
Token with
Token ID351
Feature activation-9.87e-01

Pos logit contributions
used+0.057
eyed+0.048
uts+0.046
unch+0.042
but+0.042
Neg logit contributions
 fort-0.100
 attic-0.099
 folder-0.098
 sheet-0.097
 garage-0.096
Token many
Token ID867
Feature activation-3.72e+00

Pos logit contributions
 fort+0.032
 attic+0.030
 sheet+0.030
 folder+0.029
 garage+0.029
Neg logit contributions
used-0.034
eyed-0.030
uts-0.030
 cheering-0.029
ting-0.028
Token people
Token ID661
Feature activation+2.31e+00

Pos logit contributions
 fort+0.095
 attic+0.091
 sheet+0.090
 folder+0.088
 garage+0.087
Neg logit contributions
used-0.147
eyed-0.135
uts-0.132
 cheering-0.131
ting-0.125