TOP ACTIVATIONS
MAX = 14.411

the most perfect present ! Sy lves ter was so happy
he had never surrendered to Sy lves ter . <|endoftext|> Once
you singing about jail ?" Sy lves ter asked .
Bob continued to examine the neighborhood looking for the surprise .
She liked to explore her neighborhood . One day , she
rode it all around the neighborhood . He even rode it
ety saw a cat named Sy lves ter trying to catch
She walked around the neighborhood and finally found the car
talked to people in the neighborhood . Finally , they found
continued to zoom around the neighbourhood with his sc ooter .
shiny . People in the neighbourhood would come to look at
Mary ’ s neighborhood . All the kids would
to do . Tim imagined playing with his friends , running
L ily and Max were playing soccer in the park .
to do so and continued playing with his toys .
, while Tim my was playing with his new sp ade
named Lily . She loved playing with her friends and being
again and went back to playing with his ball . The
the road . We were playing . We were not careful
L ila and Ben were playing in the park . They

MIN ACTIVATIONS
MIN = -16.681

in need . Her thought fulness always helped others succeed .
and enthusiastic that his forget fulness had an unfortunate ending .
bird 's determination and grace fulness . She applauded it and
: Kind ness and thought fulness is a beautiful thing !
the story is that grace fulness is better than winning all
a little kindness and thought fulness goes a long way .
the time . With grace fulness , everyone can be happy
pencil box . With fear fulness , he launched it out
wanted to show his thank fulness to his family . So
understanding only came with cheer fulness . <|endoftext|> Once upon a
glad he overcame his fear fulness . They all played and
place , full of cheer fulness . In
play a game called " war ". This game was just
and tug - of - war . The fox was happy
a tug - of - war game . They had lots
of her daughter 's resource fulness and smiled . She grate
play tug - of - war ," said the oct opus
play tug - of - war with it .
play tug - of - war !" The birds
a little bit of thought fulness , dreams can come true

INTERVAL 6.638 - 14.411
CONTAINS 2.617%

. " Can I try it , Sam ?"
songs for her family and friends . She is excited to
are friends , we are friends , we are friends ,
mom and dad with the lawn . They water the flowers
day , while she was playing in her backyard , she

INTERVAL -1.135 - 6.638
CONTAINS 67.556%

" I can call your name . What is your name
" O w , ow , ow !" Mia
to play . Emma saw the green t eddy bear and
sun went down . It was a lovely day at the
." Ben tried to push the bath robe in

INTERVAL -8.908 - -1.135
CONTAINS 29.336%

He says he will be nice . He says he will
ball . You can throw it and he will fetch it
from a friend . <|endoftext|> Once upon a time , there
loved to drink juice and water all the time . One
. It makes a loud noise . Anna likes the noise

INTERVAL -16.681 - -8.908
CONTAINS 0.490%

Amy was scared but she slowly moved closer to the goose
quickly but Jill was counting slowly . She looked around and
some cookies with the brown butter !" <|endoftext|> Mary was in
pick the herbs . Her bs were plants that smelled good
her dad added some noisy spices . He said , "
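
The four INTERVAL ranges above look like equal-width bins spanning MIN to MAX. A minimal sketch under that assumption (the function name `interval_edges` and the bin count of four are illustrative, not taken from the tool that produced this dump) reproduces the listed boundaries and totals the CONTAINS percentages:

```python
def interval_edges(lo, hi, n_bins=4):
    """Return n_bins + 1 equally spaced edges from lo to hi (assumed binning)."""
    width = (hi - lo) / n_bins
    return [lo + i * width for i in range(n_bins + 1)]


# Values copied from the MIN / MAX lines of this dump.
MIN_ACT = -16.681
MAX_ACT = 14.411

edges = interval_edges(MIN_ACT, MAX_ACT)

# CONTAINS percentages as listed, ordered from the lowest interval to the highest.
shares = [0.490, 29.336, 67.556, 2.617]

for (left, right), share in zip(zip(edges, edges[1:]), shares):
    print(f"{left:8.3f} to {right:8.3f}: {share:6.3f}%")

# The listed shares should add up to roughly 100%, up to rounding in the dump.
total_share = sum(shares)
print(f"total: {total_share:.3f}%")
```

Under this reading, about two thirds of tokens fall in the interval from -1.135 to 6.638, while the top interval that holds the " Sy", " neighborhood", and " playing" examples covers under 3% of tokens.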
Token the
Token ID262
Feature activation+2.05e-02

Pos logit contributions
 playing+0.302
 neighborhood+0.282
 office+0.281
ich+0.278
 living+0.264
Neg logit contributions
le-0.418
war-0.417
umbs-0.412
bs-0.410
of-0.400
Token most
Token ID749
Feature activation+8.45e+00

Pos logit contributions
 playing+0.001
 office+0.001
 neighborhood+0.001
ich+0.001
 living+0.001
Neg logit contributions
le-0.002
war-0.002
bread-0.002
of-0.002
umbs-0.002
Token perfect
Token ID2818
Feature activation+2.44e+00

Pos logit contributions
 playing+0.518
 office+0.460
 living+0.452
 neighborhood+0.449
ich+0.446
Neg logit contributions
war-0.605
le-0.604
umbs-0.590
bs-0.586
of-0.585
Token present
Token ID1944
Feature activation-3.68e+00

Pos logit contributions
 playing+0.144
ich+0.129
 neighborhood+0.128
 office+0.128
 living+0.124
Neg logit contributions
le-0.173
war-0.170
of-0.166
umbs-0.165
bs-0.162
Token!
Token ID0
Feature activation+3.45e+00

Pos logit contributions
le+0.235
umbs+0.229
of+0.228
war+0.227
bs+0.225
Neg logit contributions
 playing-0.243
 neighborhood-0.227
 office-0.220
ich-0.218
 living-0.214
Token Sy
Token ID1632
Feature activation+1.44e+01

Pos logit contributions
 playing+0.207
 office+0.184
 living+0.181
ich+0.180
 neighborhood+0.179
Neg logit contributions
war-0.251
umbs-0.247
of-0.246
le-0.243
bs-0.243
Tokenlves
Token ID31018
Feature activation+6.70e+00

Pos logit contributions
 playing+0.817
 office+0.742
 living+0.731
 neighborhood+0.702
 backyard+0.694
Neg logit contributions
le-1.139
of-1.126
war-1.111
les-1.100
bs-1.069
Tokenter
Token ID353
Feature activation+7.61e+00

Pos logit contributions
 playing+0.280
 office+0.238
 neighborhood+0.236
 classroom+0.223
 backyard+0.215
Neg logit contributions
le-0.632
les-0.603
of-0.597
fulness-0.586
war-0.579
Token was
Token ID373
Feature activation+5.48e+00

Pos logit contributions
 playing+0.371
 office+0.335
ich+0.329
 neighborhood+0.326
 living+0.318
Neg logit contributions
le-0.618
of-0.609
bs-0.598
war-0.592
les-0.585
Token so
Token ID523
Feature activation+3.14e+00

Pos logit contributions
 neighborhood+0.262
ich+0.261
 playing+0.260
 office+0.255
 factory+0.242
Neg logit contributions
le-0.406
umbs-0.403
war-0.399
of-0.396
bs-0.396
Token happy
Token ID3772
Feature activation-1.09e+00

Pos logit contributions
 playing+0.185
 office+0.174
 neighborhood+0.169
ich+0.168
 living+0.166
Neg logit contributions
war-0.221
bs-0.221
umbs-0.218
le-0.217
of-0.217
Token he
Token ID339
Feature activation+8.78e-01

Pos logit contributions
 playing+0.134
 neighborhood+0.129
 office+0.128
ich+0.126
 living+0.121
Neg logit contributions
bs-0.128
fulness-0.128
le-0.127
of-0.127
war-0.124
Token had
Token ID550
Feature activation+1.45e+00

Pos logit contributions
 playing+0.057
 office+0.056
ich+0.053
 neighborhood+0.053
 classroom+0.051
Neg logit contributions
le-0.058
of-0.057
bs-0.056
war-0.054
les-0.053
Token never
Token ID1239
Feature activation+2.89e+00

Pos logit contributions
ich+0.082
 office+0.081
 factory+0.081
 playing+0.080
 neighborhood+0.079
Neg logit contributions
war-0.097
of-0.093
bs-0.092
le-0.091
forts-0.091
Token surrendered
Token ID29209
Feature activation-3.38e+00

Pos logit contributions
 playing+0.171
ich+0.165
 office+0.165
 neighborhood+0.159
 living+0.155
Neg logit contributions
war-0.198
of-0.195
le-0.194
bs-0.191
forts-0.187
Token to
Token ID284
Feature activation-5.87e-01

Pos logit contributions
war+0.236
le+0.234
of+0.229
umbs+0.228
bs+0.227
Neg logit contributions
 playing-0.212
 office-0.191
 living-0.187
 neighborhood-0.187
ich-0.185
Token Sy
Token ID1632
Feature activation+1.41e+01

Pos logit contributions
of+0.035
le+0.035
war+0.034
fulness+0.034
bs+0.034
Neg logit contributions
ich-0.037
 playing-0.036
 neighborhood-0.036
 office-0.035
 living-0.034
Tokenlves
Token ID31018
Feature activation+6.38e+00

Pos logit contributions
 playing+0.795
 office+0.708
 living+0.707
 neighborhood+0.678
ich+0.671
Neg logit contributions
le-1.106
war-1.092
of-1.085
les-1.055
bs-1.038
Tokenter
Token ID353
Feature activation+7.32e+00

Pos logit contributions
 playing+0.275
 neighborhood+0.250
 classroom+0.233
 office+0.228
 backyard+0.220
Neg logit contributions
le-0.594
of-0.561
war-0.552
les-0.548
fulness-0.543
Token.
Token ID13
Feature activation-9.34e-01

Pos logit contributions
 playing+0.510
 neighborhood+0.481
 office+0.471
 living+0.469
ich+0.468
Neg logit contributions
le-0.447
war-0.425
of-0.422
bs-0.406
umbs-0.402
Token<|endoftext|>
Token ID50256
Feature activation-5.51e-01

Pos logit contributions
forts+0.051
war+0.050
of+0.050
bs+0.049
umbs+0.049
Neg logit contributions
 playing-0.068
 seeker-0.062
 office-0.061
ich-0.061
 living-0.060
TokenOnce
Token ID7454
Feature activation-1.95e+00

Pos logit contributions
war+0.038
umbs+0.038
of+0.037
le+0.037
bs+0.036
Neg logit contributions
 playing-0.034
 office-0.032
ich-0.031
 living-0.030
 neighborhood-0.030
Token you
Token ID345
Feature activation+2.72e+00

Pos logit contributions
of+0.209
bs+0.198
war+0.197
forts+0.190
umbs+0.188
Neg logit contributions
 office-0.198
 neighborhood-0.194
 playing-0.187
 living-0.181
 backyard-0.181
Token singing
Token ID13777
Feature activation+1.68e+00

Pos logit contributions
 office+0.121
 neighborhood+0.121
 playing+0.114
 living+0.113
 backyard+0.109
Neg logit contributions
le-0.214
umbs-0.213
of-0.210
war-0.205
bs-0.205
Token about
Token ID546
Feature activation-1.14e+00

Pos logit contributions
 playing+0.088
 office+0.080
 neighborhood+0.079
ich+0.078
 living+0.077
Neg logit contributions
le-0.134
war-0.131
of-0.129
bs-0.128
umbs-0.127
Token jail
Token ID7356
Feature activation+8.58e+00

Pos logit contributions
of+0.076
le+0.073
les+0.070
fulness+0.069
bread+0.068
Neg logit contributions
 neighborhood-0.061
 office-0.060
 seeker-0.059
 backyard-0.056
 classroom-0.056
Token?"
Token ID1701
Feature activation+5.15e+00

Pos logit contributions
 playing+0.467
 neighborhood+0.428
 living+0.423
ich+0.420
 office+0.419
Neg logit contributions
le-0.662
of-0.632
war-0.632
bs-0.620
les-0.620
Token Sy
Token ID1632
Feature activation+1.40e+01

Pos logit contributions
 office+0.294
 playing+0.293
 living+0.283
 neighborhood+0.281
ich+0.280
Neg logit contributions
bs-0.359
war-0.350
le-0.346
of-0.345
forts-0.337
Tokenlves
Token ID31018
Feature activation+6.32e+00

Pos logit contributions
 playing+0.932
 living+0.836
 office+0.825
 neighborhood+0.820
 backyard+0.787
Neg logit contributions
le-0.982
of-0.948
war-0.946
bs-0.929
les-0.920
Tokenter
Token ID353
Feature activation+7.27e+00

Pos logit contributions
 playing+0.259
 neighborhood+0.227
 office+0.208
 backyard+0.195
 classroom+0.193
Neg logit contributions
le-0.607
of-0.578
les-0.574
war-0.564
ck-0.564
Token asked
Token ID1965
Feature activation-2.40e-01

Pos logit contributions
 playing+0.491
 office+0.469
 neighborhood+0.465
 living+0.463
 seeker+0.442
Neg logit contributions
le-0.477
of-0.446
bs-0.444
war-0.431
ck-0.426
Token.
Token ID13
Feature activation-9.74e-01

Pos logit contributions
le+0.013
of+0.012
war+0.012
bs+0.012
forts+0.011
Neg logit contributions
ich-0.018
 neighborhood-0.018
 playing-0.018
 office-0.018
 living-0.017
Token (space)
Token ID220
Feature activation-3.30e+00

Pos logit contributions
war+0.061
of+0.061
forts+0.060
bs+0.059
l+0.059
Neg logit contributions
 office-0.058
 playing-0.056
 seeker-0.056
ich-0.054
 living-0.054
Token Bob
Token ID5811
Feature activation+1.00e+00

Pos logit contributions
le+0.019
of+0.018
war+0.018
les+0.018
umbs+0.018
Neg logit contributions
 playing-0.018
 office-0.017
ich-0.016
 living-0.015
 neighborhood-0.015
Token continued
Token ID3767
Feature activation+3.08e+00

Pos logit contributions
 playing+0.060
 office+0.056
ich+0.053
 neighborhood+0.053
 living+0.053
Neg logit contributions
le-0.073
of-0.071
war-0.071
umbs-0.069
bs-0.069
Token to
Token ID284
Feature activation-1.18e-01

Pos logit contributions
 playing+0.165
 office+0.155
ich+0.155
 neighborhood+0.152
 living+0.145
Neg logit contributions
war-0.226
le-0.224
bs-0.222
of-0.221
umbs-0.218
Token examine
Token ID10716
Feature activation+1.87e+00

Pos logit contributions
le+0.008
war+0.008
of+0.008
bs+0.008
umbs+0.008
Neg logit contributions
 playing-0.007
ich-0.007
 office-0.007
 neighborhood-0.007
 living-0.006
Token the
Token ID262
Feature activation+1.72e-02

Pos logit contributions
 playing+0.113
 living+0.098
 office+0.096
 neighborhood+0.095
ich+0.090
Neg logit contributions
war-0.140
umbs-0.139
of-0.134
forts-0.133
le-0.133
Token neighborhood
Token ID6232
Feature activation+1.40e+01

Pos logit contributions
 playing+0.001
 office+0.001
ich+0.001
 neighborhood+0.001
 living+0.001
Neg logit contributions
le-0.001
war-0.001
of-0.001
umbs-0.001
bs-0.001
Token looking
Token ID2045
Feature activation+6.26e+00

Pos logit contributions
 playing+0.890
 office+0.788
 living+0.781
 neighborhood+0.772
ich+0.759
Neg logit contributions
war-0.974
le-0.959
umbs-0.949
of-0.940
bs-0.935
Token for
Token ID329
Feature activation+6.75e-01

Pos logit contributions
 playing+0.350
 living+0.302
 office+0.300
 neighborhood+0.294
ich+0.282
Neg logit contributions
war-0.491
umbs-0.482
le-0.480
of-0.476
bs-0.469
Token the
Token ID262
Feature activation-1.08e-01

Pos logit contributions
 playing+0.041
 living+0.035
 office+0.035
 neighborhood+0.035
ich+0.033
Neg logit contributions
war-0.051
umbs-0.050
le-0.049
bs-0.048
of-0.048
Token surprise
Token ID5975
Feature activation-1.35e+00

Pos logit contributions
war+0.008
le+0.008
of+0.007
umbs+0.007
bs+0.007
Neg logit contributions
 playing-0.006
ich-0.005
 office-0.005
 neighborhood-0.005
 living-0.005
Token.
Token ID13
Feature activation-6.42e-01

Pos logit contributions
war+0.090
le+0.089
umbs+0.088
of+0.087
bs+0.087
Neg logit contributions
 playing-0.089
 office-0.080
 living-0.079
 neighborhood-0.078
ich-0.077
Token She
Token ID1375
Feature activation+2.08e+00

Pos logit contributions
war+0.078
of+0.078
umbs+0.078
bs+0.078
forts+0.073
Neg logit contributions
 playing-0.076
ich-0.072
 office-0.072
 neighborhood-0.069
 living-0.068
Token liked
Token ID8288
Feature activation+5.11e+00

Pos logit contributions
 playing+0.125
 office+0.111
ich+0.111
 neighborhood+0.109
 living+0.108
Neg logit contributions
le-0.149
war-0.147
bs-0.146
of-0.146
umbs-0.143
Token to
Token ID284
Feature activation-1.30e-01

Pos logit contributions
 playing+0.313
 office+0.283
 neighborhood+0.280
ich+0.277
 living+0.276
Neg logit contributions
war-0.356
le-0.350
of-0.346
umbs-0.346
bs-0.344
Token explore
Token ID7301
Feature activation+5.16e+00

Pos logit contributions
war+0.010
umbs+0.010
le+0.010
of+0.010
forts+0.009
Neg logit contributions
 playing-0.008
 office-0.007
 living-0.006
 neighborhood-0.006
 exploring-0.006
Token her
Token ID607
Feature activation+7.26e-01

Pos logit contributions
 playing+0.323
 living+0.283
 office+0.282
 neighborhood+0.277
ich+0.268
Neg logit contributions
war-0.374
umbs-0.368
le-0.364
of-0.358
bs-0.357
Token neighborhood
Token ID6232
Feature activation+1.40e+01

Pos logit contributions
 playing+0.033
ich+0.031
 office+0.029
 Sy+0.027
 saving+0.027
Neg logit contributions
bs-0.057
le-0.056
of-0.055
les-0.055
war-0.053
Token.
Token ID13
Feature activation-5.62e-01

Pos logit contributions
 playing+0.925
ich+0.888
 office+0.849
 neighborhood+0.844
 living+0.832
Neg logit contributions
le-0.868
war-0.830
of-0.825
bs-0.816
umbs-0.808
Token One
Token ID1881
Feature activation-4.44e+00

Pos logit contributions
war+0.035
of+0.034
umbs+0.034
bs+0.034
le+0.033
Neg logit contributions
 playing-0.039
 office-0.036
ich-0.036
 living-0.035
 neighborhood-0.034
Token day
Token ID1110
Feature activation+1.07e-01

Pos logit contributions
le+0.358
war+0.354
bs+0.352
of+0.351
umbs+0.339
Neg logit contributions
 playing-0.226
ich-0.202
 office-0.196
 neighborhood-0.196
 living-0.191
Token,
Token ID11
Feature activation+3.11e+00

Pos logit contributions
 playing+0.005
 office+0.005
ich+0.005
 neighborhood+0.005
 living+0.005
Neg logit contributions
bs-0.009
le-0.008
war-0.008
of-0.008
bread-0.008
Token she
Token ID673
Feature activation+1.10e+00

Pos logit contributions
 playing+0.199
 office+0.194
 neighborhood+0.192
 living+0.190
ich+0.186
Neg logit contributions
bs-0.199
of-0.194
le-0.193
war-0.193
bread-0.184
Token rode
Token ID22075
Feature activation+3.00e+00

Pos logit contributions
bs+0.250
war+0.250
le+0.249
of+0.244
forts+0.237
Neg logit contributions
 playing-0.226
ich-0.211
 office-0.210
 neighborhood-0.208
 living-0.202
Token it
Token ID340
Feature activation-5.78e+00

Pos logit contributions
 playing+0.192
 living+0.167
 office+0.165
 neighborhood+0.163
ich+0.157
Neg logit contributions
umbs-0.210
war-0.209
le-0.206
of-0.201
bread-0.201
Token all
Token ID477
Feature activation+3.11e+00

Pos logit contributions
war+0.419
le+0.413
of+0.408
bs+0.402
umbs+0.397
Neg logit contributions
 playing-0.343
 office-0.309
ich-0.307
 neighborhood-0.303
 living-0.299
Token around
Token ID1088
Feature activation+3.56e+00

Pos logit contributions
 playing+0.171
ich+0.159
 office+0.158
 neighborhood+0.151
 living+0.146
Neg logit contributions
le-0.236
war-0.235
of-0.232
bs-0.223
les-0.223
Token the
Token ID262
Feature activation-1.01e-02

Pos logit contributions
 playing+0.249
ich+0.235
 office+0.234
 neighborhood+0.227
 living+0.223
Neg logit contributions
le-0.216
war-0.208
of-0.205
bs-0.198
umbs-0.197
Token neighborhood
Token ID6232
Feature activation+1.40e+01

Pos logit contributions
le+0.001
war+0.001
of+0.001
les+0.001
bread+0.001
Neg logit contributions
 playing-0.000
ich-0.000
 office-0.000
 exploring-0.000
 seeking-0.000
Token.
Token ID13
Feature activation-5.62e-01

Pos logit contributions
 playing+0.911
ich+0.860
 office+0.850
 neighborhood+0.839
 living+0.820
Neg logit contributions
le-0.871
war-0.852
of-0.845
bs-0.840
umbs-0.835
Token He
Token ID679
Feature activation+1.79e+00

Pos logit contributions
umbs+0.038
war+0.038
of+0.037
bs+0.037
le+0.036
Neg logit contributions
 playing-0.036
 office-0.032
ich-0.032
 living-0.032
 neighborhood-0.031
Token even
Token ID772
Feature activation+3.27e+00

Pos logit contributions
 playing+0.094
 office+0.086
ich+0.086
 living+0.082
 neighborhood+0.081
Neg logit contributions
le-0.139
of-0.138
war-0.135
bs-0.134
les-0.132
Token rode
Token ID22075
Feature activation+2.88e+00

Pos logit contributions
 playing+0.188
 office+0.163
 living+0.162
 neighborhood+0.160
ich+0.154
Neg logit contributions
war-0.250
umbs-0.248
le-0.245
of-0.240
bs-0.239
Token it
Token ID340
Feature activation-5.99e+00

Pos logit contributions
 playing+0.180
 office+0.158
 living+0.157
 neighborhood+0.155
ich+0.152
Neg logit contributions
war-0.205
le-0.203
umbs-0.202
of-0.199
bs-0.198
Tokenety
Token ID2963
Feature activation-1.47e+00

Pos logit contributions
 playing+0.412
ich+0.373
 neighborhood+0.364
 office+0.352
 living+0.346
Neg logit contributions
war-0.634
le-0.626
bs-0.609
umbs-0.609
forts-0.606
Token saw
Token ID2497
Feature activation+4.08e+00

Pos logit contributions
le+0.124
bs+0.119
les+0.119
war+0.119
of+0.119
Neg logit contributions
 playing-0.072
 office-0.065
 living-0.061
ich-0.061
 neighborhood-0.060
Token a
Token ID257
Feature activation+2.09e-01

Pos logit contributions
 playing+0.251
 neighborhood+0.247
 office+0.243
ich+0.238
 seeking+0.228
Neg logit contributions
war-0.250
of-0.248
le-0.247
bs-0.247
umbs-0.242
Token cat
Token ID3797
Feature activation+2.35e+00

Pos logit contributions
 playing+0.014
 office+0.013
 neighborhood+0.012
 exploring+0.012
 living+0.012
Neg logit contributions
of-0.012
bread-0.012
le-0.012
war-0.012
 sip-0.011
Token named
Token ID3706
Feature activation+9.38e+00

Pos logit contributions
 neighborhood+0.106
ich+0.098
 office+0.097
 neighbourhood+0.097
 seeker+0.096
Neg logit contributions
le-0.194
of-0.181
 sip-0.178
bs-0.178
les-0.174
Token Sy
Token ID1632
Feature activation+1.39e+01

Pos logit contributions
 neighborhood+0.496
 office+0.479
 playing+0.470
 neighbourhood+0.447
 living+0.440
Neg logit contributions
of-0.695
war-0.694
umbs-0.685
bs-0.659
bread-0.641
Tokenlves
Token ID31018
Feature activation+6.24e+00

Pos logit contributions
 playing+0.735
 office+0.671
 living+0.652
 neighborhood+0.641
ich+0.605
Neg logit contributions
le-1.132
war-1.119
of-1.105
bs-1.075
les-1.072
Tokenter
Token ID353
Feature activation+7.18e+00

Pos logit contributions
 playing+0.244
 neighborhood+0.228
 office+0.217
 classroom+0.205
 backyard+0.192
Neg logit contributions
le-0.600
of-0.561
les-0.556
war-0.554
ck-0.538
Token trying
Token ID2111
Feature activation+4.54e+00

Pos logit contributions
 neighborhood+0.458
 seeker+0.449
 office+0.447
 classroom+0.443
 playing+0.439
Neg logit contributions
le-0.427
of-0.415
bs-0.399
ck-0.376
les-0.376
Token to
Token ID284
Feature activation-7.43e-01

Pos logit contributions
 playing+0.267
 living+0.228
 office+0.224
 neighborhood+0.217
ich+0.212
Neg logit contributions
le-0.351
umbs-0.350
war-0.348
of-0.341
bs-0.337
Token catch
Token ID4929
Feature activation+1.78e+00

Pos logit contributions
war+0.055
umbs+0.055
bs+0.054
le+0.053
of+0.053
Neg logit contributions
 playing-0.045
 office-0.038
 living-0.038
 neighborhood-0.037
ich-0.036
Token (newline)
Token ID198
Feature activation+6.31e-01

Pos logit contributions
 playing+0.034
 office+0.030
 living+0.029
ich+0.029
 neighborhood+0.029
Neg logit contributions
war-0.039
of-0.038
umbs-0.038
le-0.038
bs-0.038
TokenShe
Token ID3347
Feature activation+6.44e-01

Pos logit contributions
 playing+0.048
 office+0.044
 living+0.043
 neighborhood+0.042
ich+0.042
Neg logit contributions
war-0.036
of-0.035
le-0.035
umbs-0.035
bs-0.034
Token walked
Token ID6807
Feature activation+9.05e-01

Pos logit contributions
 playing+0.039
 office+0.035
ich+0.035
 living+0.034
 neighborhood+0.034
Neg logit contributions
le-0.046
war-0.045
of-0.045
bs-0.045
umbs-0.044
Token around
Token ID1088
Feature activation+3.54e+00

Pos logit contributions
 playing+0.055
 office+0.049
 living+0.048
 neighborhood+0.048
ich+0.047
Neg logit contributions
war-0.066
le-0.065
of-0.064
umbs-0.064
bs-0.063
Token the
Token ID262
Feature activation-3.11e-02

Pos logit contributions
 playing+0.219
ich+0.212
 office+0.206
 neighborhood+0.203
 living+0.196
Neg logit contributions
le-0.234
war-0.231
of-0.228
bs-0.224
umbs-0.215
Token neighborhood
Token ID6232
Feature activation+1.39e+01

Pos logit contributions
of+0.002
war+0.002
les+0.002
le+0.002
bread+0.002
Neg logit contributions
 playing-0.001
ich-0.001
 saving-0.001
 doing-0.001
 exploring-0.001
Token and
Token ID290
Feature activation-3.07e+00

Pos logit contributions
 playing+0.778
ich+0.758
 office+0.727
 seeker+0.723
 neighborhood+0.715
Neg logit contributions
le-0.939
of-0.916
bs-0.913
bread-0.912
war-0.901
Token finally
Token ID3443
Feature activation-2.11e+00

Pos logit contributions
bs+0.224
le+0.215
of+0.213
war+0.211
les+0.199
Neg logit contributions
ich-0.171
 office-0.165
 playing-0.160
 neighborhood-0.153
 living-0.148
Token found
Token ID1043
Feature activation+4.61e+00

Pos logit contributions
war+0.162
le+0.161
bs+0.160
of+0.160
umbs+0.156
Neg logit contributions
 playing-0.117
 office-0.106
 living-0.103
ich-0.103
 neighborhood-0.103
Token the
Token ID262
Feature activation-1.77e-01

Pos logit contributions
 playing+0.275
 office+0.245
 neighborhood+0.245
ich+0.244
 living+0.239
Neg logit contributions
war-0.331
le-0.330
of-0.324
bs-0.322
umbs-0.321
Token car
Token ID1097
Feature activation+4.59e+00

Pos logit contributions
war+0.013
le+0.012
bread+0.012
of+0.012
umbs+0.012
Neg logit contributions
 playing-0.010
 office-0.009
ich-0.008
 neighborhood-0.008
 living-0.008
Token talked
Token ID6619
Feature activation-5.12e+00

Pos logit contributions
war+0.193
le+0.191
bs+0.188
of+0.186
umbs+0.185
Neg logit contributions
 playing-0.193
 office-0.174
ich-0.172
 living-0.171
 neighborhood-0.171
Token to
Token ID284
Feature activation-1.34e-01

Pos logit contributions
umbs+0.412
war+0.405
le+0.395
of+0.392
forts+0.388
Neg logit contributions
 playing-0.293
 living-0.255
 office-0.239
 neighborhood-0.230
 exploring-0.224
Token people
Token ID661
Feature activation+4.18e+00

Pos logit contributions
war+0.009
le+0.009
umbs+0.008
of+0.008
bs+0.008
Neg logit contributions
 playing-0.009
 office-0.008
 living-0.008
 neighborhood-0.008
ich-0.008
Token in
Token ID287
Feature activation+6.03e+00

Pos logit contributions
 playing+0.260
 office+0.231
 living+0.228
 neighborhood+0.227
ich+0.224
Neg logit contributions
war-0.294
le-0.292
umbs-0.287
of-0.286
bs-0.284
Token the
Token ID262
Feature activation-8.72e-02

Pos logit contributions
 playing+0.368
 living+0.319
 office+0.316
 neighborhood+0.309
 exploring+0.288
Neg logit contributions
war-0.466
umbs-0.461
le-0.453
of-0.445
bs-0.444
Token neighborhood
Token ID6232
Feature activation+1.39e+01

Pos logit contributions
war+0.007
le+0.007
of+0.007
bs+0.007
forts+0.007
Neg logit contributions
 playing-0.005
ich-0.004
 office-0.004
 living-0.004
 neighborhood-0.004
Token.
Token ID13
Feature activation-6.15e-01

Pos logit contributions
 playing+0.926
 office+0.828
 living+0.819
 neighborhood+0.816
ich+0.811
Neg logit contributions
war-0.909
le-0.902
umbs-0.886
of-0.881
bs-0.876
Token Finally
Token ID9461
Feature activation-6.68e-01

Pos logit contributions
war+0.040
umbs+0.040
of+0.039
bs+0.039
le+0.038
Neg logit contributions
 playing-0.040
 office-0.037
ich-0.037
 living-0.036
 neighborhood-0.035
Token,
Token ID11
Feature activation+3.10e+00

Pos logit contributions
le+0.053
bs+0.053
war+0.052
of+0.052
umbs+0.051
Neg logit contributions
 playing-0.033
ich-0.031
 neighborhood-0.031
 office-0.030
 living-0.030
Token they
Token ID484
Feature activation-4.28e+00

Pos logit contributions
 playing+0.208
 office+0.194
 neighborhood+0.193
 living+0.193
ich+0.192
Neg logit contributions
war-0.193
le-0.191
bs-0.189
of-0.186
umbs-0.182
Token found
Token ID1043
Feature activation+4.57e+00

Pos logit contributions
war+0.343
le+0.336
umbs+0.333
of+0.328
bs+0.328
Neg logit contributions
 playing-0.234
 living-0.200
 office-0.200
 neighborhood-0.198
ich-0.191
Token continued
Token ID3767
Feature activation+2.39e+00

Pos logit contributions
umbs+0.026
war+0.026
of+0.026
le+0.025
les+0.025
Neg logit contributions
 playing-0.021
 office-0.020
ich-0.020
 neighborhood-0.018
 living-0.018
Token to
Token ID284
Feature activation-2.00e-01

Pos logit contributions
 playing+0.112
ich+0.109
 office+0.108
 neighborhood+0.105
 living+0.098
Neg logit contributions
bs-0.187
war-0.186
le-0.186
of-0.183
umbs-0.179
Token zoom
Token ID19792
Feature activation+2.47e+00

Pos logit contributions
le+0.014
war+0.014
of+0.013
bs+0.013
umbs+0.013
Neg logit contributions
 playing-0.012
ich-0.011
 office-0.011
 neighborhood-0.011
 living-0.011
Token around
Token ID1088
Feature activation+3.57e+00

Pos logit contributions
 playing+0.150
 living+0.130
 office+0.130
 neighborhood+0.129
ich+0.125
Neg logit contributions
war-0.181
umbs-0.178
le-0.177
of-0.174
bs-0.173
Token the
Token ID262
Feature activation+1.72e-02

Pos logit contributions
ich+0.227
 playing+0.220
 office+0.219
 neighborhood+0.211
 living+0.199
Neg logit contributions
le-0.226
war-0.218
of-0.217
bs-0.211
les-0.210
Token neighbourhood
Token ID19495
Feature activation+1.39e+01

Pos logit contributions
 playing+0.001
ich+0.001
 office+0.001
 exploring+0.001
 seeking+0.001
Neg logit contributions
le-0.001
of-0.001
les-0.001
war-0.001
bread-0.001
Token with
Token ID351
Feature activation+6.09e+00

Pos logit contributions
 playing+0.824
ich+0.790
 office+0.784
 neighborhood+0.762
 living+0.743
Neg logit contributions
le-0.948
war-0.929
of-0.925
bs-0.919
umbs-0.895
Token his
Token ID465
Feature activation+1.70e+00

Pos logit contributions
 playing+0.376
 office+0.337
 living+0.330
 neighborhood+0.330
ich+0.327
Neg logit contributions
war-0.431
le-0.426
umbs-0.420
of-0.419
bs-0.416
Token sc
Token ID629
Feature activation+7.70e+00

Pos logit contributions
 playing+0.083
 office+0.079
ich+0.074
 living+0.073
 neighborhood+0.071
Neg logit contributions
bs-0.129
of-0.127
war-0.125
le-0.124
forts-0.122
Tokenooter
Token ID25141
Feature activation+7.42e+00

Pos logit contributions
 playing+0.396
 living+0.340
 office+0.339
 neighborhood+0.332
ich+0.323
Neg logit contributions
war-0.628
of-0.623
bs-0.618
umbs-0.616
le-0.615
Token.
Token ID13
Feature activation-6.42e-01

Pos logit contributions
 playing+0.440
 neighborhood+0.418
ich+0.416
 office+0.413
 living+0.402
Neg logit contributions
of-0.506
le-0.495
les-0.489
bs-0.488
war-0.486
Token shiny
Token ID22441
Feature activation-5.52e-01

Pos logit contributions
le+0.201
bs+0.201
war+0.199
of+0.196
forts+0.193
Neg logit contributions
 playing-0.172
ich-0.167
 office-0.166
 neighborhood-0.160
 living-0.153
Token.
Token ID13
Feature activation-4.58e-01

Pos logit contributions
war+0.039
umbs+0.038
of+0.037
bs+0.037
le+0.037
Neg logit contributions
 playing-0.035
 living-0.031
 office-0.031
 neighborhood-0.030
ich-0.030
Token People
Token ID4380
Feature activation+1.19e+00

Pos logit contributions
war+0.030
of+0.030
umbs+0.029
bs+0.029
le+0.028
Neg logit contributions
 playing-0.029
ich-0.028
 office-0.027
 living-0.026
 neighborhood-0.026
Token in
Token ID287
Feature activation+6.03e+00

Pos logit contributions
 playing+0.072
 office+0.063
 living+0.062
 neighborhood+0.062
ich+0.061
Neg logit contributions
war-0.087
le-0.086
umbs-0.085
of-0.084
bs-0.084
Token the
Token ID262
Feature activation-8.72e-02

Pos logit contributions
 playing+0.364
 office+0.323
 living+0.318
 neighborhood+0.317
ich+0.316
Neg logit contributions
war-0.435
le-0.431
umbs-0.424
of-0.423
bs-0.420
Token neighbourhood
Token ID19495
Feature activation+1.38e+01

Pos logit contributions
le+0.007
war+0.007
of+0.007
les+0.007
forts+0.007
Neg logit contributions
 playing-0.004
ich-0.003
 office-0.003
 exploring-0.003
 seeking-0.003
Token would
Token ID561
Feature activation-2.85e+00

Pos logit contributions
 playing+0.774
 office+0.696
ich+0.695
 neighborhood+0.676
 living+0.676
Neg logit contributions
le-1.032
war-1.018
bs-1.009
of-1.008
umbs-0.989
Token come
Token ID1282
Feature activation-2.69e+00

Pos logit contributions
war+0.208
umbs+0.202
le+0.198
bs+0.197
of+0.192
Neg logit contributions
 playing-0.183
 living-0.158
 office-0.157
 neighborhood-0.154
ich-0.147
Token to
Token ID284
Feature activation-3.28e-01

Pos logit contributions
war+0.186
le+0.185
umbs+0.181
of+0.181
bs+0.180
Neg logit contributions
 playing-0.171
 office-0.153
 living-0.151
 neighborhood-0.150
ich-0.148
Token look
Token ID804
Feature activation-2.94e+00

Pos logit contributions
umbs+0.026
war+0.025
le+0.024
bs+0.024
les+0.024
Neg logit contributions
 playing-0.020
 office-0.017
 living-0.017
 neighborhood-0.016
 exploring-0.016
Token at
Token ID379
Feature activation+3.02e+00

Pos logit contributions
war+0.245
umbs+0.245
of+0.235
bread+0.232
le+0.232
Neg logit contributions
 playing-0.156
 living-0.132
 office-0.124
 neighborhood-0.123
 exploring-0.118
Token Mary
Token ID5335
Feature activation-2.88e+00

Pos logit contributions
 playing+0.368
 office+0.324
 living+0.322
 neighborhood+0.318
ich+0.313
Neg logit contributions
war-0.428
le-0.423
umbs-0.417
of-0.414
bs-0.411
Tokenâ
Token ID22940
Feature activation+1.58e+00

Pos logit contributions
le+0.225
war+0.220
of+0.218
bs+0.217
umbs+0.217
Neg logit contributions
 playing-0.151
 office-0.136
ich-0.133
 neighborhood-0.132
 living-0.130
Token
Token ID26391
Feature activation+2.41e+00

Pos logit contributions
 playing+0.101
 office+0.091
 neighborhood+0.089
 living+0.088
ich+0.086
Neg logit contributions
war-0.108
umbs-0.107
bs-0.106
le-0.106
of-0.106
Token
Token ID8151
Feature activation+2.60e+00

Pos logit contributions
 playing+0.166
 office+0.151
 living+0.150
 neighborhood+0.144
ich+0.143
Neg logit contributions
war-0.149
umbs-0.149
bs-0.147
of-0.146
le-0.145
Tokens
Token ID82
Feature activation-9.10e+00

Pos logit contributions
 playing+0.200
 office+0.182
 living+0.180
 neighborhood+0.179
ich+0.177
Neg logit contributions
war-0.145
umbs-0.142
le-0.142
of-0.140
bs-0.138
Token neighborhood
Token ID6232
Feature activation+1.38e+01

Pos logit contributions
le+0.654
war+0.652
bs+0.647
of+0.645
umbs+0.635
Neg logit contributions
 playing-0.515
ich-0.461
 office-0.458
 neighborhood-0.449
 living-0.441
Token.
Token ID13
Feature activation-7.24e-01

Pos logit contributions
 playing+0.868
ich+0.798
 office+0.788
 neighborhood+0.784
 living+0.769
Neg logit contributions
le-0.904
war-0.890
of-0.884
bs-0.883
umbs-0.872
Token All
Token ID1439
Feature activation+2.30e+00

Pos logit contributions
le+0.051
war+0.051
umbs+0.049
bs+0.049
of+0.049
Neg logit contributions
 playing-0.046
 office-0.041
 living-0.040
 neighborhood-0.040
ich-0.039
Token the
Token ID262
Feature activation-2.71e-01

Pos logit contributions
 playing+0.170
 living+0.154
 office+0.151
 neighborhood+0.148
ich+0.143
Neg logit contributions
war-0.139
umbs-0.137
le-0.134
forts-0.131
of-0.131
Token kids
Token ID3988
Feature activation+5.95e+00

Pos logit contributions
le+0.021
war+0.021
of+0.021
umbs+0.021
bs+0.020
Neg logit contributions
 playing-0.015
 office-0.013
ich-0.013
 living-0.012
 neighborhood-0.012
Token would
Token ID561
Feature activation-3.01e+00

Pos logit contributions
 playing+0.347
 office+0.323
ich+0.319
 neighborhood+0.311
 living+0.305
Neg logit contributions
le-0.426
bs-0.419
war-0.414
of-0.414
umbs-0.405
Token to
Token ID284
Feature activation-8.88e-01

Pos logit contributions
 playing+0.304
ich+0.285
 office+0.277
 neighborhood+0.273
 living+0.271
Neg logit contributions
war-0.399
bs-0.398
of-0.391
le-0.386
umbs-0.384
Token do
Token ID466
Feature activation+7.68e-01

Pos logit contributions
le+0.058
war+0.057
of+0.057
bs+0.056
umbs+0.055
Neg logit contributions
 playing-0.056
ich-0.053
 neighborhood-0.052
 office-0.052
 living-0.051
Token.
Token ID13
Feature activation-4.43e-01

Pos logit contributions
 playing+0.047
ich+0.045
 neighborhood+0.045
 office+0.044
 living+0.043
Neg logit contributions
le-0.051
war-0.051
bs-0.050
of-0.049
umbs-0.048
Token Tim
Token ID5045
Feature activation+4.85e+00

Pos logit contributions
war+0.029
umbs+0.029
of+0.029
bs+0.028
le+0.027
Neg logit contributions
 playing-0.029
ich-0.027
 office-0.026
 living-0.026
 neighborhood-0.025
Token imagined
Token ID15758
Feature activation-2.13e+00

Pos logit contributions
 playing+0.270
ich+0.251
 office+0.239
 living+0.234
 neighborhood+0.231
Neg logit contributions
le-0.372
war-0.365
of-0.364
les-0.356
bs-0.352
Token playing
Token ID2712
Feature activation+1.37e+01

Pos logit contributions
le+0.152
war+0.149
of+0.149
bs+0.148
umbs+0.144
Neg logit contributions
 playing-0.118
 neighborhood-0.112
ich-0.111
 office-0.111
 living-0.106
Token with
Token ID351
Feature activation+6.09e+00

Pos logit contributions
ich+0.757
 playing+0.725
 neighborhood+0.702
 office+0.675
 living+0.651
Neg logit contributions
le-1.038
war-1.020
of-0.972
les-0.956
bs-0.947
Token his
Token ID465
Feature activation+1.70e+00

Pos logit contributions
 playing+0.364
 office+0.330
ich+0.325
 neighborhood+0.325
 living+0.322
Neg logit contributions
war-0.433
le-0.429
of-0.421
umbs-0.421
bs-0.418
Token friends
Token ID2460
Feature activation+9.04e+00

Pos logit contributions
 playing+0.087
 office+0.086
ich+0.082
 neighborhood+0.081
 living+0.078
Neg logit contributions
bs-0.118
le-0.116
war-0.115
of-0.114
les-0.114
Token,
Token ID11
Feature activation+3.14e+00

Pos logit contributions
 playing+0.531
ich+0.500
 office+0.494
 neighborhood+0.488
 living+0.473
Neg logit contributions
le-0.635
war-0.628
of-0.612
bs-0.610
umbs-0.610
Token running
Token ID2491
Feature activation+7.39e+00

Pos logit contributions
 playing+0.168
 office+0.162
ich+0.160
 neighborhood+0.158
 living+0.153
Neg logit contributions
war-0.226
le-0.224
of-0.224
bs-0.221
umbs-0.220
TokenL
Token ID43
Feature activation-3.63e-01

Pos logit contributions
 playing+0.008
 office+0.008
 living+0.008
 neighborhood+0.007
ich+0.007
Neg logit contributions
war-0.007
le-0.007
umbs-0.007
of-0.007
bs-0.007
Tokenily
Token ID813
Feature activation-1.14e+00

Pos logit contributions
war+0.024
le+0.024
umbs+0.023
bs+0.023
of+0.023
Neg logit contributions
 playing-0.024
 office-0.022
 living-0.022
 neighborhood-0.021
ich-0.021
Token and
Token ID290
Feature activation-2.93e+00

Pos logit contributions
le+0.083
bs+0.081
of+0.080
war+0.079
les+0.079
Neg logit contributions
 playing-0.066
ich-0.063
 office-0.061
 neighborhood-0.060
 living-0.058
Token Max
Token ID5436
Feature activation+3.22e+00

Pos logit contributions
bs+0.190
le+0.183
of+0.180
forts+0.174
war+0.174
Neg logit contributions
ich-0.186
 neighborhood-0.183
 office-0.183
 playing-0.183
 living-0.171
Token were
Token ID547
Feature activation-9.94e-01

Pos logit contributions
 playing+0.203
 office+0.192
ich+0.189
 neighborhood+0.184
 living+0.183
Neg logit contributions
le-0.219
war-0.215
bs-0.214
of-0.213
umbs-0.209
Token playing
Token ID2712
Feature activation+1.37e+01

Pos logit contributions
ck+0.074
le+0.073
bs+0.072
 tighter+0.071
umbs+0.070
Neg logit contributions
 cabinet-0.037
 office-0.037
 neighborhood-0.036
ich-0.036
 factory-0.036
Token soccer
Token ID11783
Feature activation+5.14e+00

Pos logit contributions
ich+0.723
 office+0.709
 neighborhood+0.665
 playing+0.654
 factory+0.606
Neg logit contributions
le-1.034
war-0.991
bs-0.945
les-0.943
of-0.916
Token in
Token ID287
Feature activation+5.97e+00

Pos logit contributions
 playing+0.278
 office+0.271
ich+0.268
 neighborhood+0.256
 living+0.241
Neg logit contributions
le-0.382
war-0.370
of-0.364
les-0.362
bs-0.352
Token the
Token ID262
Feature activation-1.08e-01

Pos logit contributions
 playing+0.371
 office+0.336
ich+0.330
 neighborhood+0.326
 living+0.326
Neg logit contributions
war-0.416
le-0.410
of-0.405
umbs-0.403
bs-0.396
Token park
Token ID3952
Feature activation+8.10e+00

Pos logit contributions
war+0.009
les+0.008
of+0.008
le+0.008
umbs+0.008
Neg logit contributions
 playing-0.005
ich-0.005
 office-0.004
gy-0.004
 exploring-0.004
Token.
Token ID13
Feature activation-6.42e-01

Pos logit contributions
 playing+0.560
ich+0.527
 office+0.525
 neighborhood+0.516
 living+0.508
Neg logit contributions
le-0.483
war-0.475
of-0.463
bs-0.457
umbs-0.455
Token to
Token ID284
Feature activation-8.88e-01

Pos logit contributions
war+0.100
le+0.099
umbs+0.097
of+0.097
bs+0.097
Neg logit contributions
 playing-0.070
 office-0.061
 living-0.061
 neighborhood-0.060
ich-0.060
Token do
Token ID466
Feature activation+7.68e-01

Pos logit contributions
le+0.058
war+0.057
of+0.057
bs+0.056
umbs+0.055
Neg logit contributions
 playing-0.056
ich-0.053
 neighborhood-0.052
 office-0.052
 living-0.051
Token so
Token ID523
Feature activation+3.36e+00

Pos logit contributions
 playing+0.047
ich+0.045
 neighborhood+0.045
 office+0.044
 living+0.043
Neg logit contributions
le-0.051
war-0.051
bs-0.050
of-0.049
umbs-0.048
Token and
Token ID290
Feature activation-2.95e+00

Pos logit contributions
 playing+0.232
 office+0.212
 living+0.209
 neighborhood+0.209
ich+0.207
Neg logit contributions
war-0.211
le-0.209
bs-0.207
umbs-0.204
of-0.204
Token continued
Token ID3767
Feature activation+3.14e+00

Pos logit contributions
war+0.197
bs+0.196
le+0.195
of+0.190
forts+0.184
Neg logit contributions
 playing-0.181
ich-0.174
 office-0.169
 neighborhood-0.169
 living-0.163
Token playing
Token ID2712
Feature activation+1.37e+01

Pos logit contributions
 playing+0.161
ich+0.149
 office+0.148
 neighborhood+0.147
 living+0.141
Neg logit contributions
war-0.242
le-0.239
bs-0.238
of-0.234
umbs-0.232
Token with
Token ID351
Feature activation+6.09e+00

Pos logit contributions
 playing+0.790
 office+0.712
ich+0.709
 neighborhood+0.700
 living+0.691
Neg logit contributions
le-1.022
war-1.016
bs-0.983
of-0.980
umbs-0.978
Token his
Token ID465
Feature activation+1.70e+00

Pos logit contributions
 playing+0.377
 office+0.330
 living+0.329
 neighborhood+0.324
ich+0.317
Neg logit contributions
war-0.440
le-0.433
umbs-0.429
of-0.424
bs-0.422
Token toys
Token ID14958
Feature activation+5.14e+00

Pos logit contributions
 playing+0.074
 office+0.069
ich+0.069
 neighborhood+0.063
 living+0.062
Neg logit contributions
bs-0.133
le-0.130
war-0.129
les-0.126
of-0.126
Token.
Token ID13
Feature activation-6.15e-01

Pos logit contributions
 playing+0.335
 office+0.300
 living+0.296
 neighborhood+0.294
ich+0.291
Neg logit contributions
war-0.347
le-0.344
umbs-0.338
of-0.336
bs-0.334
Token\n
Token ID198
Feature activation+4.32e-01

Pos logit contributions
war+0.041
umbs+0.041
of+0.040
bs+0.039
le+0.039
Neg logit contributions
 playing-0.040
 office-0.036
ich-0.036
 living-0.035
 neighborhood-0.035
Token,
Token ID11
Feature activation+2.54e+00

Pos logit contributions
war+0.048
umbs+0.048
bread+0.045
bs+0.044
of+0.043
Neg logit contributions
 office-0.047
 playing-0.046
ich-0.045
 living-0.043
 seeker-0.043
Token while
Token ID981
Feature activation-5.66e+00

Pos logit contributions
 playing+0.140
 neighborhood+0.134
 office+0.134
ich+0.133
 living+0.126
Neg logit contributions
bs-0.175
war-0.175
le-0.174
of-0.173
umbs-0.169
Token Tim
Token ID5045
Feature activation+4.86e+00

Pos logit contributions
bs+0.349
le+0.342
of+0.334
umbs+0.326
war+0.323
Neg logit contributions
 neighborhood-0.346
ich-0.344
 office-0.340
 playing-0.334
 living-0.319
Tokenmy
Token ID1820
Feature activation+7.57e-01

Pos logit contributions
 playing+0.264
ich+0.240
 office+0.237
 living+0.235
 neighborhood+0.231
Neg logit contributions
le-0.377
of-0.376
war-0.376
umbs-0.364
bs-0.363
Token was
Token ID373
Feature activation+5.67e+00

Pos logit contributions
 playing+0.046
 office+0.041
 living+0.040
ich+0.040
 neighborhood+0.040
Neg logit contributions
le-0.054
war-0.054
of-0.053
umbs-0.053
bs-0.053
Token playing
Token ID2712
Feature activation+1.37e+01

Pos logit contributions
ich+0.217
 office+0.197
 neighborhood+0.195
 factory+0.193
 playing+0.180
Neg logit contributions
le-0.488
war-0.479
bs-0.479
umbs-0.472
les-0.460
Token with
Token ID351
Feature activation+6.09e+00

Pos logit contributions
ich+0.819
 neighborhood+0.738
 office+0.732
 playing+0.672
 neighbourhood+0.657
Neg logit contributions
le-1.035
war-0.961
 sip-0.924
of-0.921
les-0.919
Token his
Token ID465
Feature activation+1.70e+00

Pos logit contributions
 playing+0.366
 office+0.330
 neighborhood+0.324
ich+0.323
 living+0.322
Neg logit contributions
war-0.438
le-0.433
umbs-0.428
of-0.425
bs-0.423
Token new
Token ID649
Feature activation+3.73e-02

Pos logit contributions
 office+0.086
 playing+0.084
ich+0.080
 neighborhood+0.077
 exploring+0.074
Neg logit contributions
bs-0.116
war-0.112
le-0.112
les-0.112
of-0.111
Token sp
Token ID599
Feature activation-8.30e-01

Pos logit contributions
ich+0.002
 playing+0.002
 office+0.002
 neighborhood+0.002
 living+0.002
Neg logit contributions
le-0.003
les-0.003
war-0.002
bs-0.002
of-0.002
Tokenade
Token ID671
Feature activation-6.49e+00

Pos logit contributions
war+0.029
le+0.027
bs+0.027
forts+0.026
of+0.026
Neg logit contributions
 playing-0.081
 office-0.076
 living-0.075
 neighborhood-0.075
ich-0.074
Token named
Token ID3706
Feature activation+9.23e+00

Pos logit contributions
war+0.280
le+0.278
bs+0.270
umbs+0.270
of+0.270
Neg logit contributions
 playing-0.231
 office-0.207
ich-0.206
 living-0.202
 neighborhood-0.200
Token Lily
Token ID20037
Feature activation+2.74e+00

Pos logit contributions
 playing+0.493
 office+0.457
 neighborhood+0.451
ich+0.446
 living+0.431
Neg logit contributions
of-0.707
le-0.707
bs-0.706
war-0.700
umbs-0.679
Token.
Token ID13
Feature activation-4.43e-01

Pos logit contributions
 playing+0.186
 office+0.185
ich+0.180
 factory+0.177
 neighborhood+0.173
Neg logit contributions
le-0.165
of-0.163
bs-0.160
les-0.151
war-0.147
Token She
Token ID1375
Feature activation+2.15e+00

Pos logit contributions
umbs+0.028
of+0.028
war+0.026
bs+0.025
fulness+0.024
Neg logit contributions
 playing-0.028
 office-0.027
ich-0.027
 living-0.026
 neighborhood-0.025
Token loved
Token ID6151
Feature activation+8.78e+00

Pos logit contributions
ich+0.111
 office+0.107
 playing+0.103
 seeker+0.100
 factory+0.097
Neg logit contributions
le-0.157
of-0.153
bs-0.153
les-0.147
forts-0.141
Token playing
Token ID2712
Feature activation+1.37e+01

Pos logit contributions
 playing+0.528
 office+0.515
ich+0.511
 neighborhood+0.509
 seeker+0.483
Neg logit contributions
war-0.564
bs-0.552
of-0.550
umbs-0.540
le-0.539
Token with
Token ID351
Feature activation+6.09e+00

Pos logit contributions
ich+0.719
 office+0.690
 neighborhood+0.667
 playing+0.641
 smartest+0.619
Neg logit contributions
le-1.070
les-1.002
war-0.994
bs-0.977
of-0.961
Token her
Token ID607
Feature activation+6.49e-01

Pos logit contributions
 playing+0.383
 office+0.366
 neighborhood+0.358
ich+0.357
 living+0.342
Neg logit contributions
le-0.398
war-0.398
of-0.393
bs-0.392
umbs-0.390
Token friends
Token ID2460
Feature activation+9.04e+00

Pos logit contributions
 office+0.034
 playing+0.031
ich+0.030
 neighborhood+0.028
 seeker+0.028
Neg logit contributions
bs-0.046
les-0.044
le-0.043
of-0.042
forts-0.042
Token and
Token ID290
Feature activation-3.11e+00

Pos logit contributions
ich+0.625
 office+0.610
 playing+0.598
 neighborhood+0.585
 seeker+0.579
Neg logit contributions
le-0.530
les-0.478
umbs-0.477
of-0.476
bs-0.472
Token being
Token ID852
Feature activation+8.53e+00

Pos logit contributions
bs+0.255
of+0.246
forts+0.244
 sip+0.241
 syrup+0.239
Neg logit contributions
ich-0.119
 office-0.117
 neighborhood-0.103
 seeker-0.096
 neighbourhood-0.094
Token again
Token ID757
Feature activation-5.03e+00

Pos logit contributions
umbs+0.116
war+0.116
le+0.113
bread+0.113
of+0.113
Neg logit contributions
 playing-0.074
 office-0.062
 living-0.062
 neighborhood-0.057
ich-0.056
Token and
Token ID290
Feature activation-3.02e+00

Pos logit contributions
le+0.329
war+0.318
of+0.316
bs+0.315
umbs+0.312
Neg logit contributions
 playing-0.319
 office-0.299
ich-0.296
 neighborhood-0.291
 living-0.287
Token went
Token ID1816
Feature activation+1.76e+00

Pos logit contributions
le+0.203
war+0.200
bs+0.198
of+0.196
forts+0.190
Neg logit contributions
 playing-0.189
ich-0.177
 neighborhood-0.173
 office-0.173
 living-0.167
Token back
Token ID736
Feature activation+4.04e+00

Pos logit contributions
 playing+0.095
ich+0.089
 office+0.088
 neighborhood+0.087
 living+0.084
Neg logit contributions
war-0.130
le-0.128
bs-0.128
of-0.126
umbs-0.125
Token to
Token ID284
Feature activation-1.34e-01

Pos logit contributions
ich+0.236
 playing+0.225
 office+0.221
 neighborhood+0.214
 factory+0.209
Neg logit contributions
bs-0.273
war-0.268
le-0.266
of-0.259
forts-0.254
Token playing
Token ID2712
Feature activation+1.37e+01

Pos logit contributions
le+0.009
war+0.009
bs+0.009
of+0.009
 sip+0.009
Neg logit contributions
ich-0.007
 neighborhood-0.007
 office-0.007
 playing-0.006
 factory-0.006
Token with
Token ID351
Feature activation+6.09e+00

Pos logit contributions
 playing+0.757
ich+0.724
 office+0.703
 neighborhood+0.690
 living+0.667
Neg logit contributions
le-1.038
war-1.016
bs-0.985
of-0.975
umbs-0.973
Token his
Token ID465
Feature activation+1.70e+00

Pos logit contributions
 playing+0.372
 office+0.329
 living+0.325
 neighborhood+0.323
ich+0.318
Neg logit contributions
war-0.440
le-0.435
umbs-0.429
of-0.426
bs-0.424
Token ball
Token ID2613
Feature activation+1.07e+00

Pos logit contributions
 playing+0.077
 office+0.073
ich+0.070
 neighborhood+0.066
 living+0.065
Neg logit contributions
bs-0.128
war-0.125
le-0.125
les-0.123
of-0.123
Token.
Token ID13
Feature activation-6.15e-01

Pos logit contributions
 playing+0.069
ich+0.066
 office+0.065
 neighborhood+0.064
 living+0.062
Neg logit contributions
le-0.069
war-0.067
bs-0.066
of-0.065
umbs-0.065
Token The
Token ID383
Feature activation+3.54e-01

Pos logit contributions
umbs+0.041
war+0.040
of+0.040
bs+0.039
forts+0.038
Neg logit contributions
 playing-0.039
ich-0.037
 office-0.037
 living-0.035
 neighborhood-0.035
Token the
Token ID262
Feature activation-7.37e-01

Pos logit contributions
 playing+0.242
 office+0.222
 neighborhood+0.221
 living+0.215
ich+0.212
Neg logit contributions
le-0.240
war-0.237
bs-0.235
of-0.234
umbs-0.228
Token road
Token ID2975
Feature activation+3.53e+00

Pos logit contributions
war+0.054
le+0.054
of+0.053
bs+0.052
umbs+0.052
Neg logit contributions
 playing-0.040
ich-0.034
 office-0.034
 neighborhood-0.034
 living-0.033
Token.
Token ID13
Feature activation-4.43e-01

Pos logit contributions
 playing+0.214
ich+0.196
 office+0.191
 neighborhood+0.190
 living+0.187
Neg logit contributions
le-0.243
war-0.242
of-0.237
umbs-0.236
bs-0.233
Token We
Token ID775
Feature activation+2.58e+00

Pos logit contributions
umbs+0.029
war+0.029
of+0.028
bs+0.028
le+0.026
Neg logit contributions
 playing-0.028
ich-0.027
 office-0.027
 living-0.026
 neighborhood-0.025
Token were
Token ID547
Feature activation-9.94e-01

Pos logit contributions
 playing+0.169
 office+0.149
 living+0.148
 neighborhood+0.147
ich+0.141
Neg logit contributions
war-0.176
le-0.172
umbs-0.172
of-0.169
bs-0.167
Token playing
Token ID2712
Feature activation+1.37e+01

Pos logit contributions
le+0.077
bs+0.076
ck+0.074
war+0.073
les+0.073
Neg logit contributions
ich-0.041
 neighborhood-0.039
 office-0.035
gy-0.034
 neighbourhood-0.033
Token.
Token ID13
Feature activation-5.06e-01

Pos logit contributions
 playing+0.781
ich+0.769
 neighborhood+0.743
 office+0.732
 living+0.700
Neg logit contributions
le-0.959
war-0.939
of-0.872
les-0.871
bs-0.871
Token We
Token ID775
Feature activation+2.47e+00

Pos logit contributions
war+0.039
le+0.038
umbs+0.038
bs+0.038
of+0.038
Neg logit contributions
 playing-0.028
 office-0.025
 living-0.024
 neighborhood-0.024
ich-0.024
Token were
Token ID547
Feature activation-1.12e+00

Pos logit contributions
 playing+0.144
 living+0.124
 office+0.124
 neighborhood+0.122
ich+0.119
Neg logit contributions
war-0.188
le-0.184
umbs-0.183
of-0.181
bs-0.178
Token not
Token ID407
Feature activation+4.47e+00

Pos logit contributions
le+0.087
ck+0.086
bs+0.085
les+0.082
umbs+0.082
Neg logit contributions
 neighborhood-0.042
ich-0.041
 office-0.038
gy-0.037
 neighbourhood-0.036
Token careful
Token ID8161
Feature activation-4.77e-01

Pos logit contributions
 neighborhood+0.204
 office+0.202
 playing+0.196
ich+0.192
 neighbourhood+0.183
Neg logit contributions
bs-0.347
war-0.342
le-0.339
umbs-0.331
bread-0.328
TokenL
Token ID43
Feature activation-3.63e-01

Pos logit contributions
 neighborhood+0.298
 office+0.291
 playing+0.283
 factory+0.274
 neighbourhood+0.273
Neg logit contributions
le-0.379
bs-0.358
war-0.357
umbs-0.355
ck-0.353
Tokenila
Token ID10102
Feature activation-4.56e+00

Pos logit contributions
war+0.024
le+0.024
umbs+0.023
bs+0.023
of+0.023
Neg logit contributions
 playing-0.024
 office-0.022
 living-0.022
 neighborhood-0.021
ich-0.021
Token and
Token ID290
Feature activation-2.93e+00

Pos logit contributions
le+0.356
of+0.346
bs+0.344
war+0.342
umbs+0.340
Neg logit contributions
 playing-0.248
 office-0.222
 neighborhood-0.218
ich-0.218
 living-0.214
Token Ben
Token ID3932
Feature activation-6.18e-01

Pos logit contributions
bs+0.187
le+0.180
of+0.177
war+0.170
forts+0.170
Neg logit contributions
 playing-0.192
 neighborhood-0.190
ich-0.190
 office-0.187
 living-0.177
Token were
Token ID547
Feature activation-9.94e-01

Pos logit contributions
war+0.041
le+0.040
umbs+0.039
of+0.039
bs+0.039
Neg logit contributions
 playing-0.041
 office-0.037
 living-0.037
 neighborhood-0.037
ich-0.036
Token playing
Token ID2712
Feature activation+1.37e+01

Pos logit contributions
ck+0.072
le+0.072
bs+0.070
les+0.067
 tighter+0.067
Neg logit contributions
 cabinet-0.038
gy-0.036
 neighborhood-0.036
ich-0.036
 office-0.036
Token in
Token ID287
Feature activation+6.03e+00

Pos logit contributions
ich+0.719
 office+0.698
 neighborhood+0.662
 playing+0.647
 exploring+0.606
Neg logit contributions
le-1.029
war-0.985
les-0.943
bs-0.927
of-0.908
Token the
Token ID262
Feature activation-8.72e-02

Pos logit contributions
 playing+0.392
ich+0.387
 office+0.377
 neighborhood+0.365
 living+0.348
Neg logit contributions
war-0.357
le-0.355
of-0.352
umbs-0.337
bs-0.333
Token park
Token ID3952
Feature activation+8.13e+00

Pos logit contributions
war+0.007
les+0.007
le+0.006
of+0.006
 sip+0.006
Neg logit contributions
 playing-0.004
ich-0.004
 exploring-0.003
 saving-0.003
 seeking-0.003
Token.
Token ID13
Feature activation-6.15e-01

Pos logit contributions
 playing+0.535
ich+0.510
 office+0.506
 neighborhood+0.501
 living+0.486
Neg logit contributions
le-0.506
war-0.490
of-0.479
bs-0.474
umbs-0.469
Token They
Token ID1119
Feature activation-2.72e+00

Pos logit contributions
war+0.029
umbs+0.028
bs+0.028
of+0.028
le+0.026
Neg logit contributions
 playing-0.051
 office-0.048
ich-0.048
 neighborhood-0.047
 living-0.046
Token in
Token ID287
Feature activation+5.46e+00

Pos logit contributions
 playing+0.105
 office+0.096
ich+0.095
 neighborhood+0.094
 living+0.093
Neg logit contributions
le-0.128
war-0.125
of-0.124
bs-0.124
umbs-0.121
Token need
Token ID761
Feature activation-3.09e+00

Pos logit contributions
 playing+0.340
 office+0.300
 living+0.297
 neighborhood+0.293
ich+0.285
Neg logit contributions
war-0.394
le-0.388
umbs-0.383
of-0.379
bs-0.377
Token.
Token ID13
Feature activation-1.07e+00

Pos logit contributions
war+0.226
umbs+0.222
le+0.222
of+0.216
bs+0.214
Neg logit contributions
 playing-0.195
 living-0.170
 office-0.168
 neighborhood-0.164
 exploring-0.156
Token Her
Token ID2332
Feature activation+2.47e-01

Pos logit contributions
umbs+0.077
war+0.076
of+0.076
bs+0.074
le+0.074
Neg logit contributions
 playing-0.065
 office-0.057
ich-0.057
 neighborhood-0.055
 living-0.055
Token thought
Token ID1807
Feature activation-2.40e+00

Pos logit contributions
 playing+0.014
ich+0.013
 living+0.012
 office+0.012
 seeker+0.012
Neg logit contributions
bs-0.016
war-0.016
forts-0.016
of-0.016
bread-0.016
Tokenfulness
Token ID15538
Feature activation-1.67e+01

Pos logit contributions
war+0.164
le+0.162
umbs+0.160
of+0.159
bs+0.158
Neg logit contributions
 playing-0.156
 office-0.138
 living-0.137
 neighborhood-0.135
ich-0.133
Token always
Token ID1464
Feature activation+5.67e+00

Pos logit contributions
war+1.340
le+1.335
of+1.311
umbs+1.311
bs+1.307
Neg logit contributions
 playing-0.870
 office-0.759
 neighborhood-0.740
 living-0.739
ich-0.735
Token helped
Token ID4193
Feature activation-2.31e+00

Pos logit contributions
ich+0.339
 playing+0.337
 office+0.322
 neighborhood+0.314
 living+0.297
Neg logit contributions
le-0.390
bs-0.378
war-0.378
of-0.372
forts-0.372
Token others
Token ID1854
Feature activation+6.23e+00

Pos logit contributions
war+0.171
umbs+0.168
le+0.167
bs+0.164
of+0.164
Neg logit contributions
 playing-0.139
 office-0.122
 living-0.120
 neighborhood-0.119
ich-0.116
Token succeed
Token ID6758
Feature activation+1.87e+00

Pos logit contributions
 playing+0.363
 office+0.325
 neighborhood+0.319
ich+0.318
 living+0.315
Neg logit contributions
le-0.457
war-0.456
of-0.445
bs-0.444
umbs-0.441
Token.
Token ID13
Feature activation-1.06e+00

Pos logit contributions
 playing+0.113
 living+0.098
 office+0.097
 neighborhood+0.096
ich+0.092
Neg logit contributions
war-0.140
umbs-0.138
le-0.134
of-0.133
forts-0.132
Token and
Token ID290
Feature activation-3.56e+00

Pos logit contributions
 playing+0.171
 office+0.152
 living+0.150
 neighborhood+0.149
ich+0.147
Neg logit contributions
war-0.199
le-0.197
umbs-0.194
of-0.193
bs-0.192
Token enthusiastic
Token ID21848
Feature activation+1.45e+00

Pos logit contributions
le+0.248
bs+0.247
war+0.246
of+0.239
forts+0.236
Neg logit contributions
 playing-0.208
ich-0.200
 neighborhood-0.193
 office-0.193
 living-0.186
Token that
Token ID326
Feature activation+1.10e+00

Pos logit contributions
 playing+0.088
ich+0.082
 office+0.081
 neighborhood+0.080
 living+0.078
Neg logit contributions
le-0.102
war-0.101
of-0.099
bs-0.098
umbs-0.097
Token his
Token ID465
Feature activation+1.20e+00

Pos logit contributions
ich+0.074
 playing+0.073
 neighborhood+0.071
 office+0.070
 living+0.066
Neg logit contributions
le-0.067
bs-0.064
war-0.064
of-0.064
fulness-0.063
Token forget
Token ID6044
Feature activation-1.01e+00

Pos logit contributions
ich+0.064
 playing+0.060
 living+0.057
 office+0.056
 neighborhood+0.054
Neg logit contributions
le-0.080
bs-0.079
 sip-0.078
forts-0.076
war-0.075
Tokenfulness
Token ID15538
Feature activation-1.67e+01

Pos logit contributions
war+0.069
le+0.068
bs+0.068
of+0.067
umbs+0.066
Neg logit contributions
 playing-0.062
ich-0.057
 office-0.056
 neighborhood-0.056
 living-0.054
Token had
Token ID550
Feature activation+1.29e+00

Pos logit contributions
war+1.262
le+1.258
umbs+1.233
of+1.231
bs+1.229
Neg logit contributions
 playing-0.941
 office-0.833
 neighborhood-0.814
 living-0.812
ich-0.809
Token an
Token ID281
Feature activation+2.62e+00

Pos logit contributions
 playing+0.077
 living+0.065
 office+0.063
 neighborhood+0.063
 exploring+0.060
Neg logit contributions
war-0.099
umbs-0.098
le-0.098
of-0.096
bs-0.095
Token unfortunate
Token ID14855
Feature activation+2.62e-01

Pos logit contributions
 playing+0.157
 office+0.138
 living+0.136
 neighborhood+0.135
ich+0.132
Neg logit contributions
war-0.194
le-0.191
umbs-0.189
of-0.188
bs-0.187
Token ending
Token ID7464
Feature activation-3.15e-02

Pos logit contributions
 playing+0.017
ich+0.016
 office+0.015
 neighborhood+0.015
 living+0.015
Neg logit contributions
le-0.017
war-0.016
forts-0.016
bs-0.016
of-0.016
Token.
Token ID13
Feature activation-1.06e+00

Pos logit contributions
war+0.002
le+0.002
umbs+0.002
of+0.002
bs+0.002
Neg logit contributions
 playing-0.002
 office-0.002
 neighborhood-0.002
 living-0.002
ich-0.002
Token bird
Token ID6512
Feature activation-1.45e+00

Pos logit contributions
war+0.044
le+0.044
umbs+0.043
of+0.043
bs+0.043
Neg logit contributions
 playing-0.033
 office-0.029
 living-0.029
 neighborhood-0.029
ich-0.028
Token's
Token ID338
Feature activation-1.79e-01

Pos logit contributions
war+0.111
le+0.108
umbs+0.108
of+0.107
bs+0.106
Neg logit contributions
 playing-0.085
 office-0.073
 living-0.073
 neighborhood-0.071
ich-0.070
Token determination
Token ID12123
Feature activation-4.40e+00

Pos logit contributions
umbs+0.012
le+0.012
war+0.012
of+0.012
les+0.012
Neg logit contributions
 playing-0.012
 office-0.011
 living-0.011
 neighborhood-0.010
ich-0.010
Token and
Token ID290
Feature activation-3.55e+00

Pos logit contributions
war+0.319
umbs+0.311
le+0.306
of+0.302
bs+0.300
Neg logit contributions
 playing-0.283
 living-0.249
 office-0.244
 neighborhood-0.236
ich-0.227
Token grace
Token ID11542
Feature activation-6.17e+00

Pos logit contributions
war+0.247
le+0.244
umbs+0.240
of+0.239
bs+0.238
Neg logit contributions
 playing-0.226
 office-0.201
 living-0.199
 neighborhood-0.197
ich-0.195
Tokenfulness
Token ID15538
Feature activation-1.67e+01

Pos logit contributions
war+0.296
umbs+0.287
bread+0.267
forts+0.266
of+0.258
Neg logit contributions
 playing-0.537
 living-0.487
 office-0.459
 exploring-0.450
ich-0.450
Token.
Token ID13
Feature activation-1.07e+00

Pos logit contributions
war+1.217
le+1.191
umbs+1.189
of+1.170
bs+1.157
Neg logit contributions
 playing-1.029
 living-0.901
 office-0.894
 neighborhood-0.877
ich-0.859
Token She
Token ID1375
Feature activation+1.53e+00

Pos logit contributions
war+0.080
umbs+0.078
of+0.077
le+0.077
bs+0.077
Neg logit contributions
 playing-0.063
 office-0.055
 living-0.054
ich-0.054
 neighborhood-0.053
Token applauded
Token ID38911
Feature activation-2.02e+00

Pos logit contributions
 playing+0.092
 office+0.082
 living+0.080
 neighborhood+0.080
ich+0.079
Neg logit contributions
war-0.112
le-0.111
umbs-0.109
of-0.109
bs-0.108
Token it
Token ID340
Feature activation-6.40e+00

Pos logit contributions
war+0.151
umbs+0.149
le+0.147
of+0.144
bs+0.144
Neg logit contributions
 playing-0.123
 living-0.106
 office-0.105
 neighborhood-0.102
ich-0.101
Token and
Token ID290
Feature activation-3.55e+00

Pos logit contributions
war+0.482
umbs+0.476
le+0.471
bs+0.461
of+0.460
Neg logit contributions
 playing-0.383
 living-0.332
 office-0.328
 neighborhood-0.315
 exploring-0.304
Token:
Token ID25
Feature activation+3.36e+00

Pos logit contributions
of+0.391
war+0.388
le+0.385
umbs+0.380
bs+0.379
Neg logit contributions
 playing-0.316
ich-0.277
 neighborhood-0.269
 office-0.265
 living-0.258
Token Kind
Token ID14927
Feature activation-3.45e+00

Pos logit contributions
 playing+0.181
 office+0.159
 neighborhood+0.157
ich+0.156
 living+0.155
Neg logit contributions
war-0.260
le-0.257
of-0.255
umbs-0.254
bs-0.253
Tokenness
Token ID1108
Feature activation-6.20e+00

Pos logit contributions
war+0.182
le+0.173
umbs+0.172
of+0.167
bs+0.166
Neg logit contributions
 playing-0.282
 office-0.259
 living-0.259
 neighborhood-0.256
ich-0.248
Token and
Token ID290
Feature activation-3.55e+00

Pos logit contributions
war+0.474
le+0.467
umbs+0.463
of+0.455
bs+0.453
Neg logit contributions
 playing-0.355
 living-0.311
 office-0.309
 neighborhood-0.302
ich-0.294
Token thought
Token ID1807
Feature activation-2.39e+00

Pos logit contributions
war+0.240
bs+0.239
le+0.237
of+0.234
forts+0.231
Neg logit contributions
 playing-0.217
 office-0.201
ich-0.200
 neighborhood-0.199
 living-0.191
Tokenfulness
Token ID15538
Feature activation-1.67e+01

Pos logit contributions
war+0.170
umbs+0.168
le+0.164
of+0.161
forts+0.160
Neg logit contributions
 playing-0.156
 living-0.136
 office-0.134
 neighborhood-0.128
 exploring-0.125
Token is
Token ID318
Feature activation+2.19e+00

Pos logit contributions
war+1.254
le+1.244
umbs+1.224
of+1.221
bs+1.217
Neg logit contributions
 playing-0.958
 office-0.844
 living-0.826
 neighborhood-0.826
ich-0.818
Token a
Token ID257
Feature activation+1.98e-01

Pos logit contributions
 playing+0.125
 office+0.113
 neighborhood+0.112
ich+0.112
 living+0.107
Neg logit contributions
le-0.159
war-0.158
bs-0.155
of-0.153
umbs-0.152
Token beautiful
Token ID4950
Feature activation-2.28e-01

Pos logit contributions
 playing+0.013
ich+0.012
 office+0.012
 neighborhood+0.011
 living+0.011
Neg logit contributions
war-0.013
le-0.012
bs-0.012
forts-0.012
of-0.012
Token thing
Token ID1517
Feature activation+8.33e-01

Pos logit contributions
le+0.017
war+0.016
of+0.016
bs+0.016
umbs+0.016
Neg logit contributions
 playing-0.014
ich-0.012
 office-0.012
 living-0.012
 neighborhood-0.012
Token!
Token ID0
Feature activation+2.96e+00

Pos logit contributions
 playing+0.051
 office+0.047
 neighborhood+0.046
ich+0.046
 living+0.044
Neg logit contributions
le-0.058
war-0.057
of-0.056
bs-0.056
umbs-0.055
Token the
Token ID262
Feature activation-5.80e-01

Pos logit contributions
war+0.572
le+0.561
umbs+0.560
bs+0.553
of+0.549
Neg logit contributions
 playing-0.445
 office-0.389
 living-0.387
 neighborhood-0.376
ich-0.370
Token story
Token ID1621
Feature activation-5.37e+00

Pos logit contributions
war+0.044
le+0.043
umbs+0.043
bs+0.042
of+0.042
Neg logit contributions
 playing-0.035
 office-0.030
 living-0.030
 neighborhood-0.029
ich-0.028
Token is
Token ID318
Feature activation+2.20e+00

Pos logit contributions
war+0.430
le+0.428
of+0.424
umbs+0.418
bs+0.418
Neg logit contributions
 playing-0.281
 office-0.239
ich-0.239
 neighborhood-0.236
 living-0.234
Token that
Token ID326
Feature activation+1.11e+00

Pos logit contributions
 playing+0.132
 neighborhood+0.121
 office+0.120
ich+0.119
 living+0.114
Neg logit contributions
bs-0.149
of-0.148
le-0.146
war-0.145
forts-0.141
Token grace
Token ID11542
Feature activation-6.18e+00

Pos logit contributions
 playing+0.071
 office+0.068
ich+0.067
 neighborhood+0.067
 living+0.064
Neg logit contributions
le-0.070
war-0.070
of-0.069
bs-0.069
fulness-0.066
Tokenfulness
Token ID15538
Feature activation-1.67e+01

Pos logit contributions
war+0.308
umbs+0.298
le+0.291
bread+0.286
of+0.284
Neg logit contributions
 playing-0.519
 living-0.472
 office-0.466
 neighborhood-0.457
ich-0.452
Token is
Token ID318
Feature activation+2.20e+00

Pos logit contributions
war+1.257
le+1.247
of+1.227
umbs+1.226
bs+1.221
Neg logit contributions
 playing-0.950
 office-0.838
 neighborhood-0.819
 living-0.818
ich-0.812
Token better
Token ID1365
Feature activation-6.79e+00

Pos logit contributions
 playing+0.122
 office+0.106
 living+0.105
 neighborhood+0.104
ich+0.102
Neg logit contributions
war-0.171
le-0.169
umbs-0.167
of-0.166
bs-0.165
Token than
Token ID621
Feature activation+3.23e+00

Pos logit contributions
le+0.397
bs+0.391
war+0.390
of+0.378
forts+0.371
Neg logit contributions
 playing-0.491
 office-0.451
ich-0.445
 neighborhood-0.441
 living-0.437
Token winning
Token ID5442
Feature activation+8.28e+00

Pos logit contributions
 playing+0.146
 office+0.128
 neighborhood+0.124
 living+0.123
ich+0.122
Neg logit contributions
war-0.279
le-0.276
of-0.274
bs-0.272
umbs-0.272
Token all
Token ID477
Feature activation+2.50e+00

Pos logit contributions
 playing+0.515
 office+0.465
 living+0.455
 neighborhood+0.453
ich+0.453
Neg logit contributions
war-0.579
le-0.573
of-0.562
bs-0.559
umbs-0.559
Token a
Token ID257
Feature activation+1.98e-01

Pos logit contributions
 playing+0.246
 office+0.218
 living+0.215
 neighborhood+0.214
ich+0.211
Neg logit contributions
war-0.281
le-0.278
umbs-0.274
of-0.272
bs-0.271
Token little
Token ID1310
Feature activation-1.78e+00

Pos logit contributions
 playing+0.012
 office+0.011
ich+0.011
 neighborhood+0.011
 living+0.010
Neg logit contributions
le-0.013
war-0.013
bs-0.013
of-0.013
forts-0.013
Token kindness
Token ID23887
Feature activation-6.74e+00

Pos logit contributions
le+0.116
war+0.114
bs+0.112
of+0.111
umbs+0.111
Neg logit contributions
 playing-0.118
 office-0.106
ich-0.106
 neighborhood-0.105
 living-0.104
Token and
Token ID290
Feature activation-3.54e+00

Pos logit contributions
war+0.458
le+0.452
umbs+0.446
of+0.442
bs+0.440
Neg logit contributions
 playing-0.440
 office-0.391
 living-0.388
 neighborhood-0.384
ich-0.378
Token thought
Token ID1807
Feature activation-2.39e+00

Pos logit contributions
war+0.221
le+0.217
umbs+0.215
of+0.212
bs+0.210
Neg logit contributions
 playing-0.253
 office-0.227
 living-0.225
 neighborhood-0.223
ich-0.219
Tokenfulness
Token ID15538
Feature activation-1.67e+01

Pos logit contributions
war+0.162
umbs+0.159
le+0.157
of+0.153
forts+0.153
Neg logit contributions
 playing-0.162
 living-0.143
 office-0.141
 neighborhood-0.136
ich-0.133
Token goes
Token ID2925
Feature activation-1.36e+00

Pos logit contributions
war+1.210
le+1.207
of+1.187
bs+1.184
umbs+1.178
Neg logit contributions
 playing-0.981
 office-0.876
 neighborhood-0.859
ich-0.857
 living-0.841
Token a
Token ID257
Feature activation+2.19e-01

Pos logit contributions
le+0.098
war+0.097
bs+0.095
of+0.093
les+0.092
Neg logit contributions
 playing-0.082
 office-0.075
 neighborhood-0.073
ich-0.073
 living-0.073
Token long
Token ID890
Feature activation-7.34e+00

Pos logit contributions
 playing+0.015
 living+0.013
 office+0.013
 neighborhood+0.013
ich+0.013
Neg logit contributions
war-0.015
le-0.014
umbs-0.014
bs-0.014
of-0.014
Token way
Token ID835
Feature activation+9.20e-01

Pos logit contributions
war+0.434
umbs+0.428
le+0.422
bs+0.413
of+0.411
Neg logit contributions
 playing-0.550
 office-0.497
 living-0.492
 neighborhood-0.488
ich-0.470
Token.
Token ID13
Feature activation-1.02e+00

Pos logit contributions
 playing+0.058
 office+0.052
 neighborhood+0.051
 living+0.050
ich+0.050
Neg logit contributions
war-0.064
le-0.063
of-0.062
bs-0.062
umbs-0.062
Token the
Token ID262
Feature activation-5.77e-01

Pos logit contributions
 playing+0.171
 living+0.150
 office+0.148
 neighborhood+0.146
ich+0.141
Neg logit contributions
war-0.169
umbs-0.167
le-0.165
of-0.163
bs-0.162
Token time
Token ID640
Feature activation-6.58e+00

Pos logit contributions
war+0.032
le+0.032
of+0.031
umbs+0.030
bs+0.030
Neg logit contributions
 playing-0.044
 office-0.041
ich-0.040
 living-0.040
 neighborhood-0.040
Token.
Token ID13
Feature activation-1.05e+00

Pos logit contributions
war+0.461
le+0.457
umbs+0.449
of+0.448
bs+0.446
Neg logit contributions
 playing-0.408
 office-0.365
 living-0.359
 neighborhood-0.358
ich-0.354
Token With
Token ID2080
Feature activation+2.04e+00

Pos logit contributions
war+0.082
le+0.081
umbs+0.079
bs+0.079
of+0.079
Neg logit contributions
 playing-0.058
 office-0.051
 living-0.050
 neighborhood-0.050
ich-0.049
Token grace
Token ID11542
Feature activation-6.15e+00

Pos logit contributions
 playing+0.131
 living+0.112
 office+0.107
 exploring+0.103
 neighborhood+0.102
Neg logit contributions
war-0.155
umbs-0.155
le-0.150
bread-0.145
of-0.143
Tokenfulness
Token ID15538
Feature activation-1.67e+01

Pos logit contributions
war+0.352
umbs+0.346
bread+0.327
le+0.326
forts+0.317
Neg logit contributions
 playing-0.479
 living-0.427
 office-0.414
ich-0.402
 exploring-0.398
Token,
Token ID11
Feature activation+2.72e+00

Pos logit contributions
war+1.371
le+1.359
umbs+1.341
of+1.336
bs+1.331
Neg logit contributions
 playing-0.844
 office-0.728
 living-0.715
 neighborhood-0.710
ich-0.698
Token everyone
Token ID2506
Feature activation+3.65e+00

Pos logit contributions
 playing+0.155
 office+0.135
 living+0.134
 neighborhood+0.132
ich+0.130
Neg logit contributions
war-0.207
le-0.205
umbs-0.202
of-0.201
bs-0.200
Token can
Token ID460
Feature activation-3.51e+00

Pos logit contributions
 playing+0.238
 office+0.216
 neighborhood+0.210
ich+0.209
 living+0.208
Neg logit contributions
le-0.244
war-0.243
bs-0.237
of-0.237
umbs-0.235
Token be
Token ID307
Feature activation+2.94e+00

Pos logit contributions
war+0.275
le+0.268
umbs+0.268
bs+0.264
of+0.262
Neg logit contributions
 playing-0.197
 office-0.170
 living-0.168
 neighborhood-0.167
ich-0.163
Token happy
Token ID3772
Feature activation-1.43e+00

Pos logit contributions
 playing+0.166
 office+0.150
ich+0.148
 living+0.147
 neighborhood+0.145
Neg logit contributions
war-0.217
le-0.217
bs-0.213
umbs-0.211
of-0.209
Token pencil
Token ID21613
Feature activation-1.53e+00

Pos logit contributions
 playing+0.068
ich+0.059
 neighborhood+0.056
 living+0.056
 office+0.055
Neg logit contributions
bs-0.089
of-0.088
war-0.087
forts-0.083
bread-0.083
Token box
Token ID3091
Feature activation-3.09e-01

Pos logit contributions
war+0.114
forts+0.109
le+0.109
umbs+0.108
bs+0.107
Neg logit contributions
 playing-0.093
 living-0.082
 office-0.082
ich-0.078
 neighborhood-0.078
Token.
Token ID13
Feature activation-1.02e+00

Pos logit contributions
war+0.022
le+0.022
umbs+0.022
bs+0.021
of+0.021
Neg logit contributions
 playing-0.019
 office-0.017
 living-0.017
 neighborhood-0.017
ich-0.016
Token With
Token ID2080
Feature activation+2.08e+00

Pos logit contributions
war+0.071
umbs+0.070
of+0.069
bs+0.068
le+0.067
Neg logit contributions
 playing-0.065
ich-0.059
 office-0.058
 living-0.057
 neighborhood-0.057
Token fear
Token ID3252
Feature activation-6.57e+00

Pos logit contributions
 playing+0.109
 living+0.088
 office+0.082
 exploring+0.081
 walking+0.081
Neg logit contributions
umbs-0.181
war-0.180
le-0.175
bread-0.173
forts-0.169
Tokenfulness
Token ID15538
Feature activation-1.67e+01

Pos logit contributions
war+0.615
umbs+0.606
le+0.593
of+0.584
forts+0.582
Neg logit contributions
 playing-0.282
 living-0.226
 office-0.211
 neighborhood-0.201
 exploring-0.201
Token,
Token ID11
Feature activation+2.73e+00

Pos logit contributions
war+1.466
umbs+1.439
le+1.430
of+1.405
bs+1.398
Neg logit contributions
 playing-0.791
 living-0.656
 office-0.650
 neighborhood-0.630
ich-0.606
Token he
Token ID339
Feature activation+7.32e-01

Pos logit contributions
 playing+0.171
 office+0.156
 neighborhood+0.155
ich+0.154
 living+0.153
Neg logit contributions
le-0.183
war-0.183
of-0.180
bs-0.180
umbs-0.177
Token launched
Token ID5611
Feature activation-5.30e-01

Pos logit contributions
 playing+0.042
 office+0.036
 living+0.036
 neighborhood+0.035
ich+0.035
Neg logit contributions
war-0.057
umbs-0.056
le-0.055
of-0.054
bs-0.054
Token it
Token ID340
Feature activation-6.38e+00

Pos logit contributions
umbs+0.043
war+0.042
forts+0.042
bread+0.041
bs+0.040
Neg logit contributions
 playing-0.030
 office-0.025
 living-0.024
ich-0.024
 exploring-0.024
Token out
Token ID503
Feature activation+1.27e+00

Pos logit contributions
umbs+0.484
war+0.481
le+0.471
forts+0.469
bs+0.465
Neg logit contributions
 playing-0.379
 living-0.326
 office-0.318
 neighborhood-0.303
 exploring-0.302
Token wanted
Token ID2227
Feature activation+3.13e+00

Pos logit contributions
 playing+0.073
 office+0.064
 living+0.063
 neighborhood+0.062
ich+0.061
Neg logit contributions
war-0.108
le-0.106
umbs-0.106
of-0.104
bs-0.104
Token to
Token ID284
Feature activation-7.00e-01

Pos logit contributions
 playing+0.182
 living+0.157
 office+0.157
 neighborhood+0.149
 exploring+0.142
Neg logit contributions
war-0.244
umbs-0.243
le-0.241
bs-0.237
forts-0.236
Token show
Token ID905
Feature activation+1.71e+00

Pos logit contributions
war+0.049
umbs+0.047
le+0.047
bs+0.046
of+0.046
Neg logit contributions
 playing-0.046
 office-0.041
 living-0.040
 neighborhood-0.039
ich-0.038
Token his
Token ID465
Feature activation+1.24e+00

Pos logit contributions
 playing+0.104
 office+0.091
 living+0.089
 neighborhood+0.086
ich+0.083
Neg logit contributions
war-0.128
umbs-0.127
le-0.124
bs-0.123
of-0.123
Token thank
Token ID5875
Feature activation-6.14e+00

Pos logit contributions
 playing+0.075
 office+0.066
 living+0.065
 neighborhood+0.065
ich+0.064
Neg logit contributions
war-0.090
le-0.089
umbs-0.088
of-0.087
bs-0.087
Tokenfulness
Token ID15538
Feature activation-1.66e+01

Pos logit contributions
war+0.492
le+0.488
umbs+0.480
of+0.478
bs+0.476
Neg logit contributions
 playing-0.328
 office-0.284
 living-0.281
 neighborhood-0.278
ich-0.273
Token to
Token ID284
Feature activation-7.10e-01

Pos logit contributions
war+1.199
umbs+1.172
le+1.164
bs+1.147
of+1.145
Neg logit contributions
 playing-1.047
 office-0.927
 living-0.920
 neighborhood-0.894
ich-0.875
Token his
Token ID465
Feature activation+1.23e+00

Pos logit contributions
le+0.049
war+0.049
of+0.048
umbs+0.047
bs+0.047
Neg logit contributions
 playing-0.044
 office-0.040
 neighborhood-0.040
ich-0.040
 living-0.039
Token family
Token ID1641
Feature activation+4.85e+00

Pos logit contributions
 playing+0.060
 living+0.051
 neighborhood+0.051
ich+0.051
 office+0.050
Neg logit contributions
le-0.099
war-0.098
bread-0.097
bs-0.096
of-0.096
Token.
Token ID13
Feature activation-1.04e+00

Pos logit contributions
 playing+0.308
 office+0.272
 living+0.271
 neighborhood+0.261
ich+0.254
Neg logit contributions
war-0.346
umbs-0.337
le-0.336
of-0.330
bs-0.330
Token So
Token ID1406
Feature activation-3.84e+00

Pos logit contributions
war+0.069
umbs+0.067
of+0.066
le+0.065
forts+0.065
Neg logit contributions
 playing-0.068
ich-0.063
 office-0.060
 living-0.060
 neighborhood-0.058
Token understanding
Token ID4547
Feature activation-2.96e+00

Pos logit contributions
 playing+0.068
ich+0.064
 neighborhood+0.062
 office+0.062
 living+0.060
Neg logit contributions
le-0.075
war-0.075
of-0.074
bs-0.073
fulness-0.070
Token only
Token ID691
Feature activation+1.97e+00

Pos logit contributions
war+0.217
umbs+0.215
le+0.212
of+0.204
bs+0.204
Neg logit contributions
 playing-0.183
 living-0.163
 office-0.161
 neighborhood-0.156
ich-0.148
Token came
Token ID1625
Feature activation-1.18e-01

Pos logit contributions
 playing+0.115
ich+0.105
 neighborhood+0.103
 office+0.103
 living+0.099
Neg logit contributions
le-0.142
war-0.142
of-0.139
bs-0.139
umbs-0.134
Token with
Token ID351
Feature activation+5.55e+00

Pos logit contributions
war+0.008
le+0.008
of+0.008
bs+0.008
umbs+0.008
Neg logit contributions
 playing-0.007
 office-0.006
 living-0.006
 neighborhood-0.006
ich-0.006
Token cheer
Token ID14042
Feature activation-4.71e+00

Pos logit contributions
 playing+0.337
 living+0.292
 office+0.287
 neighborhood+0.275
 exploring+0.265
Neg logit contributions
war-0.426
umbs-0.424
le-0.413
of-0.403
bread-0.402
Tokenfulness
Token ID15538
Feature activation-1.66e+01

Pos logit contributions
war+0.329
umbs+0.322
le+0.315
bs+0.311
of+0.311
Neg logit contributions
 playing-0.304
 living-0.268
 office-0.268
 neighborhood-0.259
ich-0.256
Token.
Token ID13
Feature activation-1.02e+00

Pos logit contributions
war+1.230
umbs+1.202
le+1.198
of+1.173
bs+1.166
Neg logit contributions
 playing-1.011
 living-0.888
 office-0.880
 neighborhood-0.858
ich-0.837
Token<|endoftext|>
Token ID50256
Feature activation-6.41e-01

Pos logit contributions
war+0.073
umbs+0.073
of+0.073
bs+0.072
le+0.070
Neg logit contributions
 playing-0.062
 office-0.055
ich-0.054
 living-0.053
 neighborhood-0.053
TokenOnce
Token ID7454
Feature activation-2.03e+00

Pos logit contributions
war+0.049
umbs+0.049
of+0.048
le+0.048
les+0.046
Neg logit contributions
 playing-0.035
 office-0.032
ich-0.032
 neighborhood-0.030
 living-0.030
Token upon
Token ID2402
Feature activation-4.23e-02

Pos logit contributions
war+0.151
le+0.150
umbs+0.148
of+0.147
bs+0.147
Neg logit contributions
 playing-0.118
 office-0.104
 living-0.102
 neighborhood-0.102
ich-0.101
Token a
Token ID257
Feature activation+2.37e-01

Pos logit contributions
war+0.004
le+0.004
umbs+0.004
bs+0.004
forts+0.004
Neg logit contributions
 playing-0.002
 living-0.002
 office-0.002
 neighborhood-0.002
ich-0.002
Token glad
Token ID9675
Feature activation+3.68e+00

Pos logit contributions
 playing+0.158
 office+0.149
 neighborhood+0.143
 living+0.143
ich+0.142
Neg logit contributions
war-0.208
bs-0.207
le-0.204
umbs-0.204
of-0.203
Token he
Token ID339
Feature activation+7.70e-01

Pos logit contributions
 playing+0.230
 office+0.211
 neighborhood+0.208
 living+0.206
ich+0.204
Neg logit contributions
le-0.250
war-0.249
of-0.246
bs-0.243
umbs-0.242
Token overcame
Token ID49490
Feature activation-7.21e-01

Pos logit contributions
 office+0.046
 playing+0.046
 neighborhood+0.043
ich+0.043
 living+0.043
Neg logit contributions
le-0.057
of-0.054
war-0.053
les-0.053
bs-0.051
Token his
Token ID465
Feature activation+1.25e+00

Pos logit contributions
umbs+0.057
war+0.056
bs+0.055
le+0.054
bread+0.054
Neg logit contributions
 playing-0.043
 office-0.036
 living-0.036
 neighborhood-0.034
 exploring-0.033
Token fear
Token ID3252
Feature activation-6.55e+00

Pos logit contributions
 playing+0.091
 office+0.080
 neighborhood+0.080
 living+0.080
ich+0.077
Neg logit contributions
war-0.079
umbs-0.078
le-0.078
of-0.076
bs-0.075
Tokenfulness
Token ID15538
Feature activation-1.66e+01

Pos logit contributions
war+0.492
umbs+0.471
le+0.467
bs+0.462
forts+0.460
Neg logit contributions
 playing-0.409
 living-0.356
 office-0.342
 neighborhood-0.331
 exploring-0.324
Token.
Token ID13
Feature activation-1.02e+00

Pos logit contributions
war+1.108
le+1.098
umbs+1.079
of+1.072
bs+1.065
Neg logit contributions
 playing-1.108
 office-0.991
 living-0.979
 neighborhood-0.973
ich-0.963
Token They
Token ID1119
Feature activation-3.09e+00

Pos logit contributions
war+0.067
umbs+0.065
of+0.064
forts+0.063
bs+0.063
Neg logit contributions
 playing-0.069
ich-0.063
 office-0.062
 living-0.060
 neighborhood-0.058
Token all
Token ID477
Feature activation+2.54e+00

Pos logit contributions
war+0.245
le+0.241
umbs+0.240
of+0.234
bs+0.234
Neg logit contributions
 playing-0.171
 living-0.145
 office-0.144
 neighborhood-0.143
ich-0.138
Token played
Token ID2826
Feature activation+4.94e+00

Pos logit contributions
 playing+0.143
 living+0.124
 office+0.122
 neighborhood+0.120
ich+0.116
Neg logit contributions
war-0.197
umbs-0.194
le-0.193
of-0.189
bs-0.188
Token and
Token ID290
Feature activation-3.52e+00

Pos logit contributions
 playing+0.317
 office+0.290
ich+0.285
 neighborhood+0.282
 living+0.282
Neg logit contributions
war-0.336
le-0.332
of-0.321
umbs-0.321
bs-0.321
Token place
Token ID1295
Feature activation+1.66e+00

Pos logit contributions
le+0.102
bs+0.100
bread+0.099
of+0.098
umbs+0.097
Neg logit contributions
 playing-0.076
 office-0.068
ich-0.067
 neighborhood-0.063
 living-0.063
Token,
Token ID11
Feature activation+2.75e+00

Pos logit contributions
 playing+0.105
 neighborhood+0.097
ich+0.096
 office+0.095
 living+0.092
Neg logit contributions
war-0.109
le-0.109
bs-0.107
of-0.106
umbs-0.106
Token full
Token ID1336
Feature activation+1.40e+00

Pos logit contributions
 playing+0.169
 office+0.164
 neighborhood+0.163
ich+0.163
 living+0.155
Neg logit contributions
le-0.175
war-0.172
umbs-0.171
of-0.170
bs-0.169
Token of
Token ID286
Feature activation-7.48e+00

Pos logit contributions
 playing+0.120
 living+0.111
 office+0.107
 neighborhood+0.105
 exploring+0.101
Neg logit contributions
war-0.075
le-0.071
of-0.067
bs-0.067
umbs-0.066
Token cheer
Token ID14042
Feature activation-4.66e+00

Pos logit contributions
le+0.488
war+0.484
bs+0.481
umbs+0.473
of+0.471
Neg logit contributions
 playing-0.465
ich-0.441
 office-0.430
 neighborhood-0.414
 living-0.406
Tokenfulness
Token ID15538
Feature activation-1.66e+01

Pos logit contributions
war+0.313
le+0.310
umbs+0.305
of+0.303
bs+0.301
Neg logit contributions
 playing-0.309
 office-0.276
 living-0.273
 neighborhood-0.271
ich-0.267
Token.
Token ID13
Feature activation-9.80e-01

Pos logit contributions
war+1.150
le+1.125
umbs+1.115
of+1.106
bs+1.095
Neg logit contributions
 playing-1.089
 living-0.962
 office-0.959
 neighborhood-0.941
ich-0.924
Token
Token ID220
Feature activation-3.32e+00

Pos logit contributions
war+0.056
umbs+0.055
of+0.054
le+0.053
bs+0.052
Neg logit contributions
 playing-0.072
ich-0.069
 office-0.067
 living-0.065
 neighborhood-0.064
Token\n
Token ID198
Feature activation+8.63e-02

Pos logit contributions
war+0.261
le+0.260
umbs+0.254
of+0.253
bs+0.252
Neg logit contributions
 playing-0.181
 office-0.156
 living-0.156
 neighborhood-0.154
ich-0.149
Token\n
Token ID198
Feature activation+9.27e-02

Pos logit contributions
 playing+0.005
 office+0.004
 living+0.004
ich+0.004
 neighborhood+0.004
Neg logit contributions
war-0.006
le-0.006
of-0.006
umbs-0.006
bs-0.006
TokenIn
Token ID818
Feature activation-1.17e+00

Pos logit contributions
 playing+0.008
 office+0.007
ich+0.007
 living+0.007
 neighborhood+0.007
Neg logit contributions
war-0.005
of-0.005
le-0.004
umbs-0.004
bs-0.004
Token play
Token ID711
Feature activation+4.04e+00

Pos logit contributions
war+0.053
le+0.052
umbs+0.051
of+0.051
bs+0.051
Neg logit contributions
 playing-0.047
 office-0.041
 living-0.041
 neighborhood-0.041
ich-0.040
Token a
Token ID257
Feature activation+1.99e-01

Pos logit contributions
ich+0.230
 playing+0.230
 neighborhood+0.217
 office+0.217
 living+0.212
Neg logit contributions
le-0.293
war-0.284
bs-0.270
of-0.265
les-0.264
Token game
Token ID983
Feature activation+6.28e-01

Pos logit contributions
 playing+0.012
 office+0.010
 neighborhood+0.010
ich+0.010
 living+0.010
Neg logit contributions
war-0.015
le-0.014
of-0.014
bs-0.014
bread-0.014
Token called
Token ID1444
Feature activation+4.20e+00

Pos logit contributions
 playing+0.042
ich+0.040
 neighborhood+0.039
 office+0.039
 living+0.038
Neg logit contributions
le-0.039
war-0.039
umbs-0.037
of-0.037
bs-0.037
Token "
Token ID366
Feature activation+4.52e+00

Pos logit contributions
 playing+0.234
 neighborhood+0.214
ich+0.211
 office+0.211
 living+0.206
Neg logit contributions
war-0.316
le-0.306
of-0.305
bs-0.305
umbs-0.304
Tokenwar
Token ID5767
Feature activation-1.65e+01

Pos logit contributions
 playing+0.316
 office+0.281
 neighborhood+0.279
 living+0.278
ich+0.277
Neg logit contributions
war-0.278
le-0.278
bs-0.274
forts-0.271
umbs-0.270
Token".
Token ID1911
Feature activation-1.57e+00

Pos logit contributions
war+0.977
le+0.969
umbs+0.955
of+0.907
bs+0.898
Neg logit contributions
 playing-1.192
 neighborhood-1.117
 office-1.098
 living-1.082
ich-1.064
Token This
Token ID770
Feature activation+1.84e+00

Pos logit contributions
war+0.105
le+0.104
umbs+0.103
of+0.102
bs+0.101
Neg logit contributions
 playing-0.105
 office-0.093
 living-0.092
 neighborhood-0.092
ich-0.090
Token game
Token ID983
Feature activation+6.41e-01

Pos logit contributions
 playing+0.111
ich+0.102
 office+0.099
 neighborhood+0.097
 living+0.096
Neg logit contributions
war-0.131
le-0.129
of-0.126
umbs-0.125
bs-0.124
Token was
Token ID373
Feature activation+5.06e+00

Pos logit contributions
ich+0.030
 playing+0.030
 office+0.027
 neighborhood+0.027
 living+0.026
Neg logit contributions
le-0.051
war-0.050
les-0.049
of-0.048
bs-0.048
Token just
Token ID655
Feature activation+1.21e+00

Pos logit contributions
 playing+0.293
ich+0.277
 neighborhood+0.276
 office+0.271
 living+0.261
Neg logit contributions
war-0.366
bs-0.354
le-0.351
of-0.348
umbs-0.340
Token and
Token ID290
Feature activation-3.55e+00

Pos logit contributions
 playing+0.108
 living+0.089
 neighborhood+0.079
 office+0.078
 exploring+0.076
Neg logit contributions
umbs-0.142
forts-0.137
bread-0.135
bs-0.135
war-0.135
Token tug
Token ID27762
Feature activation-5.13e+00

Pos logit contributions
war+0.261
le+0.255
bs+0.255
of+0.253
umbs+0.249
Neg logit contributions
 playing-0.204
 office-0.192
ich-0.186
 neighborhood-0.185
 living-0.181
Token-
Token ID12
Feature activation-1.20e+00

Pos logit contributions
umbs+0.412
war+0.393
forts+0.393
bread+0.391
of+0.382
Neg logit contributions
 playing-0.312
 living-0.241
 neighborhood-0.215
 doing-0.214
ich-0.209
Tokenof
Token ID1659
Feature activation-1.63e+01

Pos logit contributions
umbs+0.039
le+0.036
rily+0.035
fulness+0.033
bread+0.033
Neg logit contributions
 playing-0.114
 living-0.104
ich-0.102
gy-0.100
 office-0.100
Token-
Token ID12
Feature activation-1.22e+00

Pos logit contributions
forts+0.928
bread+0.925
umbs+0.916
war+0.869
uits+0.844
Neg logit contributions
 playing-1.237
 living-1.088
ich-0.990
 office-0.961
 neighborhood-0.943
Tokenwar
Token ID5767
Feature activation-1.65e+01

Pos logit contributions
umbs+0.039
le+0.036
rily+0.035
bread+0.033
fulness+0.033
Neg logit contributions
 playing-0.115
 living-0.106
ich-0.104
gy-0.102
 office-0.101
Token.
Token ID13
Feature activation-1.06e+00

Pos logit contributions
war+1.297
forts+1.279
of+1.266
umbs+1.264
bs+1.249
Neg logit contributions
 playing-0.936
 living-0.803
 office-0.744
 neighborhood-0.703
ich-0.701
Token The
Token ID383
Feature activation-6.19e-02

Pos logit contributions
le+0.086
war+0.082
of+0.081
les+0.081
umbs+0.079
Neg logit contributions
 playing-0.062
 living-0.054
 neighborhood-0.052
 office-0.049
 exploring-0.047
Token fox
Token ID21831
Feature activation+1.02e+00

Pos logit contributions
le+0.005
war+0.004
of+0.004
les+0.004
bread+0.004
Neg logit contributions
 playing-0.004
 office-0.003
 neighborhood-0.003
 living-0.003
ich-0.003
Token was
Token ID373
Feature activation+5.06e+00

Pos logit contributions
 playing+0.059
 office+0.053
 neighborhood+0.052
ich+0.052
 living+0.052
Neg logit contributions
le-0.076
war-0.075
of-0.074
bs-0.074
umbs-0.073
Token happy
Token ID3772
Feature activation-1.47e+00

Pos logit contributions
 playing+0.260
 office+0.258
 neighborhood+0.256
ich+0.243
 factory+0.241
Neg logit contributions
war-0.374
le-0.372
umbs-0.367
bs-0.366
of-0.351
Token a
Token ID257
Feature activation+2.04e-01

Pos logit contributions
 playing+0.101
 living+0.088
 office+0.087
 neighborhood+0.085
ich+0.081
Neg logit contributions
war-0.124
le-0.122
umbs-0.121
of-0.120
bs-0.119
Token tug
Token ID27762
Feature activation-5.12e+00

Pos logit contributions
 playing+0.013
 office+0.011
 living+0.011
 neighborhood+0.011
ich+0.011
Neg logit contributions
war-0.015
le-0.014
umbs-0.014
of-0.014
bs-0.014
Token-
Token ID12
Feature activation-1.22e+00

Pos logit contributions
war+0.399
forts+0.398
umbs+0.394
of+0.383
bs+0.381
Neg logit contributions
 playing-0.307
 living-0.251
 office-0.244
ich-0.227
 exploring-0.225
Tokenof
Token ID1659
Feature activation-1.63e+01

Pos logit contributions
uits+0.057
lights+0.057
umbs+0.056
rily+0.049
oons+0.048
Neg logit contributions
 playing-0.089
 living-0.084
ich-0.081
 office-0.080
 loved-0.076
Token-
Token ID12
Feature activation-1.21e+00

Pos logit contributions
forts+1.233
war+1.208
umbs+1.175
bs+1.142
ws+1.138
Neg logit contributions
 playing-0.966
 living-0.804
 office-0.772
ich-0.760
 neighborhood-0.688
Tokenwar
Token ID5767
Feature activation-1.65e+01

Pos logit contributions
uits+0.056
umbs+0.056
lights+0.055
oons+0.048
rily+0.048
Neg logit contributions
 playing-0.089
 living-0.084
ich-0.081
 office-0.080
 loved-0.077
Token game
Token ID983
Feature activation+6.43e-01

Pos logit contributions
war+1.204
bs+1.175
forts+1.170
of+1.170
le+1.162
Neg logit contributions
 playing-1.018
 living-0.887
 office-0.881
 neighborhood-0.848
ich-0.837
Token.
Token ID13
Feature activation-1.06e+00

Pos logit contributions
 playing+0.041
 living+0.036
 office+0.036
 neighborhood+0.034
ich+0.033
Neg logit contributions
war-0.045
le-0.045
umbs-0.044
bs-0.044
of-0.044
Token They
Token ID1119
Feature activation-3.14e+00

Pos logit contributions
war+0.076
umbs+0.075
of+0.074
le+0.074
bs+0.073
Neg logit contributions
 playing-0.065
 office-0.058
 living-0.057
ich-0.057
 neighborhood-0.057
Token had
Token ID550
Feature activation+1.28e+00

Pos logit contributions
war+0.246
umbs+0.242
le+0.240
forts+0.233
of+0.233
Neg logit contributions
 playing-0.181
 living-0.154
 neighborhood-0.149
 office-0.149
ich-0.142
Token lots
Token ID6041
Feature activation-1.34e+00

Pos logit contributions
 playing+0.065
 living+0.053
 office+0.051
 neighborhood+0.051
 exploring+0.048
Neg logit contributions
le-0.112
war-0.110
bs-0.108
umbs-0.108
of-0.107
Token of
Token ID286
Feature activation-7.43e+00

Pos logit contributions
 playing+0.338
 living+0.305
 office+0.299
 neighborhood+0.292
 exploring+0.282
Neg logit contributions
war-0.244
umbs-0.232
le-0.225
of-0.219
bs-0.218
Token her
Token ID607
Feature activation+2.88e-01

Pos logit contributions
war+0.520
umbs+0.508
le+0.498
bs+0.494
forts+0.490
Neg logit contributions
 playing-0.495
 office-0.436
 living-0.433
 neighborhood-0.421
ich-0.410
Token daughter
Token ID4957
Feature activation-1.27e+00

Pos logit contributions
 playing+0.017
 office+0.015
 living+0.015
 neighborhood+0.015
ich+0.015
Neg logit contributions
war-0.021
le-0.021
umbs-0.020
of-0.020
bs-0.020
Token's
Token ID338
Feature activation-3.63e-02

Pos logit contributions
le+0.088
war+0.088
bs+0.086
of+0.086
umbs+0.086
Neg logit contributions
 playing-0.077
ich-0.070
 neighborhood-0.069
 office-0.069
 living-0.068
Token resource
Token ID8271
Feature activation-2.95e-01

Pos logit contributions
war+0.002
le+0.002
umbs+0.002
of+0.002
bs+0.002
Neg logit contributions
 playing-0.003
 office-0.002
 living-0.002
 neighborhood-0.002
ich-0.002
Tokenfulness
Token ID15538
Feature activation-1.65e+01

Pos logit contributions
war+0.015
umbs+0.014
le+0.014
of+0.014
bs+0.014
Neg logit contributions
 playing-0.025
 office-0.023
 living-0.023
ich-0.022
 exploring-0.021
Token and
Token ID290
Feature activation-3.40e+00

Pos logit contributions
war+1.251
umbs+1.204
le+1.184
of+1.177
bs+1.166
Neg logit contributions
 playing-1.018
 living-0.896
 office-0.875
 neighborhood-0.853
ich-0.820
Token smiled
Token ID13541
Feature activation-2.74e+00

Pos logit contributions
le+0.222
war+0.222
bs+0.220
of+0.218
forts+0.212
Neg logit contributions
 playing-0.220
 office-0.199
ich-0.199
 neighborhood-0.198
 living-0.195
Token.
Token ID13
Feature activation-9.06e-01

Pos logit contributions
le+0.171
war+0.162
bs+0.162
of+0.161
umbs+0.158
Neg logit contributions
 playing-0.191
 office-0.176
ich-0.173
 neighborhood-0.173
 living-0.170
Token She
Token ID1375
Feature activation+1.71e+00

Pos logit contributions
war+0.062
umbs+0.061
of+0.061
bs+0.060
le+0.060
Neg logit contributions
 playing-0.057
ich-0.051
 office-0.050
 living-0.050
 neighborhood-0.048
Token grate
Token ID48612
Feature activation-6.15e+00

Pos logit contributions
 playing+0.121
 office+0.109
 living+0.108
 neighborhood+0.107
ich+0.106
Neg logit contributions
war-0.107
le-0.105
umbs-0.104
of-0.103
bs-0.102
Token play
Token ID711
Feature activation+4.06e+00

Pos logit contributions
le+0.012
war+0.012
of+0.012
bs+0.012
umbs+0.012
Neg logit contributions
 playing-0.011
 office-0.010
ich-0.010
 living-0.010
 neighborhood-0.010
Token tug
Token ID27762
Feature activation-5.12e+00

Pos logit contributions
 playing+0.238
 office+0.217
 neighborhood+0.214
ich+0.212
 living+0.211
Neg logit contributions
le-0.300
war-0.299
of-0.288
umbs-0.286
bs-0.285
Token-
Token ID12
Feature activation-1.22e+00

Pos logit contributions
umbs+0.426
war+0.415
forts+0.412
of+0.405
bread+0.404
Neg logit contributions
 playing-0.276
 living-0.219
ich-0.209
 office-0.209
 exploring-0.201
Tokenof
Token ID1659
Feature activation-1.63e+01

Pos logit contributions
umbs+0.033
bread+0.031
le+0.030
forts+0.029
rily+0.029
Neg logit contributions
 playing-0.120
 living-0.112
 office-0.109
ich-0.108
 neighborhood-0.105
Token-
Token ID12
Feature activation-1.21e+00

Pos logit contributions
forts+0.991
umbs+0.940
bread+0.940
war+0.934
uits+0.926
Neg logit contributions
 playing-1.153
 living-0.976
ich-0.956
 office-0.919
 neighborhood-0.870
Tokenwar
Token ID5767
Feature activation-1.65e+01

Pos logit contributions
umbs+0.032
bread+0.030
le+0.028
forts+0.028
 fade+0.027
Neg logit contributions
 playing-0.120
 living-0.111
 office-0.108
ich-0.108
 neighborhood-0.104
Token,"
Token ID553
Feature activation-2.50e+00

Pos logit contributions
war+1.286
of+1.263
umbs+1.250
forts+1.245
bs+1.244
Neg logit contributions
 playing-0.936
 living-0.789
 office-0.778
 neighborhood-0.739
ich-0.736
Token said
Token ID531
Feature activation-5.36e+00

Pos logit contributions
umbs+0.178
war+0.173
forts+0.164
le+0.163
of+0.163
Neg logit contributions
 playing-0.161
 living-0.143
 office-0.136
 neighborhood-0.132
ich-0.125
Token the
Token ID262
Feature activation-5.76e-01

Pos logit contributions
war+0.384
le+0.381
umbs+0.373
of+0.371
bs+0.370
Neg logit contributions
 playing-0.330
 office-0.293
 living-0.289
 neighborhood-0.288
ich-0.284
Token oct
Token ID19318
Feature activation+2.77e+00

Pos logit contributions
of+0.038
le+0.038
les+0.037
war+0.037
umbs+0.036
Neg logit contributions
 playing-0.036
 office-0.034
 neighborhood-0.034
 exploring-0.032
 living-0.031
Tokenopus
Token ID25790
Feature activation+3.42e+00

Pos logit contributions
 playing+0.169
 office+0.149
 neighborhood+0.147
 living+0.147
 backyard+0.142
Neg logit contributions
of-0.206
umbs-0.205
le-0.204
war-0.203
les-0.192
Token play
Token ID711
Feature activation+4.09e+00

Pos logit contributions
war+0.049
le+0.049
umbs+0.048
bs+0.047
of+0.047
Neg logit contributions
 playing-0.044
 office-0.038
 neighborhood-0.038
 living-0.037
ich-0.036
Token tug
Token ID27762
Feature activation-5.10e+00

Pos logit contributions
 playing+0.213
 neighborhood+0.196
 office+0.195
ich+0.194
 living+0.191
Neg logit contributions
le-0.324
war-0.322
bs-0.310
of-0.307
umbs-0.303
Token-
Token ID12
Feature activation-1.18e+00

Pos logit contributions
umbs+0.424
war+0.418
forts+0.412
of+0.406
bread+0.402
Neg logit contributions
 playing-0.276
 living-0.223
 office-0.219
ich-0.215
 neighborhood-0.210
Tokenof
Token ID1659
Feature activation-1.63e+01

Pos logit contributions
umbs+0.046
le+0.044
forts+0.044
bread+0.042
 sip+0.042
Neg logit contributions
 playing-0.106
 office-0.096
ich-0.096
 living-0.095
 neighborhood-0.091
Token-
Token ID12
Feature activation-1.18e+00

Pos logit contributions
forts+1.309
umbs+1.249
bread+1.234
war+1.220
rene+1.188
Neg logit contributions
 playing-0.910
 living-0.730
ich-0.687
 office-0.674
 neighborhood-0.629
Tokenwar
Token ID5767
Feature activation-1.65e+01

Pos logit contributions
umbs+0.046
forts+0.043
le+0.042
bread+0.041
 sip+0.040
Neg logit contributions
 playing-0.107
 office-0.098
ich-0.097
 living-0.097
 neighborhood-0.093
Token with
Token ID351
Feature activation+5.55e+00

Pos logit contributions
war+1.457
le+1.436
umbs+1.427
of+1.422
bs+1.414
Neg logit contributions
 playing-0.754
 office-0.622
 living-0.618
 neighborhood-0.596
ich-0.588
Token it
Token ID340
Feature activation-6.39e+00

Pos logit contributions
 playing+0.370
 living+0.301
 office+0.288
 neighborhood+0.273
 backyard+0.272
Neg logit contributions
umbs-0.416
forts-0.403
war-0.397
bread-0.394
le-0.385
Token.
Token ID13
Feature activation-1.04e+00

Pos logit contributions
war+0.465
umbs+0.462
le+0.457
of+0.451
bs+0.446
Neg logit contributions
 playing-0.396
 living-0.341
 office-0.339
 neighborhood-0.330
ich-0.321
Token
Token ID220
Feature activation-3.40e+00

Pos logit contributions
war+0.078
le+0.077
umbs+0.076
of+0.076
bs+0.075
Neg logit contributions
 playing-0.061
 office-0.054
 living-0.053
 neighborhood-0.052
ich-0.052
Token\n
Token ID198
Feature activation+2.86e-02

Pos logit contributions
war+0.277
le+0.276
umbs+0.268
bread+0.266
of+0.266
Neg logit contributions
 playing-0.176
 living-0.151
 office-0.149
 neighborhood-0.148
ich-0.143
Token play
Token ID711
Feature activation+4.10e+00

Pos logit contributions
war+0.009
le+0.009
of+0.009
bs+0.009
umbs+0.009
Neg logit contributions
 playing-0.010
 living-0.009
 office-0.009
ich-0.009
 neighborhood-0.009
Token tug
Token ID27762
Feature activation-5.08e+00

Pos logit contributions
 playing+0.251
 office+0.230
 neighborhood+0.229
 living+0.226
ich+0.224
Neg logit contributions
war-0.290
le-0.289
of-0.277
bs-0.275
umbs-0.275
Token-
Token ID12
Feature activation-1.18e+00

Pos logit contributions
umbs+0.416
war+0.408
forts+0.403
of+0.398
bread+0.397
Neg logit contributions
 playing-0.281
 living-0.226
ich-0.219
 office-0.216
 neighborhood-0.210
Tokenof
Token ID1659
Feature activation-1.63e+01

Pos logit contributions
umbs+0.037
le+0.033
forts+0.033
bread+0.033
 fade+0.032
Neg logit contributions
 playing-0.111
ich-0.104
 living-0.102
 office-0.102
gy-0.097
Token-
Token ID12
Feature activation-1.18e+00

Pos logit contributions
forts+0.935
bread+0.888
umbs+0.873
rene+0.857
uits+0.844
Neg logit contributions
 playing-1.225
 living-1.052
ich-1.036
 office-0.973
 neighborhood-0.937
Tokenwar
Token ID5767
Feature activation-1.65e+01

Pos logit contributions
umbs+0.037
bread+0.032
forts+0.032
le+0.032
 fade+0.031
Neg logit contributions
 playing-0.112
ich-0.105
 living-0.103
 office-0.103
gy-0.098
Token!"
Token ID2474
Feature activation+1.59e+00

Pos logit contributions
war+1.260
le+1.244
umbs+1.230
of+1.223
bs+1.217
Neg logit contributions
 playing-0.941
 office-0.820
 living-0.812
 neighborhood-0.800
ich-0.789
Token
Token ID198
Feature activation+3.20e-02

Pos logit contributions
 playing+0.080
 living+0.065
 office+0.064
 neighborhood+0.064
ich+0.058
Neg logit contributions
le-0.137
war-0.134
umbs-0.133
of-0.131
bread-0.130
Token
Token ID198
Feature activation+3.37e-02

Pos logit contributions
 playing+0.002
 living+0.001
 office+0.001
 neighborhood+0.001
ich+0.001
Neg logit contributions
le-0.003
war-0.003
umbs-0.003
bread-0.003
bs-0.003
TokenThe
Token ID464
Feature activation-3.64e+00

Pos logit contributions
 playing+0.002
 living+0.002
 office+0.002
 neighborhood+0.002
ich+0.002
Neg logit contributions
umbs-0.002
le-0.002
war-0.002
bs-0.002
of-0.002
Token birds
Token ID10087
Feature activation+2.13e+00

Pos logit contributions
war+0.280
le+0.276
umbs+0.272
of+0.271
bs+0.270
Neg logit contributions
 playing-0.208
 office-0.182
 living-0.179
 neighborhood-0.177
ich-0.174
Token a
Token ID257
Feature activation+3.56e-01

Pos logit contributions
 playing+0.322
 living+0.274
 office+0.266
 neighborhood+0.248
 exploring+0.244
Neg logit contributions
war-0.474
umbs-0.470
le-0.458
bread-0.448
forts-0.441
Token little
Token ID1310
Feature activation-1.63e+00

Pos logit contributions
 playing+0.022
 living+0.019
 office+0.019
 neighborhood+0.019
ich+0.018
Neg logit contributions
war-0.026
le-0.026
umbs-0.026
of-0.025
bs-0.025
Token bit
Token ID1643
Feature activation-2.67e+00

Pos logit contributions
war+0.105
umbs+0.103
le+0.102
of+0.100
bs+0.099
Neg logit contributions
 playing-0.116
 living-0.104
 office-0.103
 neighborhood-0.101
ich-0.097
Token of
Token ID286
Feature activation-7.40e+00

Pos logit contributions
war+0.144
umbs+0.140
le+0.133
of+0.130
forts+0.126
Neg logit contributions
 playing-0.222
 living-0.202
 office-0.198
 neighborhood-0.194
 exploring-0.187
Token thought
Token ID1807
Feature activation-2.22e+00

Pos logit contributions
umbs+0.444
war+0.439
le+0.414
bread+0.410
of+0.410
Neg logit contributions
 playing-0.609
 living-0.546
 office-0.527
 neighborhood-0.510
 exploring-0.508
Tokenfulness
Token ID15538
Feature activation-1.65e+01

Pos logit contributions
war+0.155
umbs+0.152
le+0.151
of+0.148
forts+0.146
Neg logit contributions
 playing-0.144
 office-0.127
 living-0.127
 neighborhood-0.122
ich-0.119
Token,
Token ID11
Feature activation+2.88e+00

Pos logit contributions
war+1.247
le+1.222
umbs+1.216
of+1.200
bs+1.193
Neg logit contributions
 playing-0.966
 office-0.842
 living-0.839
 neighborhood-0.823
ich-0.805
Token dreams
Token ID10625
Feature activation-3.64e+00

Pos logit contributions
 playing+0.191
 living+0.168
 office+0.168
 neighborhood+0.162
ich+0.159
Neg logit contributions
war-0.202
umbs-0.197
le-0.196
of-0.191
forts-0.188
Token can
Token ID460
Feature activation-3.37e+00

Pos logit contributions
war+0.268
umbs+0.261
le+0.259
of+0.253
bs+0.252
Neg logit contributions
 playing-0.225
 living-0.197
 office-0.196
 neighborhood-0.189
ich-0.182
Token come
Token ID1282
Feature activation-2.93e+00

Pos logit contributions
war+0.272
umbs+0.264
bs+0.256
forts+0.252
le+0.252
Neg logit contributions
 playing-0.188
 living-0.166
 office-0.163
 neighborhood-0.158
ich-0.150
Token true
Token ID2081
Feature activation-2.01e+00

Pos logit contributions
war+0.217
le+0.216
of+0.211
umbs+0.211
bs+0.210
Neg logit contributions
 playing-0.173
 office-0.152
 living-0.150
 neighborhood-0.149
ich-0.147
Token.
Token ID13
Feature activation-1.07e+00

Pos logit contributions
 playing+0.184
 office+0.159
 living+0.159
 neighborhood+0.155
ich+0.151
Neg logit contributions
war-0.212
umbs-0.207
of-0.205
le-0.204
bs-0.201
Token
Token ID198
Feature activation+2.34e-02

Pos logit contributions
war+0.079
umbs+0.077
of+0.076
bs+0.076
le+0.076
Neg logit contributions
 playing-0.063
 office-0.056
ich-0.055
 living-0.055
 neighborhood-0.054
Token
Token ID198
Feature activation+1.52e-02

Pos logit contributions
 playing+0.001
 living+0.001
 office+0.001
 neighborhood+0.001
ich+0.001
Neg logit contributions
le-0.002
war-0.002
umbs-0.002
bread-0.002
bs-0.002
Token"
Token ID1
Feature activation-5.14e-01

Pos logit contributions
 playing+0.001
 office+0.001
 living+0.001
 neighborhood+0.001
ich+0.001
Neg logit contributions
le-0.001
war-0.001
umbs-0.001
bs-0.001
forts-0.001
TokenCan
Token ID6090
Feature activation+2.03e+00

Pos logit contributions
le+0.037
umbs+0.036
war+0.036
of+0.035
bs+0.035
Neg logit contributions
 playing-0.031
 neighborhood-0.029
ich-0.029
 living-0.029
 office-0.028
Token I
Token ID314
Feature activation+7.19e+00

Pos logit contributions
 neighborhood+0.137
 playing+0.137
 office+0.131
 living+0.129
 neighbourhood+0.128
Neg logit contributions
of-0.126
bs-0.119
war-0.118
le-0.112
fulness-0.108
Token try
Token ID1949
Feature activation-6.40e+00

Pos logit contributions
 playing+0.491
 office+0.433
 living+0.428
 neighborhood+0.424
ich+0.416
Neg logit contributions
war-0.476
umbs-0.467
le-0.462
of-0.450
bs-0.448
Token it
Token ID340
Feature activation-6.38e+00

Pos logit contributions
le+0.421
war+0.419
of+0.409
bs+0.406
umbs+0.403
Neg logit contributions
 playing-0.422
 office-0.386
 neighborhood-0.384
 living-0.380
ich-0.374
Token,
Token ID11
Feature activation+2.71e+00

Pos logit contributions
war+0.554
le+0.552
of+0.541
umbs+0.540
bs+0.537
Neg logit contributions
 playing-0.293
 office-0.251
 living-0.250
 neighborhood-0.249
ich-0.242
Token Sam
Token ID3409
Feature activation-4.31e-01

Pos logit contributions
 playing+0.145
 office+0.123
 living+0.118
 neighborhood+0.116
ich+0.115
Neg logit contributions
war-0.223
umbs-0.219
le-0.218
bs-0.214
of-0.213
Token?"
Token ID1701
Feature activation+5.11e+00

Pos logit contributions
le+0.037
war+0.035
bs+0.034
of+0.034
les+0.034
Neg logit contributions
 playing-0.021
ich-0.019
 neighborhood-0.019
 office-0.018
 living-0.018
Token songs
Token ID7259
Feature activation-6.24e+00

Pos logit contributions
 playing+0.218
 neighborhood+0.211
 office+0.211
ich+0.210
 living+0.195
Neg logit contributions
le-0.310
war-0.290
les-0.288
of-0.287
bs-0.284
Token for
Token ID329
Feature activation+1.70e-01

Pos logit contributions
le+0.463
war+0.460
umbs+0.450
of+0.449
bs+0.447
Neg logit contributions
 playing-0.362
 office-0.324
 neighborhood-0.320
 living-0.315
ich-0.315
Token her
Token ID607
Feature activation+1.38e-01

Pos logit contributions
 playing+0.010
 office+0.009
 living+0.008
 neighborhood+0.008
ich+0.008
Neg logit contributions
war-0.013
umbs-0.013
le-0.013
of-0.013
forts-0.013
Token family
Token ID1641
Feature activation+4.81e+00

Pos logit contributions
 playing+0.006
 office+0.006
 neighborhood+0.005
 factory+0.005
 living+0.005
Neg logit contributions
le-0.012
bs-0.011
of-0.011
war-0.011
les-0.011
Token and
Token ID290
Feature activation-3.57e+00

Pos logit contributions
 playing+0.320
ich+0.301
 office+0.299
 neighborhood+0.297
 living+0.285
Neg logit contributions
le-0.304
war-0.299
bs-0.292
of-0.291
umbs-0.287
Token friends
Token ID2460
Feature activation+8.55e+00

Pos logit contributions
bs+0.235
le+0.229
war+0.229
of+0.226
forts+0.221
Neg logit contributions
 playing-0.219
ich-0.210
 office-0.210
 neighborhood-0.205
 living-0.197
Token.
Token ID13
Feature activation-1.06e+00

Pos logit contributions
 playing+0.609
 office+0.578
ich+0.567
 neighborhood+0.566
 living+0.542
Neg logit contributions
le-0.514
war-0.503
of-0.491
bs-0.491
umbs-0.469
Token She
Token ID1375
Feature activation+1.53e+00

Pos logit contributions
war+0.064
of+0.063
umbs+0.062
bs+0.062
le+0.062
Neg logit contributions
 playing-0.075
ich-0.070
 office-0.070
 neighborhood-0.067
 living-0.066
Token is
Token ID318
Feature activation+2.19e+00

Pos logit contributions
 playing+0.091
ich+0.090
 office+0.088
 neighborhood+0.083
 factory+0.079
Neg logit contributions
le-0.106
bs-0.105
les-0.104
of-0.103
war-0.099
Token excited
Token ID6568
Feature activation-2.26e+00

Pos logit contributions
 neighborhood+0.137
 factory+0.136
ich+0.132
 cabinet+0.131
 office+0.130
Neg logit contributions
bs-0.116
le-0.108
ck-0.107
 harder-0.107
 syrup-0.105
Token to
Token ID284
Feature activation-7.32e-01

Pos logit contributions
le+0.151
war+0.147
of+0.146
bs+0.143
umbs+0.142
Neg logit contributions
 playing-0.144
ich-0.135
 office-0.134
 neighborhood-0.133
 living-0.130
Token are
Token ID389
Feature activation-3.35e+00

Pos logit contributions
 playing+0.404
 office+0.365
 neighborhood+0.359
 living+0.357
ich+0.356
Neg logit contributions
le-0.382
war-0.381
of-0.374
umbs-0.373
bs-0.369
Token friends
Token ID2460
Feature activation+9.10e+00

Pos logit contributions
bs+0.247
le+0.245
war+0.243
umbs+0.241
forts+0.240
Neg logit contributions
 playing-0.159
 neighborhood-0.157
 office-0.156
ich-0.155
gy-0.142
Token,
Token ID11
Feature activation+3.31e+00

Pos logit contributions
 playing+0.641
 office+0.610
ich+0.598
 neighborhood+0.594
 living+0.578
Neg logit contributions
le-0.533
bs-0.507
of-0.506
umbs-0.500
war-0.500
Token we
Token ID356
Feature activation+7.71e-01

Pos logit contributions
 playing+0.196
 office+0.178
 neighborhood+0.175
ich+0.174
 living+0.173
Neg logit contributions
war-0.235
le-0.233
of-0.231
umbs-0.230
bs-0.230
Token are
Token ID389
Feature activation-2.59e+00

Pos logit contributions
 playing+0.054
 office+0.050
ich+0.050
 living+0.049
 neighborhood+0.049
Neg logit contributions
le-0.047
war-0.047
bs-0.047
of-0.046
umbs-0.045
Token friends
Token ID2460
Feature activation+9.14e+00

Pos logit contributions
bs+0.199
le+0.192
forts+0.188
umbs+0.186
war+0.184
Neg logit contributions
 office-0.112
ich-0.109
 neighborhood-0.108
 playing-0.105
 factory-0.099
Token,
Token ID11
Feature activation+3.25e+00

Pos logit contributions
 playing+0.652
 office+0.602
ich+0.590
 neighborhood+0.590
 living+0.582
Neg logit contributions
le-0.547
war-0.534
umbs-0.529
of-0.528
bs-0.525
Token we
Token ID356
Feature activation+6.63e-01

Pos logit contributions
 playing+0.217
 office+0.198
 neighborhood+0.195
 living+0.194
ich+0.194
Neg logit contributions
war-0.208
le-0.206
of-0.204
umbs-0.203
bs-0.202
Token are
Token ID389
Feature activation-2.72e+00

Pos logit contributions
 playing+0.048
 office+0.044
ich+0.043
 living+0.043
 neighborhood+0.043
Neg logit contributions
le-0.040
war-0.040
bs-0.039
of-0.039
umbs-0.038
Token friends
Token ID2460
Feature activation+9.00e+00

Pos logit contributions
bs+0.207
le+0.202
forts+0.194
umbs+0.193
ck+0.192
Neg logit contributions
 office-0.120
 neighborhood-0.116
ich-0.114
 playing-0.112
 factory-0.106
Token,
Token ID11
Feature activation+3.11e+00

Pos logit contributions
 playing+0.637
 office+0.580
 neighborhood+0.570
 living+0.567
ich+0.565
Neg logit contributions
le-0.553
war-0.550
umbs-0.540
of-0.537
bs-0.534
Token mom
Token ID1995
Feature activation+2.36e+00

Pos logit contributions
bs+0.322
war+0.320
le+0.313
of+0.307
forts+0.304
Neg logit contributions
 playing-0.207
ich-0.189
 office-0.188
 living-0.179
 exploring-0.170
Token and
Token ID290
Feature activation-3.56e+00

Pos logit contributions
 playing+0.153
ich+0.138
 office+0.138
 neighborhood+0.137
 living+0.135
Neg logit contributions
le-0.157
war-0.155
of-0.153
bs-0.152
umbs-0.151
Token dad
Token ID9955
Feature activation+2.88e+00

Pos logit contributions
war+0.284
le+0.281
bs+0.279
of+0.277
umbs+0.276
Neg logit contributions
 playing-0.186
 office-0.162
 neighborhood-0.160
ich-0.159
 living-0.159
Token with
Token ID351
Feature activation+5.52e+00

Pos logit contributions
 playing+0.191
 living+0.168
 office+0.168
 neighborhood+0.164
ich+0.159
Neg logit contributions
war-0.197
umbs-0.194
le-0.192
of-0.189
bs-0.187
Token the
Token ID262
Feature activation-5.92e-01

Pos logit contributions
 playing+0.308
 living+0.249
 office+0.237
 walking+0.219
 exploring+0.213
Neg logit contributions
war-0.485
umbs-0.483
le-0.474
of-0.460
bread-0.460
Token lawn
Token ID21133
Feature activation+7.01e+00

Pos logit contributions
war+0.047
le+0.046
umbs+0.046
bs+0.046
of+0.046
Neg logit contributions
 playing-0.031
 office-0.027
ich-0.026
 living-0.026
 neighborhood-0.026
Token.
Token ID13
Feature activation-1.07e+00

Pos logit contributions
 playing+0.460
 office+0.409
 living+0.406
 neighborhood+0.400
ich+0.393
Neg logit contributions
war-0.477
le-0.469
umbs-0.463
of-0.459
bs-0.456
Token They
Token ID1119
Feature activation-3.15e+00

Pos logit contributions
le+0.089
war+0.087
forts+0.085
bread+0.085
umbs+0.084
Neg logit contributions
 playing-0.058
 living-0.050
 neighborhood-0.049
 office-0.047
 backyard-0.044
Token water
Token ID1660
Feature activation-5.74e+00

Pos logit contributions
war+0.209
umbs+0.205
le+0.205
of+0.199
bs+0.197
Neg logit contributions
 playing-0.215
 living-0.190
 office-0.190
 neighborhood-0.188
ich-0.181
Token the
Token ID262
Feature activation-5.82e-01

Pos logit contributions
war+0.424
le+0.415
umbs+0.410
of+0.406
forts+0.400
Neg logit contributions
 playing-0.363
 living-0.315
 office-0.306
 neighborhood-0.292
ich-0.288
Token flowers
Token ID12734
Feature activation+8.33e-01

Pos logit contributions
war+0.041
bs+0.040
umbs+0.040
le+0.040
of+0.039
Neg logit contributions
 playing-0.035
 office-0.032
ich-0.030
 neighborhood-0.030
 living-0.030
Token day
Token ID1110
Feature activation-2.88e-01

Pos logit contributions
war+0.392
le+0.391
of+0.384
bs+0.383
umbs+0.375
Neg logit contributions
 playing-0.247
 office-0.220
ich-0.219
 neighborhood-0.214
 living-0.209
Token,
Token ID11
Feature activation+2.75e+00

Pos logit contributions
bs+0.023
of+0.022
le+0.022
war+0.022
les+0.021
Neg logit contributions
 office-0.015
 neighborhood-0.014
 playing-0.014
ich-0.014
 living-0.013
Token while
Token ID981
Feature activation-6.13e+00

Pos logit contributions
 playing+0.169
 office+0.165
 neighborhood+0.164
ich+0.160
 living+0.157
Neg logit contributions
bs-0.186
of-0.183
war-0.178
le-0.178
umbs-0.171
Token she
Token ID673
Feature activation+7.50e-01

Pos logit contributions
bs+0.484
of+0.439
 sip+0.413
umbs+0.407
le+0.406
Neg logit contributions
 office-0.289
 neighborhood-0.289
ich-0.270
 neighbourhood-0.260
 living-0.254
Token was
Token ID373
Feature activation+5.11e+00

Pos logit contributions
ich+0.034
 office+0.034
 playing+0.033
 living+0.030
 neighborhood+0.029
Neg logit contributions
le-0.065
of-0.062
war-0.061
umbs-0.061
bs-0.060
Token playing
Token ID2712
Feature activation+1.32e+01

Pos logit contributions
ich+0.129
 Driver+0.124
 neighborhood+0.117
gy+0.110
 factory+0.097
Neg logit contributions
bs-0.507
 sip-0.460
le-0.454
war-0.447
 drift-0.443
Token in
Token ID287
Feature activation+5.54e+00

Pos logit contributions
ich+0.788
 neighborhood+0.688
 office+0.663
 neighbourhood+0.620
 playing+0.595
Neg logit contributions
le-1.002
 sip-0.922
bs-0.919
war-0.891
les-0.881
Token her
Token ID607
Feature activation+2.16e-01

Pos logit contributions
ich+0.398
 playing+0.369
 neighborhood+0.359
 office+0.355
 living+0.339
Neg logit contributions
of-0.288
war-0.286
 sip-0.277
le-0.275
umbs-0.271
Token backyard
Token ID24296
Feature activation+1.04e+01

Pos logit contributions
 saving+0.008
ich+0.008
 Sy+0.008
 selling+0.008
ter+0.007
Neg logit contributions
bs-0.016
les-0.015
 sip-0.015
of-0.014
war-0.014
Token,
Token ID11
Feature activation+2.78e+00

Pos logit contributions
ich+0.794
 Polic+0.701
 seeker+0.671
asc+0.659
itor+0.655
Neg logit contributions
le-0.527
of-0.517
bs-0.497
 sip-0.471
les-0.465
Token she
Token ID673
Feature activation+7.67e-01

Pos logit contributions
 neighborhood+0.189
ich+0.187
 office+0.183
 playing+0.180
 neighbourhood+0.173
Neg logit contributions
bs-0.167
of-0.160
le-0.155
war-0.149
les-0.143
Token "
Token ID366
Feature activation+4.54e+00

Pos logit contributions
war+0.101
le+0.100
umbs+0.099
of+0.099
bs+0.098
Neg logit contributions
 playing-0.042
 office-0.034
 living-0.033
 neighborhood-0.032
ich-0.032
TokenI
Token ID40
Feature activation+7.76e+00

Pos logit contributions
 playing+0.228
 living+0.201
ich+0.191
 neighborhood+0.188
 office+0.184
Neg logit contributions
war-0.375
le-0.371
bs-0.366
umbs-0.366
forts-0.359
Token can
Token ID460
Feature activation-3.55e+00

Pos logit contributions
 playing+0.444
 office+0.393
 living+0.385
 neighborhood+0.383
ich+0.380
Neg logit contributions
le-0.590
war-0.590
of-0.576
bs-0.574
umbs-0.573
Token call
Token ID869
Feature activation+5.08e-01

Pos logit contributions
le+0.248
war+0.247
of+0.244
umbs+0.242
bs+0.241
Neg logit contributions
 playing-0.221
 office-0.200
 neighborhood-0.196
 living-0.196
ich-0.192
Token your
Token ID534
Feature activation+3.56e+00

Pos logit contributions
 playing+0.029
 office+0.025
 neighborhood+0.025
 living+0.025
ich+0.025
Neg logit contributions
war-0.039
le-0.038
of-0.038
umbs-0.037
bs-0.037
Token name
Token ID1438
Feature activation+1.91e+00

Pos logit contributions
 playing+0.212
 office+0.188
 neighborhood+0.187
 living+0.185
ich+0.181
Neg logit contributions
war-0.250
le-0.247
of-0.247
bs-0.245
umbs-0.243
Token.
Token ID13
Feature activation-1.05e+00

Pos logit contributions
 playing+0.113
 office+0.099
 living+0.098
 neighborhood+0.096
ich+0.095
Neg logit contributions
war-0.142
le-0.140
umbs-0.139
of-0.137
bs-0.137
Token What
Token ID1867
Feature activation+8.22e+00

Pos logit contributions
of+0.083
war+0.082
umbs+0.081
bs+0.081
le+0.081
Neg logit contributions
 playing-0.056
 office-0.050
 neighborhood-0.050
ich-0.049
 living-0.048
Token is
Token ID318
Feature activation+2.20e+00

Pos logit contributions
 playing+0.495
 office+0.452
ich+0.448
 neighborhood+0.448
 living+0.431
Neg logit contributions
le-0.582
war-0.581
of-0.576
bs-0.568
umbs-0.558
Token your
Token ID534
Feature activation+3.57e+00

Pos logit contributions
 neighborhood+0.138
 office+0.128
 playing+0.128
 neighbourhood+0.126
ich+0.121
Neg logit contributions
le-0.142
bs-0.140
war-0.136
of-0.132
ck-0.132
Token name
Token ID1438
Feature activation+1.90e+00

Pos logit contributions
 playing+0.233
 office+0.206
 neighborhood+0.206
 living+0.206
ich+0.200
Neg logit contributions
war-0.227
of-0.225
le-0.224
bs-0.221
umbs-0.217
Token
Token ID198
Feature activation+1.49e-01

Pos logit contributions
le+0.074
war+0.073
umbs+0.071
of+0.071
bs+0.070
Neg logit contributions
 playing-0.049
 living-0.042
 office-0.041
 neighborhood-0.041
ich-0.040
Token
Token ID198
Feature activation+1.44e-01

Pos logit contributions
 playing+0.006
 living+0.005
 neighborhood+0.005
 office+0.005
ich+0.005
Neg logit contributions
le-0.014
bread-0.014
umbs-0.014
war-0.013
forts-0.013
Token"
Token ID1
Feature activation-3.95e-01

Pos logit contributions
 playing+0.009
 living+0.008
 office+0.008
 neighborhood+0.008
ich+0.008
Neg logit contributions
le-0.010
umbs-0.010
war-0.010
bread-0.010
forts-0.010
TokenO
Token ID46
Feature activation-4.24e+00

Pos logit contributions
uits+0.029
umbs+0.029
ets+0.029
fulness+0.029
bs+0.028
Neg logit contributions
 playing-0.020
 living-0.019
ich-0.019
 neighborhood-0.018
 ha-0.018
Tokenw
Token ID86
Feature activation-6.76e+00

Pos logit contributions
umbs+0.272
war+0.270
le+0.256
forts+0.255
bread+0.254
Neg logit contributions
 playing-0.288
ich-0.266
 office-0.256
 living-0.247
 neighborhood-0.242
Token,
Token ID11
Feature activation+2.81e+00

Pos logit contributions
le+0.545
war+0.534
of+0.532
umbs+0.525
bs+0.522
Neg logit contributions
 playing-0.354
 neighborhood-0.327
 office-0.326
 living-0.319
ich-0.296
Token ow
Token ID12334
Feature activation-9.98e-01

Pos logit contributions
 playing+0.162
 neighborhood+0.145
 office+0.145
 living+0.141
ich+0.138
Neg logit contributions
war-0.209
le-0.208
umbs-0.206
of-0.205
bs-0.204
Token,
Token ID11
Feature activation+2.79e+00

Pos logit contributions
war+0.090
umbs+0.088
le+0.088
of+0.086
forts+0.086
Neg logit contributions
 playing-0.045
 living-0.036
 office-0.036
ich-0.036
 neighborhood-0.035
Token ow
Token ID12334
Feature activation-1.00e+00

Pos logit contributions
 playing+0.157
 office+0.136
 living+0.135
 neighborhood+0.133
ich+0.131
Neg logit contributions
war-0.217
le-0.214
umbs-0.212
of-0.210
bs-0.209
Token!"
Token ID2474
Feature activation+1.65e+00

Pos logit contributions
war+0.074
umbs+0.071
forts+0.069
le+0.069
of+0.068
Neg logit contributions
 playing-0.063
ich-0.054
 living-0.053
 office-0.052
 neighborhood-0.049
Token Mia
Token ID32189
Feature activation-1.36e+00

Pos logit contributions
 playing+0.097
 living+0.082
 office+0.081
 neighborhood+0.078
ich+0.076
Neg logit contributions
war-0.129
umbs-0.126
le-0.125
of-0.123
forts-0.121
Token to
Token ID284
Feature activation-6.65e-01

Pos logit contributions
 playing+0.142
 office+0.132
ich+0.131
 neighborhood+0.129
 living+0.127
Neg logit contributions
war-0.148
le-0.147
of-0.144
bs-0.143
umbs-0.143
Token play
Token ID711
Feature activation+4.13e+00

Pos logit contributions
le+0.052
war+0.052
of+0.052
bs+0.051
forts+0.050
Neg logit contributions
ich-0.034
 playing-0.033
 office-0.033
 neighborhood-0.031
 living-0.030
Token.
Token ID13
Feature activation-9.80e-01

Pos logit contributions
ich+0.240
 playing+0.236
 office+0.225
 neighborhood+0.223
 living+0.212
Neg logit contributions
le-0.302
war-0.292
bs-0.283
of-0.280
umbs-0.279
Token Emma
Token ID18966
Feature activation+2.33e+00

Pos logit contributions
war+0.067
umbs+0.066
bs+0.065
forts+0.064
of+0.064
Neg logit contributions
ich-0.060
 playing-0.058
 office-0.057
 neighborhood-0.052
 living-0.051
Token saw
Token ID2497
Feature activation+4.16e+00

Pos logit contributions
ich+0.155
 office+0.155
 seeker+0.150
 playing+0.143
 exploring+0.137
Neg logit contributions
le-0.137
bs-0.137
of-0.134
les-0.133
 sip-0.124
Token the
Token ID262
Feature activation-5.06e-01

Pos logit contributions
 playing+0.241
ich+0.236
 office+0.234
 neighborhood+0.231
 living+0.212
Neg logit contributions
le-0.288
war-0.286
of-0.284
bs-0.280
umbs-0.278
Token green
Token ID4077
Feature activation-2.88e+00

Pos logit contributions
le+0.033
umbs+0.032
les+0.032
 syrup+0.031
of+0.030
Neg logit contributions
 playing-0.030
 office-0.029
ich-0.028
 neighborhood-0.027
 exploring-0.026
Token t
Token ID256
Feature activation+1.73e+00

Pos logit contributions
le+0.234
les+0.226
of+0.212
bread+0.210
umbs+0.208
Neg logit contributions
 playing-0.147
 office-0.136
ich-0.135
 neighborhood-0.135
 seeking-0.123
Tokeneddy
Token ID21874
Feature activation-1.93e+00

Pos logit contributions
 playing+0.121
 office+0.107
 neighborhood+0.106
 living+0.106
ich+0.101
Neg logit contributions
le-0.113
umbs-0.111
war-0.110
of-0.110
bs-0.106
Token bear
Token ID6842
Feature activation+5.17e+00

Pos logit contributions
le+0.176
les+0.169
of+0.159
war+0.154
umbs+0.152
Neg logit contributions
 playing-0.081
 neighborhood-0.079
 office-0.079
ich-0.074
 neighbourhood-0.070
Token and
Token ID290
Feature activation-3.45e+00

Pos logit contributions
 playing+0.339
 office+0.318
 neighborhood+0.318
ich+0.316
 living+0.308
Neg logit contributions
le-0.333
war-0.325
of-0.321
umbs-0.321
bs-0.315
Token sun
Token ID4252
Feature activation-2.87e+00

Pos logit contributions
war+0.040
le+0.040
bs+0.039
umbs+0.039
of+0.039
Neg logit contributions
 playing-0.037
 living-0.033
 office-0.033
 neighborhood-0.033
ich-0.032
Token went
Token ID1816
Feature activation+1.14e+00

Pos logit contributions
war+0.235
le+0.231
umbs+0.229
of+0.228
bs+0.227
Neg logit contributions
 playing-0.151
 office-0.128
 living-0.128
 neighborhood-0.127
ich-0.124
Token down
Token ID866
Feature activation+2.74e+00

Pos logit contributions
 playing+0.059
 office+0.051
 living+0.051
 neighborhood+0.050
ich+0.049
Neg logit contributions
war-0.093
le-0.092
umbs-0.091
of-0.091
bs-0.090
Token.
Token ID13
Feature activation-1.05e+00

Pos logit contributions
 playing+0.163
 living+0.139
 neighborhood+0.136
 office+0.133
 exploring+0.128
Neg logit contributions
war-0.215
le-0.211
umbs-0.207
of-0.207
bs-0.206
Token It
Token ID632
Feature activation-5.19e+00

Pos logit contributions
umbs+0.072
war+0.071
bs+0.070
of+0.069
le+0.068
Neg logit contributions
 playing-0.067
 office-0.064
ich-0.060
 neighborhood-0.059
 living-0.059
Token was
Token ID373
Feature activation+5.07e+00

Pos logit contributions
le+0.489
war+0.483
of+0.477
umbs+0.474
bs+0.468
Neg logit contributions
 playing-0.206
 office-0.176
ich-0.164
 living-0.164
 neighborhood-0.163
Token a
Token ID257
Feature activation+2.14e-01

Pos logit contributions
 playing+0.328
 office+0.294
 living+0.289
 neighborhood+0.287
ich+0.284
Neg logit contributions
war-0.348
le-0.345
umbs-0.340
of-0.337
bs-0.335
Token lovely
Token ID14081
Feature activation+1.04e+00

Pos logit contributions
 playing+0.014
 office+0.013
ich+0.012
 neighborhood+0.012
 living+0.012
Neg logit contributions
war-0.014
of-0.014
le-0.014
bs-0.014
umbs-0.013
Token day
Token ID1110
Feature activation-3.34e-01

Pos logit contributions
 playing+0.056
 office+0.052
ich+0.051
 neighborhood+0.050
 living+0.049
Neg logit contributions
le-0.079
war-0.077
of-0.077
umbs-0.075
bs-0.074
Token at
Token ID379
Feature activation+2.65e+00

Pos logit contributions
war+0.025
le+0.025
umbs+0.025
of+0.024
bs+0.024
Neg logit contributions
 playing-0.020
 living-0.017
 office-0.017
 neighborhood-0.017
ich-0.016
Token the
Token ID262
Feature activation-5.79e-01

Pos logit contributions
 playing+0.171
 office+0.164
ich+0.164
 neighborhood+0.162
 living+0.160
Neg logit contributions
le-0.161
war-0.158
of-0.156
umbs-0.154
les-0.148
Token."
Token ID526
Feature activation-1.73e+00

Pos logit contributions
war+0.269
umbs+0.259
le+0.259
of+0.258
bs+0.258
Neg logit contributions
 playing-0.240
 office-0.212
 living-0.207
 neighborhood-0.202
ich-0.197
Token
Token ID198
Feature activation+1.27e-01

Pos logit contributions
le+0.147
war+0.147
umbs+0.142
of+0.140
bread+0.138
Neg logit contributions
 playing-0.090
 living-0.076
 office-0.074
 neighborhood-0.073
 exploring-0.067
Token
Token ID198
Feature activation+1.25e-01

Pos logit contributions
 playing+0.005
 living+0.005
 neighborhood+0.004
 office+0.004
ich+0.004
Neg logit contributions
le-0.012
war-0.011
bread-0.011
umbs-0.011
forts-0.011
TokenBen
Token ID11696
Feature activation-3.63e+00

Pos logit contributions
 playing+0.008
 neighborhood+0.008
 living+0.008
 office+0.008
ich+0.007
Neg logit contributions
le-0.008
umbs-0.008
war-0.008
les-0.008
forts-0.008
Token tried
Token ID3088
Feature activation-2.85e+00

Pos logit contributions
le+0.224
of+0.215
war+0.215
les+0.213
bs+0.209
Neg logit contributions
 playing-0.251
ich-0.239
 office-0.228
 neighborhood-0.223
 living-0.221
Token to
Token ID284
Feature activation-6.43e-01

Pos logit contributions
war+0.215
le+0.213
umbs+0.212
bs+0.208
of+0.207
Neg logit contributions
 playing-0.174
 office-0.151
 living-0.149
 neighborhood-0.145
ich-0.140
Token push
Token ID4574
Feature activation-2.71e+00

Pos logit contributions
war+0.045
umbs+0.045
bs+0.044
forts+0.043
le+0.043
Neg logit contributions
 playing-0.042
 office-0.037
 living-0.036
 neighborhood-0.035
ich-0.035
Token the
Token ID262
Feature activation-5.07e-01

Pos logit contributions
umbs+0.224
bread+0.221
forts+0.219
war+0.218
bs+0.212
Neg logit contributions
 playing-0.144
 office-0.117
 living-0.114
 neighborhood-0.108
 seeker-0.104
Token bath
Token ID7837
Feature activation-1.16e+00

Pos logit contributions
war+0.036
le+0.036
umbs+0.035
of+0.035
bs+0.035
Neg logit contributions
 playing-0.032
 office-0.028
 living-0.028
 neighborhood-0.028
ich-0.027
Tokenrobe
Token ID25481
Feature activation+6.53e+00

Pos logit contributions
war+0.103
le+0.103
umbs+0.102
of+0.101
bs+0.101
Neg logit contributions
 playing-0.051
 office-0.042
ich-0.042
 neighborhood-0.042
 living-0.041
Token in
Token ID287
Feature activation+5.53e+00

Pos logit contributions
 playing+0.297
 office+0.246
 living+0.238
 neighborhood+0.217
 factory+0.208
Neg logit contributions
umbs-0.581
war-0.570
forts-0.563
bread-0.558
le-0.557
Token He
Token ID679
Feature activation+1.35e+00

Pos logit contributions
war+0.077
le+0.076
umbs+0.076
of+0.075
bs+0.075
Neg logit contributions
 playing-0.059
 office-0.051
 living-0.051
 neighborhood-0.050
ich-0.050
Token says
Token ID1139
Feature activation-6.80e+00

Pos logit contributions
 playing+0.108
 living+0.101
 office+0.098
 neighborhood+0.096
ich+0.091
Neg logit contributions
war-0.074
umbs-0.074
le-0.070
bs-0.068
of-0.068
Token he
Token ID339
Feature activation+7.51e-01

Pos logit contributions
war+0.604
le+0.598
umbs+0.591
of+0.589
bs+0.587
Neg logit contributions
 playing-0.306
 office-0.258
 living-0.254
 neighborhood-0.252
ich-0.246
Token will
Token ID481
Feature activation-5.35e+00

Pos logit contributions
 playing+0.048
 office+0.042
ich+0.042
 neighborhood+0.041
 living+0.041
Neg logit contributions
le-0.052
war-0.051
bs-0.050
of-0.050
forts-0.050
Token be
Token ID307
Feature activation+2.95e+00

Pos logit contributions
war+0.527
umbs+0.527
le+0.523
les+0.511
of+0.507
Neg logit contributions
 playing-0.210
 office-0.168
 living-0.160
 neighborhood-0.152
ich-0.137
Token nice
Token ID3621
Feature activation-1.50e+00

Pos logit contributions
 playing+0.182
ich+0.168
 office+0.166
 neighborhood+0.165
 living+0.160
Neg logit contributions
war-0.203
le-0.199
bs-0.195
umbs-0.193
forts-0.190
Token.
Token ID13
Feature activation-1.03e+00

Pos logit contributions
war+0.108
le+0.105
umbs+0.104
of+0.103
bs+0.103
Neg logit contributions
 playing-0.096
 office-0.085
 living-0.085
 neighborhood-0.082
ich-0.080
Token He
Token ID679
Feature activation+1.34e+00

Pos logit contributions
le+0.086
war+0.085
umbs+0.083
of+0.083
bs+0.081
Neg logit contributions
 playing-0.055
 living-0.047
 office-0.047
 neighborhood-0.046
ich-0.043
Token says
Token ID1139
Feature activation-6.81e+00

Pos logit contributions
 playing+0.106
 living+0.099
 office+0.097
 neighborhood+0.094
ich+0.089
Neg logit contributions
war-0.074
umbs-0.074
le-0.070
fulness-0.069
of-0.068
Token he
Token ID339
Feature activation+7.26e-01

Pos logit contributions
war+0.515
le+0.509
umbs+0.504
of+0.495
bs+0.487
Neg logit contributions
 playing-0.413
 office-0.355
 living-0.355
 neighborhood-0.340
ich-0.338
Token will
Token ID481
Feature activation-5.38e+00

Pos logit contributions
 playing+0.048
 office+0.042
ich+0.042
 neighborhood+0.042
 living+0.041
Neg logit contributions
war-0.049
le-0.049
bs-0.048
of-0.048
umbs-0.047
Token ball
Token ID2613
Feature activation+1.07e+00

Pos logit contributions
 playing+0.106
 office+0.096
 living+0.095
 neighborhood+0.094
ich+0.094
Neg logit contributions
war-0.116
le-0.115
of-0.112
bs-0.112
umbs-0.111
Token.
Token ID13
Feature activation-6.15e-01

Pos logit contributions
 playing+0.061
 office+0.053
 living+0.053
 neighborhood+0.053
ich+0.052
Neg logit contributions
war-0.082
le-0.081
umbs-0.080
of-0.079
bs-0.079
Token You
Token ID921
Feature activation+6.56e+00

Pos logit contributions
war+0.045
le+0.044
umbs+0.044
of+0.044
bs+0.044
Neg logit contributions
 playing-0.037
 office-0.032
 neighborhood-0.032
 living-0.032
ich-0.031
Token can
Token ID460
Feature activation-3.14e+00

Pos logit contributions
 playing+0.390
 living+0.339
 neighborhood+0.330
 office+0.326
 seeker+0.302
Neg logit contributions
war-0.483
umbs-0.479
le-0.465
bread-0.459
of-0.456
Token throw
Token ID3714
Feature activation-2.66e-01

Pos logit contributions
war+0.240
le+0.239
of+0.237
umbs+0.235
bs+0.233
Neg logit contributions
 playing-0.176
 office-0.156
 living-0.153
 neighborhood-0.152
ich-0.150
Token it
Token ID340
Feature activation-6.02e+00

Pos logit contributions
umbs+0.019
bread+0.019
war+0.018
forts+0.018
bs+0.018
Neg logit contributions
 playing-0.018
 living-0.015
 office-0.014
 neighborhood-0.014
ich-0.013
Token and
Token ID290
Feature activation-3.19e+00

Pos logit contributions
war+0.467
le+0.461
umbs+0.460
of+0.452
bs+0.452
Neg logit contributions
 playing-0.339
 living-0.290
 office-0.287
 neighborhood-0.279
ich-0.274
Token he
Token ID339
Feature activation+1.04e+00

Pos logit contributions
war+0.217
le+0.215
umbs+0.213
of+0.209
bs+0.208
Neg logit contributions
 playing-0.211
 office-0.185
 living-0.185
 neighborhood-0.182
ich-0.179
Token will
Token ID481
Feature activation-5.05e+00

Pos logit contributions
 playing+0.067
 office+0.060
 neighborhood+0.059
 living+0.059
ich+0.059
Neg logit contributions
war-0.072
le-0.071
of-0.070
umbs-0.069
bs-0.069
Token fetch
Token ID21207
Feature activation+3.04e+00

Pos logit contributions
war+0.378
umbs+0.371
le+0.371
of+0.359
bs+0.359
Neg logit contributions
 playing-0.309
 living-0.264
 office-0.258
 neighborhood-0.254
ich-0.251
Token it
Token ID340
Feature activation-6.10e+00

Pos logit contributions
 playing+0.217
 living+0.188
 office+0.183
ich+0.178
 neighborhood+0.174
Neg logit contributions
umbs-0.197
war-0.192
forts-0.188
le-0.188
bread-0.188
Token from
Token ID422
Feature activation-1.16e+00

Pos logit contributions
le+0.102
war+0.102
bs+0.100
of+0.100
umbs+0.097
Neg logit contributions
 playing-0.073
 office-0.067
ich-0.067
 neighborhood-0.064
 living-0.063
Token a
Token ID257
Feature activation+2.16e-01

Pos logit contributions
bs+0.085
le+0.083
war+0.079
ck+0.078
of+0.078
Neg logit contributions
ich-0.057
 office-0.052
 playing-0.052
 neighborhood-0.052
 seeker-0.051
Token friend
Token ID1545
Feature activation+5.37e+00

Pos logit contributions
 playing+0.013
ich+0.013
 office+0.012
 living+0.011
 neighborhood+0.011
Neg logit contributions
war-0.015
bs-0.014
forts-0.014
of-0.014
le-0.013
Token.
Token ID13
Feature activation-1.04e+00

Pos logit contributions
 playing+0.345
ich+0.337
 office+0.323
 neighborhood+0.315
 living+0.303
Neg logit contributions
le-0.342
bs-0.337
war-0.337
of-0.332
forts-0.318
Token<|endoftext|>
Token ID50256
Feature activation-6.44e-01

Pos logit contributions
umbs+0.074
war+0.073
of+0.072
bs+0.070
le+0.069
Neg logit contributions
 playing-0.065
 office-0.059
 neighborhood-0.057
ich-0.057
 living-0.055
TokenOnce
Token ID7454
Feature activation-2.02e+00

Pos logit contributions
war+0.047
umbs+0.046
le+0.046
of+0.046
bs+0.045
Neg logit contributions
 playing-0.038
 office-0.034
ich-0.033
 living-0.033
 neighborhood-0.033
Token upon
Token ID2402
Feature activation-3.36e-02

Pos logit contributions
war+0.149
le+0.147
of+0.146
umbs+0.145
bs+0.145
Neg logit contributions
 playing-0.118
 office-0.105
 living-0.103
 neighborhood-0.103
ich-0.102
Token a
Token ID257
Feature activation+2.29e-01

Pos logit contributions
war+0.003
le+0.003
umbs+0.003
bs+0.003
of+0.003
Neg logit contributions
 playing-0.002
 living-0.001
 office-0.001
 neighborhood-0.001
ich-0.001
Token time
Token ID640
Feature activation-6.55e+00

Pos logit contributions
 playing+0.016
 living+0.014
 neighborhood+0.014
 office+0.014
 backyard+0.013
Neg logit contributions
war-0.015
bread-0.014
umbs-0.014
le-0.014
bs-0.014
Token,
Token ID11
Feature activation+2.72e+00

Pos logit contributions
of+0.397
bs+0.377
war+0.341
umbs+0.341
le+0.336
Neg logit contributions
 office-0.452
ich-0.440
 neighborhood-0.431
 neighbourhood-0.398
 living-0.393
Token there
Token ID612
Feature activation+3.85e-01

Pos logit contributions
 office+0.142
 neighborhood+0.136
 playing+0.135
ich+0.135
 neighbourhood+0.122
Neg logit contributions
le-0.210
bs-0.208
war-0.202
of-0.202
 sip-0.193
Token loved
Token ID6151
Feature activation+8.18e+00

Pos logit contributions
 office+0.083
ich+0.080
 neighborhood+0.075
 factory+0.073
 playing+0.071
Neg logit contributions
le-0.112
bs-0.112
les-0.104
kered-0.102
of-0.101
Token to
Token ID284
Feature activation-7.31e-01

Pos logit contributions
 playing+0.501
ich+0.481
 neighborhood+0.480
 office+0.478
 seeker+0.454
Neg logit contributions
war-0.522
bs-0.508
of-0.501
umbs-0.498
bread-0.492
Token drink
Token ID4144
Feature activation-5.34e+00

Pos logit contributions
war+0.058
le+0.057
umbs+0.057
of+0.056
bs+0.055
Neg logit contributions
 playing-0.041
 office-0.035
 living-0.035
 neighborhood-0.034
ich-0.033
Token juice
Token ID13135
Feature activation-4.66e+00

Pos logit contributions
war+0.378
umbs+0.356
le+0.353
forts+0.352
of+0.347
Neg logit contributions
 playing-0.368
 living-0.328
 office-0.293
 neighborhood-0.283
 exploring-0.281
Token and
Token ID290
Feature activation-3.54e+00

Pos logit contributions
war+0.399
le+0.384
forts+0.381
of+0.379
umbs+0.379
Neg logit contributions
 playing-0.236
 living-0.199
 office-0.188
 neighborhood-0.176
ich-0.172
Token water
Token ID1660
Feature activation-5.73e+00

Pos logit contributions
war+0.222
umbs+0.215
le+0.214
of+0.208
forts+0.204
Neg logit contributions
 playing-0.268
 living-0.236
 office-0.227
 neighborhood-0.223
 exploring-0.217
Token all
Token ID477
Feature activation+2.51e+00

Pos logit contributions
war+0.470
le+0.449
of+0.448
umbs+0.443
forts+0.440
Neg logit contributions
 playing-0.328
 living-0.281
 office-0.253
 exploring-0.232
 neighborhood-0.229
Token the
Token ID262
Feature activation-5.94e-01

Pos logit contributions
 playing+0.122
 living+0.104
 office+0.099
 neighborhood+0.097
ich+0.092
Neg logit contributions
war-0.217
le-0.214
umbs-0.214
of-0.210
bs-0.210
Token time
Token ID640
Feature activation-6.57e+00

Pos logit contributions
war+0.039
le+0.037
umbs+0.037
bs+0.037
of+0.037
Neg logit contributions
 playing-0.042
 living-0.038
 neighborhood-0.037
 office-0.036
ich-0.034
Token.
Token ID13
Feature activation-1.06e+00

Pos logit contributions
bs+0.424
le+0.421
war+0.418
of+0.414
umbs+0.412
Neg logit contributions
 playing-0.416
ich-0.394
 office-0.393
 neighborhood-0.385
 living-0.375
Token One
Token ID1881
Feature activation-4.90e+00

Pos logit contributions
war+0.063
of+0.062
umbs+0.061
le+0.061
bs+0.061
Neg logit contributions
 playing-0.076
ich-0.071
 office-0.070
 living-0.068
 neighborhood-0.068
Token.
Token ID13
Feature activation-1.06e+00

Pos logit contributions
 playing+0.033
 living+0.029
 office+0.028
 neighborhood+0.025
 hiding+0.024
Neg logit contributions
war-0.045
umbs-0.043
forts-0.042
le-0.042
of-0.042
Token It
Token ID632
Feature activation-5.20e+00

Pos logit contributions
le+0.090
war+0.089
umbs+0.087
of+0.086
forts+0.085
Neg logit contributions
 playing-0.053
 living-0.045
 office-0.044
 neighborhood-0.044
ich-0.041
Token makes
Token ID1838
Feature activation-4.05e+00

Pos logit contributions
war+0.389
umbs+0.388
forts+0.374
of+0.370
fulness+0.367
Neg logit contributions
 playing-0.312
 living-0.287
 office-0.262
 neighborhood-0.246
 hiding-0.242
Token a
Token ID257
Feature activation+2.00e-01

Pos logit contributions
forts+0.387
war+0.379
umbs+0.373
le+0.369
of+0.367
Neg logit contributions
 playing-0.189
 living-0.145
 office-0.135
 hiding-0.116
 neighborhood-0.113
Token loud
Token ID7812
Feature activation-8.21e+00

Pos logit contributions
 playing+0.013
 living+0.012
 office+0.012
ich+0.011
 neighborhood+0.011
Neg logit contributions
war-0.014
umbs-0.014
le-0.013
bs-0.013
of-0.013
Token noise
Token ID7838
Feature activation-2.65e+00

Pos logit contributions
war+0.619
umbs+0.615
bs+0.606
forts+0.595
of+0.584
Neg logit contributions
 playing-0.504
 living-0.435
 office-0.420
ich-0.389
 factory-0.385
Token.
Token ID13
Feature activation-1.07e+00

Pos logit contributions
war+0.214
umbs+0.208
forts+0.204
of+0.201
bs+0.198
Neg logit contributions
 playing-0.158
 living-0.134
 office-0.123
ich-0.114
 exploring-0.114
Token Anna
Token ID11735
Feature activation-1.64e+00

Pos logit contributions
le+0.088
war+0.087
umbs+0.085
of+0.084
forts+0.083
Neg logit contributions
 playing-0.057
 living-0.049
 office-0.047
 neighborhood-0.047
ich-0.044
Token likes
Token ID7832
Feature activation+3.27e+00

Pos logit contributions
war+0.117
umbs+0.115
of+0.111
le+0.111
forts+0.111
Neg logit contributions
 playing-0.106
 living-0.094
 office-0.090
 neighborhood-0.089
ich-0.085
Token the
Token ID262
Feature activation-5.89e-01

Pos logit contributions
 playing+0.205
 living+0.178
 office+0.178
 neighborhood+0.172
ich+0.170
Neg logit contributions
war-0.239
le-0.238
umbs-0.236
of-0.232
bs-0.228
Token noise
Token ID7838
Feature activation-2.64e+00

Pos logit contributions
war+0.051
le+0.051
umbs+0.051
of+0.050
bs+0.049
Neg logit contributions
 playing-0.029
 office-0.024
 living-0.024
 neighborhood-0.024
ich-0.023
Token Amy
Token ID14235
Feature activation+4.29e-02

Pos logit contributions
war+0.077
umbs+0.075
of+0.074
le+0.074
bs+0.074
Neg logit contributions
 playing-0.064
 office-0.059
ich-0.058
 living-0.057
 neighborhood-0.056
Token was
Token ID373
Feature activation+5.07e+00

Pos logit contributions
 office+0.002
 seeker+0.002
 playing+0.002
ich+0.002
 neighborhood+0.002
Neg logit contributions
le-0.003
of-0.003
bs-0.003
les-0.003
war-0.003
Token scared
Token ID12008
Feature activation+2.47e+00

Pos logit contributions
 office+0.288
 mall+0.288
 factory+0.284
 cabinet+0.282
 neighborhood+0.274
Neg logit contributions
ck-0.280
le-0.275
umbs-0.272
 harder-0.264
 tighter-0.254
Token but
Token ID475
Feature activation-2.12e+00

Pos logit contributions
 playing+0.148
 living+0.126
 office+0.121
 neighborhood+0.119
ich+0.115
Neg logit contributions
war-0.196
umbs-0.189
le-0.186
of-0.183
forts-0.183
Token she
Token ID673
Feature activation+6.99e-01

Pos logit contributions
bs+0.118
le+0.114
of+0.112
 tighter+0.111
war+0.105
Neg logit contributions
 office-0.153
ich-0.145
 factory-0.145
 neighborhood-0.141
gy-0.138
Token slowly
Token ID6364
Feature activation-9.22e+00

Pos logit contributions
 playing+0.046
 office+0.044
 living+0.042
 factory+0.041
 exploring+0.040
Neg logit contributions
le-0.047
of-0.045
bs-0.044
ck-0.043
war-0.042
Token moved
Token ID3888
Feature activation-1.33e+00

Pos logit contributions
umbs+0.767
war+0.732
fulness+0.700
forts+0.688
of+0.682
Neg logit contributions
 playing-0.521
 office-0.425
 living-0.424
 walking-0.418
 neighborhood-0.417
Token closer
Token ID5699
Feature activation-1.00e+01

Pos logit contributions
umbs+0.096
bread+0.090
forts+0.084
war+0.084
oons+0.083
Neg logit contributions
 playing-0.093
 exploring-0.077
 office-0.077
 living-0.077
 walking-0.076
Token to
Token ID284
Feature activation-7.34e-01

Pos logit contributions
umbs+0.874
war+0.848
forts+0.835
bs+0.824
le+0.809
Neg logit contributions
 playing-0.537
 living-0.452
 office-0.426
 exploring-0.420
 neighborhood-0.401
Token the
Token ID262
Feature activation-5.75e-01

Pos logit contributions
war+0.052
umbs+0.052
le+0.051
bs+0.050
of+0.050
Neg logit contributions
 playing-0.047
 living-0.040
 office-0.040
 neighborhood-0.039
ich-0.039
Token goose
Token ID37246
Feature activation-1.36e-01

Pos logit contributions
war+0.043
umbs+0.042
le+0.042
bs+0.042
of+0.042
Neg logit contributions
 playing-0.034
 living-0.031
ich-0.030
 office-0.030
 neighborhood-0.030
Token quickly
Token ID2952
Feature activation-4.25e+00

Pos logit contributions
 playing+0.548
 living+0.465
 office+0.459
 neighborhood+0.444
 exploring+0.419
Neg logit contributions
war-0.728
umbs-0.714
forts-0.706
le-0.695
bs-0.686
Token but
Token ID475
Feature activation-2.13e+00

Pos logit contributions
war+0.305
le+0.301
umbs+0.297
of+0.295
bs+0.294
Neg logit contributions
 playing-0.261
 office-0.230
 living-0.228
 neighborhood-0.226
ich-0.223
Token Jill
Token ID22772
Feature activation+3.71e-01

Pos logit contributions
bs+0.134
le+0.132
war+0.130
of+0.129
fulness+0.122
Neg logit contributions
 playing-0.137
ich-0.135
 office-0.130
 neighborhood-0.129
 living-0.127
Token was
Token ID373
Feature activation+5.05e+00

Pos logit contributions
 playing+0.019
ich+0.018
 office+0.017
 living+0.017
 neighborhood+0.017
Neg logit contributions
le-0.030
bs-0.029
of-0.028
war-0.028
les-0.028
Token counting
Token ID14143
Feature activation+9.31e+00

Pos logit contributions
ich+0.269
 neighborhood+0.241
 office+0.236
 playing+0.236
gy+0.233
Neg logit contributions
le-0.361
war-0.348
bs-0.348
 harder-0.339
bread-0.337
Token slowly
Token ID6364
Feature activation-9.23e+00

Pos logit contributions
 playing+0.567
 living+0.487
 office+0.486
 neighborhood+0.469
ich+0.441
Neg logit contributions
war-0.699
umbs-0.687
forts-0.675
le-0.668
of-0.661
Token.
Token ID13
Feature activation-1.06e+00

Pos logit contributions
war+0.651
le+0.639
umbs+0.635
of+0.628
bs+0.624
Neg logit contributions
 playing-0.586
 office-0.517
 living-0.513
 neighborhood-0.504
ich-0.492
Token She
Token ID1375
Feature activation+1.53e+00

Pos logit contributions
war+0.078
le+0.077
umbs+0.076
of+0.075
bs+0.075
Neg logit contributions
 playing-0.063
 office-0.056
 living-0.055
 neighborhood-0.054
ich-0.054
Token looked
Token ID3114
Feature activation-3.31e+00

Pos logit contributions
 playing+0.090
ich+0.085
 office+0.081
 living+0.080
 neighborhood+0.079
Neg logit contributions
le-0.111
of-0.108
bs-0.107
war-0.107
les-0.104
Token around
Token ID1088
Feature activation+2.95e+00

Pos logit contributions
le+0.257
war+0.254
bs+0.251
of+0.246
umbs+0.243
Neg logit contributions
 playing-0.179
ich-0.163
 office-0.162
 neighborhood-0.158
 living-0.156
Token and
Token ID290
Feature activation-3.55e+00

Pos logit contributions
 playing+0.186
 living+0.162
 office+0.158
 neighborhood+0.155
 exploring+0.148
Neg logit contributions
war-0.220
umbs-0.217
le-0.211
of-0.209
bs-0.207
Token some
Token ID617
Feature activation-5.76e+00

Pos logit contributions
war+0.248
umbs+0.243
forts+0.241
le+0.241
of+0.240
Neg logit contributions
 playing-0.191
 office-0.164
 living-0.163
 neighborhood-0.155
ich-0.154
Token cookies
Token ID14746
Feature activation-3.70e+00

Pos logit contributions
war+0.391
umbs+0.374
of+0.372
forts+0.371
bs+0.368
Neg logit contributions
 playing-0.406
 living-0.361
 office-0.350
 neighborhood-0.337
ich-0.321
Token with
Token ID351
Feature activation+5.54e+00

Pos logit contributions
war+0.256
le+0.249
of+0.246
umbs+0.245
bs+0.245
Neg logit contributions
 playing-0.244
 office-0.215
 living-0.214
 neighborhood-0.207
ich-0.205
Token the
Token ID262
Feature activation-5.80e-01

Pos logit contributions
 playing+0.352
 living+0.306
 office+0.277
 exploring+0.260
 neighborhood+0.255
Neg logit contributions
war-0.420
forts-0.401
le-0.398
umbs-0.394
of-0.392
Token brown
Token ID7586
Feature activation-2.82e+00

Pos logit contributions
war+0.025
le+0.024
of+0.024
umbs+0.023
bs+0.023
Neg logit contributions
 playing-0.054
 living-0.050
 office-0.049
 neighborhood-0.049
ich-0.048
Token butter
Token ID9215
Feature activation-1.26e+01

Pos logit contributions
war+0.151
of+0.141
umbs+0.140
le+0.138
forts+0.137
Neg logit contributions
 playing-0.234
 living-0.212
 office-0.211
 neighborhood-0.207
ich-0.202
Token!"
Token ID2474
Feature activation+1.58e+00

Pos logit contributions
war+0.854
le+0.827
of+0.817
bs+0.814
umbs+0.812
Neg logit contributions
 playing-0.845
 living-0.747
 office-0.746
ich-0.728
 neighborhood-0.720
Token<|endoftext|>
Token ID50256
Feature activation-6.62e-01

Pos logit contributions
 playing+0.086
 living+0.073
 office+0.073
 neighborhood+0.072
ich+0.067
Neg logit contributions
le-0.128
war-0.128
of-0.123
umbs-0.123
bs-0.123
TokenMary
Token ID24119
Feature activation-5.06e+00

Pos logit contributions
war+0.050
umbs+0.050
of+0.049
le+0.049
les+0.047
Neg logit contributions
 playing-0.037
 office-0.034
ich-0.034
 living-0.031
 neighborhood-0.031
Token was
Token ID373
Feature activation+5.06e+00

Pos logit contributions
le+0.331
bs+0.319
of+0.316
war+0.315
les+0.311
Neg logit contributions
 playing-0.333
ich-0.314
 office-0.311
 neighborhood-0.302
 living-0.297
Token in
Token ID287
Feature activation+5.48e+00

Pos logit contributions
ich+0.212
 factory+0.187
 neighborhood+0.178
 office+0.174
 Polic+0.174
Neg logit contributions
bs-0.403
umbs-0.385
le-0.384
 sip-0.376
war-0.369
Token pick
Token ID2298
Feature activation-6.27e-01

Pos logit contributions
war+0.232
umbs+0.229
le+0.228
of+0.224
forts+0.218
Neg logit contributions
 playing-0.226
 living-0.199
 office-0.196
 neighborhood-0.189
ich-0.183
Token the
Token ID262
Feature activation-4.13e-01

Pos logit contributions
forts+0.046
war+0.045
umbs+0.044
bs+0.042
of+0.042
Neg logit contributions
 playing-0.039
 living-0.035
 office-0.032
 neighborhood-0.031
 exploring-0.030
Token herbs
Token ID25411
Feature activation-4.07e+00

Pos logit contributions
war+0.027
le+0.026
umbs+0.026
of+0.026
bs+0.025
Neg logit contributions
 playing-0.029
 living-0.026
 office-0.026
 neighborhood-0.026
ich-0.025
Token.
Token ID13
Feature activation-9.00e-01

Pos logit contributions
war+0.328
of+0.312
umbs+0.311
forts+0.309
le+0.307
Neg logit contributions
 playing-0.234
 living-0.208
 office-0.192
 exploring-0.180
 neighborhood-0.177
Token Her
Token ID2332
Feature activation+3.93e-01

Pos logit contributions
war+0.068
umbs+0.067
le+0.067
of+0.066
bs+0.066
Neg logit contributions
 playing-0.052
 office-0.046
 living-0.045
 neighborhood-0.044
ich-0.044
Tokenbs
Token ID1443
Feature activation-1.47e+01

Pos logit contributions
 playing+0.020
ich+0.017
 office+0.017
 living+0.017
 neighborhood+0.017
Neg logit contributions
bs-0.031
war-0.030
le-0.030
forts-0.029
of-0.029
Token were
Token ID547
Feature activation-1.46e+00

Pos logit contributions
war+1.137
le+1.119
umbs+1.109
of+1.100
bs+1.093
Neg logit contributions
 playing-0.829
 office-0.721
 living-0.714
 neighborhood-0.705
ich-0.693
Token plants
Token ID6134
Feature activation+2.42e+00

Pos logit contributions
war+0.118
le+0.117
umbs+0.115
of+0.114
bs+0.114
Neg logit contributions
 playing-0.077
 office-0.067
 living-0.066
 neighborhood-0.065
ich-0.064
Token that
Token ID326
Feature activation+1.24e+00

Pos logit contributions
 playing+0.122
 office+0.104
 living+0.103
 neighborhood+0.101
ich+0.099
Neg logit contributions
war-0.201
le-0.198
umbs-0.196
of-0.195
bs-0.194
Token smelled
Token ID37609
Feature activation-3.07e+00

Pos logit contributions
 playing+0.077
 office+0.068
 living+0.067
 neighborhood+0.066
ich+0.065
Neg logit contributions
war-0.088
le-0.087
umbs-0.086
of-0.085
bs-0.085
Token good
Token ID922
Feature activation-2.63e+00

Pos logit contributions
war+0.244
of+0.233
umbs+0.233
le+0.230
forts+0.229
Neg logit contributions
 playing-0.178
 living-0.155
 neighborhood-0.140
 office-0.140
 exploring-0.136
Token her
Token ID607
Feature activation+4.10e-01

Pos logit contributions
war+0.218
umbs+0.214
le+0.212
of+0.209
forts+0.206
Neg logit contributions
 playing-0.230
 living-0.202
 office-0.198
 neighborhood-0.192
ich-0.187
Token dad
Token ID9955
Feature activation+3.14e+00

Pos logit contributions
 playing+0.025
 office+0.023
ich+0.023
 neighborhood+0.023
 living+0.022
Neg logit contributions
bs-0.027
le-0.026
war-0.026
umbs-0.026
bread-0.025
Token added
Token ID2087
Feature activation-4.75e+00

Pos logit contributions
 playing+0.203
 living+0.177
 office+0.173
 neighborhood+0.167
ich+0.161
Neg logit contributions
war-0.224
umbs-0.215
le-0.210
of-0.210
forts-0.209
Token some
Token ID617
Feature activation-5.55e+00

Pos logit contributions
war+0.396
forts+0.390
umbs+0.377
of+0.369
le+0.365
Neg logit contributions
 playing-0.272
 living-0.239
 office-0.196
 exploring-0.194
 best-0.191
Token noisy
Token ID31210
Feature activation-1.15e+00

Pos logit contributions
war+0.376
of+0.351
umbs+0.349
forts+0.347
le+0.330
Neg logit contributions
 playing-0.403
 living-0.372
 office-0.319
 neighborhood-0.314
 exploring-0.312
Token spices
Token ID27952
Feature activation-1.10e+01

Pos logit contributions
war+0.068
of+0.066
umbs+0.065
le+0.064
bs+0.064
Neg logit contributions
 playing-0.090
 living-0.080
 office-0.078
 neighborhood-0.076
 exploring-0.073
Token.
Token ID13
Feature activation-8.69e-01

Pos logit contributions
war+0.926
forts+0.887
of+0.883
umbs+0.866
reci+0.865
Neg logit contributions
 playing-0.581
 living-0.511
 office-0.429
 exploring-0.416
 hiding-0.398
Token He
Token ID679
Feature activation+1.48e+00

Pos logit contributions
war+0.062
le+0.062
umbs+0.061
of+0.060
bs+0.060
Neg logit contributions
 playing-0.054
 office-0.047
 living-0.047
 neighborhood-0.047
ich-0.046
Token said
Token ID531
Feature activation-5.20e+00

Pos logit contributions
 playing+0.092
 living+0.080
 office+0.079
 neighborhood+0.078
ich+0.075
Neg logit contributions
war-0.109
umbs-0.105
le-0.104
of-0.103
forts-0.102
Token,
Token ID11
Feature activation+2.86e+00

Pos logit contributions
war+0.411
le+0.408
umbs+0.402
of+0.401
bs+0.400
Neg logit contributions
 playing-0.280
 office-0.246
 neighborhood-0.242
 living-0.240
ich-0.238
Token "
Token ID366
Feature activation+4.68e+00

Pos logit contributions
 playing+0.105
 living+0.083
 office+0.081
 neighborhood+0.077
ich+0.075
Neg logit contributions
war-0.280
le-0.278
umbs-0.275
of-0.272
bs-0.270