TOP ACTIVATIONS
MAX = 15.206

the most perfect present ! Sy lves ter was so happy
he had never surrendered to Sy lves ter . <|endoftext|> Once
ety saw a cat named Sy lves ter trying to catch
you singing about jail ?" Sy lves ter asked .
on their way . The tort oise and the all igator
the fastest runner in the neighborhood . He was very happy
She liked to explore her neighborhood . One day , she
time there was an enormous tort oise who always wished for
She walked around the neighborhood and finally found the car
liked to walk around his neighborhood and look at all the
loved to ride around his neighborhood . One day , Tim
loved to wander around his neighborhood and explore new things .
liked to jog around their neighborhood every morning and it was
Bob continued to examine the neighborhood looking for the surprise .
rode it all around the neighborhood . He even rode it
girl wanted to explore her neighborhood , but it was a
loved to walk around his neighborhood and look at all the
liked to walk around his neighborhood with his mom my .
go outside and explore the neighborhood . His parents were hesitant
all rode together through the neighborhood . They went up and

MIN ACTIVATIONS
MIN = -17.762

" Le - le - le opard ," she says .
One day , Ann abel le 's mom asked her to
able little girl called El le . One day , El
. One day , El le was feeling brave so she
, and make everything spark le . Mom
the window . Ann abel le wanted to do something else
football . She would tick le him as he put it
Gold Land started to spark le and glitter .
them its light and spark le . They gave it back
the toy began to spark le and tw inkle like stars
's nice of you El le , but you know he
sound of hurry and bust le . G illy
It had a blue spark le in the sun and she
giving off a special spark le that made the princess feel
, it began to spark le and shine in the sunlight
heard his stomach g urg le after he ate it .
the basket started to spark le and shine .
A ire l le ran back to the door
g iggle . They tick le the fish back . They
special find . Ann abel le taught them the word �

INTERVAL 6.964 - 15.206
CONTAINS 2.369%

a spray bottle in the garden . It was yellow and
looking . Soon , he met an old man who was
I 'm sorry Steve . I hid the er aser and
my name is Lily and I like cats . Cats are
. He sat on the toilet and did his business .

INTERVAL -1.278 - 6.964
CONTAINS 68.702%

will not hurt us ," Lily said . "
. She tries to reach for the doll , but she
Mom came and hugged them . She said ,
they encountered a heavy storm . Sam reassured Zoe that they
a bit tired but he didn 't want to leave just

INTERVAL -9.520 - -1.278
CONTAINS 28.611%

birthday party and her parents were very wealthy . They offered
and bumped . They cried and shouted . They wanted to
learned something new . He told his friends about snakes and
of the bear on a screen . He smiled . "
did not understand why Max did not want to marry her

INTERVAL -17.762 - -9.520
CONTAINS 0.318%

" We used white cr ay ons , Mom . We
fire wasn 't coming any closer . The 3 - year
and che wed and che wed . Lily wanted to try
Dad said . " We understand that you are excited about
. He laughs and swim s to the edge .
Token the
Token ID262
Feature activation+1.61e-01

Pos logit contributions
 playing+0.278
ich+0.278
 neighborhood+0.273
 office+0.271
 living+0.249
Neg logit contributions
le-0.394
umbs-0.375
war-0.371
of-0.370
les-0.368
Token most
Token ID749
Feature activation+8.22e+00

Pos logit contributions
 playing+0.007
ich+0.006
 neighborhood+0.006
 office+0.006
 exploring+0.006
Neg logit contributions
le-0.013
of-0.013
les-0.012
war-0.012
umbs-0.012
Token perfect
Token ID2818
Feature activation+2.56e+00

Pos logit contributions
 playing+0.479
ich+0.448
 office+0.446
 neighborhood+0.436
 living+0.426
Neg logit contributions
le-0.601
of-0.567
les-0.564
umbs-0.563
war-0.563
Token present
Token ID1944
Feature activation-4.85e+00

Pos logit contributions
 playing+0.145
ich+0.141
 neighborhood+0.136
 office+0.134
 living+0.128
Neg logit contributions
le-0.185
of-0.172
les-0.171
war-0.170
umbs-0.169
Token!
Token ID0
Feature activation+2.99e+00

Pos logit contributions
le+0.305
of+0.290
umbs+0.286
les+0.279
war+0.273
Neg logit contributions
 playing-0.317
 neighborhood-0.311
ich-0.306
 office-0.299
 living-0.286
Token Sy
Token ID1632
Feature activation+1.52e+01

Pos logit contributions
 playing+0.180
ich+0.168
 office+0.167
 neighborhood+0.164
 living+0.161
Neg logit contributions
le-0.209
of-0.204
umbs-0.203
war-0.201
les-0.196
Tokenlves
Token ID31018
Feature activation+7.29e+00

Pos logit contributions
 playing+0.758
 office+0.718
 living+0.689
 neighborhood+0.682
ich+0.675
Neg logit contributions
le-1.284
of-1.248
les-1.245
war-1.193
bs-1.149
Tokenter
Token ID353
Feature activation+7.78e+00

Pos logit contributions
 playing+0.272
 neighborhood+0.247
 office+0.246
 backyard+0.224
 classroom+0.224
Neg logit contributions
le-0.710
les-0.681
of-0.662
fulness-0.624
umbs-0.623
Token was
Token ID373
Feature activation+5.51e+00

Pos logit contributions
 playing+0.375
ich+0.366
 office+0.358
 neighborhood+0.352
 seeker+0.349
Neg logit contributions
le-0.627
of-0.607
les-0.595
bs-0.574
umbs-0.571
Token so
Token ID523
Feature activation+3.93e+00

Pos logit contributions
ich+0.273
 neighborhood+0.265
 office+0.258
 factory+0.253
 playing+0.250
Neg logit contributions
le-0.417
umbs-0.401
of-0.399
les-0.388
war-0.387
Token happy
Token ID3772
Feature activation-1.25e+00

Pos logit contributions
 playing+0.224
ich+0.219
 office+0.219
 neighborhood+0.214
 living+0.205
Neg logit contributions
le-0.276
of-0.270
umbs-0.268
war-0.264
bs-0.263
Token he
Token ID339
Feature activation+1.13e+00

Pos logit contributions
ich+0.171
 neighborhood+0.171
 office+0.168
 playing+0.166
 seeker+0.160
Neg logit contributions
le-0.166
of-0.162
les-0.158
bs-0.156
fulness-0.155
Token had
Token ID550
Feature activation+1.58e+00

Pos logit contributions
 office+0.074
ich+0.072
 playing+0.071
 neighborhood+0.070
 factory+0.069
Neg logit contributions
le-0.076
of-0.072
les-0.069
bs-0.068
war-0.065
Token never
Token ID1239
Feature activation+3.40e+00

Pos logit contributions
ich+0.093
 factory+0.092
 office+0.089
 neighborhood+0.086
 seeker+0.083
Neg logit contributions
of-0.101
war-0.100
le-0.100
ck-0.094
bs-0.094
Token surrendered
Token ID29209
Feature activation-3.62e+00

Pos logit contributions
ich+0.203
 office+0.196
 playing+0.194
 factory+0.190
 neighborhood+0.190
Neg logit contributions
le-0.230
of-0.227
war-0.221
les-0.216
bs-0.212
Token to
Token ID284
Feature activation-4.84e-02

Pos logit contributions
le+0.245
of+0.234
war+0.231
umbs+0.229
les+0.228
Neg logit contributions
 playing-0.227
ich-0.216
 office-0.216
 neighborhood-0.212
 living-0.207
Token Sy
Token ID1632
Feature activation+1.47e+01

Pos logit contributions
le+0.003
of+0.003
les+0.003
war+0.002
fulness+0.002
Neg logit contributions
ich-0.003
 neighborhood-0.003
 office-0.003
 playing-0.003
 seeker-0.003
Tokenlves
Token ID31018
Feature activation+7.08e+00

Pos logit contributions
 playing+0.745
 office+0.693
ich+0.681
 living+0.675
 neighborhood+0.667
Neg logit contributions
le-1.228
of-1.185
les-1.177
war-1.152
bs-1.095
Tokenter
Token ID353
Feature activation+7.52e+00

Pos logit contributions
 playing+0.277
 neighborhood+0.271
 classroom+0.244
 office+0.243
 backyard+0.239
Neg logit contributions
le-0.678
of-0.631
les-0.628
war-0.601
fulness-0.586
Token.
Token ID13
Feature activation-5.89e-01

Pos logit contributions
 playing+0.522
 neighborhood+0.518
ich+0.514
 office+0.503
 factory+0.499
Neg logit contributions
le-0.450
of-0.413
war-0.393
les-0.391
umbs-0.384
Token<|endoftext|>
Token ID50256
Feature activation-1.67e+00

Pos logit contributions
of+0.030
forts+0.029
umbs+0.029
war+0.029
bs+0.028
Neg logit contributions
 playing-0.043
ich-0.041
 seeker-0.041
 office-0.040
 seeking-0.039
TokenOnce
Token ID7454
Feature activation-1.89e+00

Pos logit contributions
le+0.115
of+0.111
umbs+0.110
war+0.109
les+0.108
Neg logit contributions
 playing-0.102
ich-0.096
 office-0.096
 neighborhood-0.094
 living-0.092
Tokenety
Token ID2963
Feature activation-7.61e-01

Pos logit contributions
ich+0.402
 playing+0.397
 neighborhood+0.375
 office+0.352
 living+0.335
Neg logit contributions
le-0.757
war-0.728
umbs-0.715
 sip-0.709
les-0.708
Token saw
Token ID2497
Feature activation+3.30e+00

Pos logit contributions
le+0.064
les+0.061
of+0.060
war+0.058
bs+0.058
Neg logit contributions
 playing-0.036
 office-0.034
ich-0.034
 neighborhood-0.032
 living-0.032
Token a
Token ID257
Feature activation-3.88e-01

Pos logit contributions
 neighborhood+0.206
ich+0.204
 playing+0.203
 office+0.203
 factory+0.194
Neg logit contributions
le-0.201
of-0.196
les-0.192
umbs-0.190
war-0.189
Token cat
Token ID3797
Feature activation+2.23e+00

Pos logit contributions
of+0.022
le+0.022
 sip+0.021
les+0.021
war+0.020
Neg logit contributions
 playing-0.025
 office-0.024
 exploring-0.023
 seeking-0.023
 neighborhood-0.023
Token named
Token ID3706
Feature activation+9.47e+00

Pos logit contributions
 smartest+0.103
 neighborhood+0.102
 happiest+0.100
ich+0.099
 office+0.095
Neg logit contributions
le-0.184
of-0.171
les-0.168
 sip-0.167
bs-0.162
Token Sy
Token ID1632
Feature activation+1.47e+01

Pos logit contributions
 neighborhood+0.493
 office+0.474
ich+0.448
 playing+0.446
 neighbourhood+0.445
Neg logit contributions
of-0.709
umbs-0.692
war-0.683
le-0.670
bs-0.649
Tokenlves
Token ID31018
Feature activation+6.98e+00

Pos logit contributions
 playing+0.668
 office+0.633
 neighborhood+0.608
 living+0.598
ich+0.597
Neg logit contributions
le-1.279
of-1.231
les-1.218
war-1.208
umbs-1.171
Tokenter
Token ID353
Feature activation+7.41e+00

Pos logit contributions
 neighborhood+0.243
 playing+0.240
 office+0.228
 classroom+0.209
 backyard+0.203
Neg logit contributions
le-0.695
les-0.648
of-0.641
ck-0.617
war-0.615
Token trying
Token ID2111
Feature activation+4.28e+00

Pos logit contributions
 smartest+0.491
 neighborhood+0.488
 seeker+0.483
 happiest+0.482
 office+0.475
Neg logit contributions
le-0.439
of-0.417
les-0.390
bs-0.380
ck-0.375
Token to
Token ID284
Feature activation-2.52e-01

Pos logit contributions
 playing+0.245
 office+0.215
 living+0.215
ich+0.212
 neighborhood+0.210
Neg logit contributions
le-0.332
umbs-0.322
of-0.317
les-0.314
war-0.312
Token catch
Token ID4929
Feature activation+1.58e+00

Pos logit contributions
le+0.019
umbs+0.019
les+0.018
war+0.018
of+0.018
Neg logit contributions
 playing-0.014
 office-0.013
ich-0.013
 neighborhood-0.012
 living-0.012
Token you
Token ID345
Feature activation+2.01e+00

Pos logit contributions
of+0.221
bs+0.197
war+0.195
le+0.195
umbs+0.191
Neg logit contributions
 office-0.234
 neighborhood-0.230
 backyard-0.216
ich-0.216
 seeker-0.215
Token singing
Token ID13777
Feature activation+5.43e-01

Pos logit contributions
 neighborhood+0.088
 office+0.088
 playing+0.080
 living+0.080
 backyard+0.079
Neg logit contributions
le-0.164
umbs-0.158
of-0.158
les-0.155
war-0.150
Token about
Token ID546
Feature activation-1.90e+00

Pos logit contributions
 playing+0.028
ich+0.027
 office+0.027
 neighborhood+0.027
 living+0.026
Neg logit contributions
le-0.042
of-0.040
war-0.040
les-0.039
umbs-0.039
Token jail
Token ID7356
Feature activation+8.16e+00

Pos logit contributions
of+0.122
le+0.120
les+0.116
ck+0.108
l+0.107
Neg logit contributions
 neighborhood-0.106
 seeker-0.105
 office-0.104
 backyard-0.099
ich-0.098
Token?"
Token ID1701
Feature activation+4.49e+00

Pos logit contributions
 playing+0.451
ich+0.441
 neighborhood+0.436
 office+0.426
 living+0.420
Neg logit contributions
le-0.612
les-0.574
of-0.572
war-0.551
umbs-0.547
Token Sy
Token ID1632
Feature activation+1.46e+01

Pos logit contributions
 office+0.250
ich+0.247
 neighborhood+0.241
 playing+0.241
 seeker+0.237
Neg logit contributions
le-0.314
of-0.307
bs-0.305
war-0.299
les-0.297
Tokenlves
Token ID31018
Feature activation+6.88e+00

Pos logit contributions
 playing+0.884
 office+0.812
 neighborhood+0.810
 living+0.804
ich+0.792
Neg logit contributions
le-1.076
of-1.022
les-1.015
war-0.986
bs-0.965
Tokenter
Token ID353
Feature activation+7.34e+00

Pos logit contributions
 playing+0.243
 neighborhood+0.227
 office+0.206
 backyard+0.193
 neighbourhood+0.184
Neg logit contributions
le-0.689
les-0.654
of-0.649
ck-0.631
war-0.617
Token asked
Token ID1965
Feature activation-3.41e-01

Pos logit contributions
 playing+0.478
 office+0.471
 neighborhood+0.468
 living+0.456
ich+0.453
Neg logit contributions
le-0.489
of-0.449
les-0.439
bs-0.427
ck-0.424
Token.
Token ID13
Feature activation-9.55e-01

Pos logit contributions
le+0.018
of+0.017
les+0.016
war+0.016
bs+0.015
Neg logit contributions
ich-0.026
 neighborhood-0.026
 office-0.026
 playing-0.025
 backyard-0.025
Token
Token ID220
Feature activation-3.55e+00

Pos logit contributions
of+0.060
war+0.057
forts+0.057
l+0.057
bs+0.055
Neg logit contributions
 office-0.056
 seeker-0.056
ich-0.054
 playing-0.053
 smartest-0.052
Token on
Token ID319
Feature activation-1.33e-01

Pos logit contributions
le+0.062
umbs+0.059
of+0.058
les+0.058
bs+0.058
Neg logit contributions
ich-0.037
 playing-0.036
 office-0.035
 neighborhood-0.034
 factory-0.033
Token their
Token ID511
Feature activation-3.46e+00

Pos logit contributions
le+0.009
of+0.008
umbs+0.008
war+0.008
les+0.008
Neg logit contributions
 playing-0.009
 office-0.008
 neighborhood-0.008
ich-0.008
 living-0.008
Token way
Token ID835
Feature activation+1.78e+00

Pos logit contributions
le+0.235
of+0.223
war+0.221
les+0.219
bs+0.218
Neg logit contributions
 playing-0.203
ich-0.203
 office-0.195
 neighborhood-0.190
 living-0.182
Token.
Token ID13
Feature activation-4.70e-01

Pos logit contributions
 playing+0.109
 office+0.101
ich+0.101
 neighborhood+0.100
 living+0.098
Neg logit contributions
le-0.124
of-0.119
umbs-0.118
war-0.118
les-0.116
Token The
Token ID383
Feature activation-1.51e-01

Pos logit contributions
of+0.029
le+0.029
umbs+0.029
war+0.029
bs+0.028
Neg logit contributions
 playing-0.031
ich-0.030
 office-0.029
 neighborhood-0.028
 living-0.028
Token tort
Token ID7619
Feature activation+1.42e+01

Pos logit contributions
le+0.011
les+0.011
of+0.011
war+0.010
 sip+0.010
Neg logit contributions
 playing-0.008
ich-0.008
 neighborhood-0.007
 office-0.007
 exploring-0.007
Tokenoise
Token ID25678
Feature activation+1.75e+00

Pos logit contributions
 playing+0.899
 neighborhood+0.820
 office+0.817
ich+0.816
 living+0.815
Neg logit contributions
le-0.986
of-0.949
les-0.908
war-0.906
umbs-0.903
Token and
Token ID290
Feature activation-2.86e+00

Pos logit contributions
 playing+0.086
ich+0.084
 office+0.080
 neighborhood+0.078
 living+0.076
Neg logit contributions
le-0.142
of-0.137
les-0.135
war-0.131
bs-0.131
Token the
Token ID262
Feature activation+3.41e-01

Pos logit contributions
of+0.182
le+0.181
war+0.174
bs+0.174
les+0.171
Neg logit contributions
ich-0.180
 playing-0.176
 neighborhood-0.173
 office-0.169
 living-0.160
Token all
Token ID477
Feature activation+3.31e+00

Pos logit contributions
 playing+0.017
ich+0.016
 office+0.015
 neighborhood+0.015
 exploring+0.014
Neg logit contributions
le-0.025
les-0.024
of-0.024
war-0.023
forts-0.023
Tokenigator
Token ID23823
Feature activation+2.21e+00

Pos logit contributions
 playing+0.197
ich+0.186
 office+0.184
 neighborhood+0.182
 living+0.176
Neg logit contributions
le-0.238
of-0.228
war-0.226
umbs-0.224
les-0.224
Token the
Token ID262
Feature activation+4.09e-02

Pos logit contributions
le+0.069
bs+0.065
of+0.065
war+0.064
les+0.064
Neg logit contributions
 neighborhood-0.071
ich-0.071
 office-0.070
 playing-0.068
 living-0.066
Token fastest
Token ID14162
Feature activation+8.79e+00

Pos logit contributions
 playing+0.002
ich+0.002
 office+0.002
 neighborhood+0.002
 living+0.002
Neg logit contributions
le-0.003
of-0.003
war-0.003
les-0.003
umbs-0.003
Token runner
Token ID17490
Feature activation+6.55e+00

Pos logit contributions
ich+0.539
 playing+0.532
 office+0.524
 neighborhood+0.520
 living+0.501
Neg logit contributions
le-0.598
les-0.572
of-0.564
war-0.542
bs-0.531
Token in
Token ID287
Feature activation+5.88e+00

Pos logit contributions
ich+0.392
 neighborhood+0.378
 office+0.378
 playing+0.376
 living+0.361
Neg logit contributions
le-0.461
of-0.432
les-0.431
war-0.410
bs-0.407
Token the
Token ID262
Feature activation+1.61e-01

Pos logit contributions
ich+0.366
 playing+0.364
 office+0.350
 neighborhood+0.348
 living+0.336
Neg logit contributions
le-0.383
of-0.369
war-0.362
umbs-0.356
les-0.355
Token neighborhood
Token ID6232
Feature activation+1.41e+01

Pos logit contributions
ich+0.007
 playing+0.007
 saving+0.006
 oh+0.006
 office+0.006
Neg logit contributions
of-0.013
les-0.013
war-0.013
le-0.013
 sip-0.011
Token.
Token ID13
Feature activation-3.96e-01

Pos logit contributions
ich+0.982
 playing+0.942
 seeker+0.928
 office+0.919
 neighborhood+0.913
Neg logit contributions
le-0.825
of-0.784
les-0.773
war-0.739
umbs-0.736
Token He
Token ID679
Feature activation+1.70e+00

Pos logit contributions
of+0.026
umbs+0.026
le+0.026
war+0.025
les+0.024
Neg logit contributions
 playing-0.025
ich-0.024
 office-0.023
 neighborhood-0.023
 living-0.022
Token was
Token ID373
Feature activation+5.30e+00

Pos logit contributions
ich+0.087
 office+0.084
 playing+0.083
 neighborhood+0.076
 factory+0.076
Neg logit contributions
le-0.133
of-0.132
les-0.129
war-0.121
bs-0.121
Token very
Token ID845
Feature activation+6.37e+00

Pos logit contributions
ich+0.268
 factory+0.256
 neighborhood+0.256
 cabinet+0.251
 office+0.246
Neg logit contributions
le-0.378
of-0.356
ck-0.356
war-0.355
bs-0.349
Token happy
Token ID3772
Feature activation-1.32e+00

Pos logit contributions
ich+0.374
 office+0.372
 neighborhood+0.357
 playing+0.356
 factory+0.343
Neg logit contributions
bs-0.427
le-0.419
war-0.410
of-0.406
les-0.393
Token She
Token ID1375
Feature activation+1.54e+00

Pos logit contributions
of+0.118
umbs+0.116
le+0.113
war+0.113
bs+0.112
Neg logit contributions
ich-0.120
 playing-0.118
 office-0.116
 neighborhood-0.112
 living-0.109
Token liked
Token ID8288
Feature activation+4.95e+00

Pos logit contributions
 playing+0.090
ich+0.086
 office+0.084
 neighborhood+0.083
 living+0.080
Neg logit contributions
le-0.112
of-0.107
les-0.106
war-0.103
umbs-0.103
Token to
Token ID284
Feature activation+1.98e-01

Pos logit contributions
 playing+0.298
ich+0.293
 office+0.288
 neighborhood+0.288
 living+0.271
Neg logit contributions
le-0.327
of-0.319
war-0.314
umbs-0.312
les-0.311
Token explore
Token ID7301
Feature activation+5.17e+00

Pos logit contributions
 playing+0.011
 office+0.010
 neighborhood+0.010
 living+0.010
ich+0.010
Neg logit contributions
le-0.015
umbs-0.015
war-0.014
of-0.014
les-0.014
Token her
Token ID607
Feature activation+5.18e-01

Pos logit contributions
 playing+0.326
 office+0.300
ich+0.297
 neighborhood+0.296
 living+0.293
Neg logit contributions
le-0.355
umbs-0.343
of-0.340
war-0.340
les-0.332
Token neighborhood
Token ID6232
Feature activation+1.41e+01

Pos logit contributions
ich+0.023
 playing+0.022
 Sy+0.021
 smartest+0.021
 office+0.021
Neg logit contributions
le-0.041
les-0.040
of-0.039
bs-0.039
war-0.037
Token.
Token ID13
Feature activation-3.49e-01

Pos logit contributions
ich+0.994
 playing+0.948
 neighborhood+0.916
 office+0.914
 seeker+0.913
Neg logit contributions
le-0.827
les-0.760
of-0.759
umbs-0.723
war-0.718
Token One
Token ID1881
Feature activation-4.38e+00

Pos logit contributions
le+0.021
of+0.021
umbs+0.020
war+0.020
les+0.019
Neg logit contributions
 playing-0.024
ich-0.023
 office-0.023
 neighborhood-0.022
 living-0.022
Token day
Token ID1110
Feature activation+1.54e-01

Pos logit contributions
le+0.359
of+0.346
war+0.335
bs+0.335
les+0.334
Neg logit contributions
 playing-0.211
ich-0.210
 neighborhood-0.195
 office-0.193
 living-0.183
Token,
Token ID11
Feature activation+3.25e+00

Pos logit contributions
ich+0.008
 neighborhood+0.008
 office+0.008
 playing+0.008
 living+0.008
Neg logit contributions
le-0.012
bs-0.011
of-0.011
war-0.011
les-0.011
Token she
Token ID673
Feature activation+1.49e+00

Pos logit contributions
 neighborhood+0.204
 office+0.204
ich+0.202
 playing+0.199
 living+0.197
Neg logit contributions
le-0.202
of-0.199
bs-0.197
war-0.188
les-0.182
Token time
Token ID640
Feature activation-6.18e+00

Pos logit contributions
le+0.005
war+0.004
umbs+0.004
of+0.004
les+0.004
Neg logit contributions
 playing-0.006
 neighborhood-0.006
 office-0.006
ich-0.006
 living-0.006
Token there
Token ID612
Feature activation+5.88e-03

Pos logit contributions
of+0.345
bs+0.288
le+0.286
umbs+0.272
l+0.265
Neg logit contributions
 office-0.456
 neighborhood-0.446
ich-0.444
 smartest-0.424
 happiest-0.416
Token was
Token ID373
Feature activation+5.20e+00

Pos logit contributions
ich+0.000
 office+0.000
 playing+0.000
 neighborhood+0.000
 exploring+0.000
Neg logit contributions
le-0.000
of-0.000
les-0.000
umbs-0.000
 sip-0.000
Token an
Token ID281
Feature activation+2.73e+00

Pos logit contributions
 playing+0.285
 office+0.259
ich+0.259
 neighborhood+0.255
 living+0.251
Neg logit contributions
le-0.401
of-0.386
war-0.384
umbs-0.384
les-0.379
Token enormous
Token ID9812
Feature activation+9.13e-01

Pos logit contributions
 playing+0.145
ich+0.138
 neighborhood+0.136
 living+0.133
 seeking+0.130
Neg logit contributions
le-0.207
les-0.189
of-0.182
umbs-0.180
ck-0.177
Token tort
Token ID7619
Feature activation+1.40e+01

Pos logit contributions
ich+0.046
 playing+0.046
 saving+0.046
 neighborhood+0.045
 smartest+0.045
Neg logit contributions
le-0.063
les-0.061
 sip-0.059
of-0.058
bs-0.053
Tokenoise
Token ID25678
Feature activation+1.75e+00

Pos logit contributions
 playing+0.923
 neighborhood+0.866
 backyard+0.861
 living+0.857
 office+0.845
Neg logit contributions
le-0.921
of-0.888
umbs-0.838
war-0.823
les-0.819
Token who
Token ID508
Feature activation+4.93e+00

Pos logit contributions
 seeker+0.121
ich+0.121
 neighborhood+0.119
 smartest+0.118
 happiest+0.117
Neg logit contributions
le-0.096
of-0.091
les-0.084
bs-0.083
umbs-0.079
Token always
Token ID1464
Feature activation+6.92e+00

Pos logit contributions
ich+0.202
 playing+0.188
 office+0.182
 seeker+0.169
 neighborhood+0.166
Neg logit contributions
le-0.451
of-0.436
les-0.428
bs-0.422
war-0.414
Token wished
Token ID16555
Feature activation+2.87e+00

Pos logit contributions
ich+0.442
 office+0.414
 neighborhood+0.403
 backyard+0.364
 neighbourhood+0.360
Neg logit contributions
le-0.479
of-0.439
bs-0.425
war-0.422
les-0.421
Token for
Token ID329
Feature activation+8.80e-02

Pos logit contributions
 neighborhood+0.173
ich+0.173
 office+0.166
 playing+0.166
 seeker+0.162
Neg logit contributions
le-0.186
of-0.182
les-0.177
bs-0.174
umbs-0.173
Token
Token ID198
Feature activation+2.24e-01

Pos logit contributions
le+0.002
of+0.002
umbs+0.002
war+0.002
les+0.002
Neg logit contributions
 playing-0.002
ich-0.002
 office-0.002
 neighborhood-0.002
 living-0.002
TokenShe
Token ID3347
Feature activation+1.10e+00

Pos logit contributions
 playing+0.016
 office+0.015
ich+0.015
 neighborhood+0.015
 living+0.015
Neg logit contributions
le-0.013
of-0.013
war-0.013
umbs-0.013
les-0.012
Token walked
Token ID6807
Feature activation+1.12e+00

Pos logit contributions
 playing+0.064
ich+0.062
 office+0.060
 neighborhood+0.059
 living+0.057
Neg logit contributions
le-0.080
of-0.077
les-0.075
war-0.074
umbs-0.074
Token around
Token ID1088
Feature activation+2.62e+00

Pos logit contributions
 playing+0.067
 office+0.062
ich+0.062
 neighborhood+0.061
 living+0.060
Neg logit contributions
le-0.080
of-0.077
umbs-0.076
war-0.076
les-0.075
Token the
Token ID262
Feature activation+1.68e-01

Pos logit contributions
ich+0.165
 playing+0.162
 office+0.158
 neighborhood+0.156
 smartest+0.151
Neg logit contributions
le-0.173
of-0.164
war-0.160
les-0.158
bs-0.154
Token neighborhood
Token ID6232
Feature activation+1.40e+01

Pos logit contributions
 playing+0.007
ich+0.006
 smartest+0.006
 saving+0.006
 oh+0.006
Neg logit contributions
of-0.013
les-0.013
le-0.013
war-0.013
 sip-0.013
Token and
Token ID290
Feature activation-2.91e+00

Pos logit contributions
ich+0.837
 seeker+0.798
 playing+0.786
 office+0.776
 neighborhood+0.768
Neg logit contributions
le-0.918
of-0.874
les-0.861
umbs-0.844
bs-0.833
Token finally
Token ID3443
Feature activation-1.99e+00

Pos logit contributions
le+0.203
bs+0.199
of+0.197
les+0.188
war+0.186
Neg logit contributions
ich-0.172
 office-0.162
 neighborhood-0.151
 playing-0.149
 factory-0.148
Token found
Token ID1043
Feature activation+3.56e+00

Pos logit contributions
le+0.152
of+0.147
war+0.144
les+0.143
umbs+0.142
Neg logit contributions
 playing-0.109
ich-0.104
 office-0.103
 neighborhood-0.101
 living-0.098
Token the
Token ID262
Feature activation+7.30e-02

Pos logit contributions
 playing+0.213
ich+0.204
 neighborhood+0.200
 office+0.199
 living+0.190
Neg logit contributions
le-0.250
of-0.240
war-0.236
umbs-0.235
les-0.235
Token car
Token ID1097
Feature activation+3.85e+00

Pos logit contributions
 playing+0.004
ich+0.004
 office+0.004
 neighborhood+0.003
 exploring+0.003
Neg logit contributions
le-0.005
of-0.005
les-0.005
war-0.005
umbs-0.005
Token liked
Token ID8288
Feature activation+4.98e+00

Pos logit contributions
ich+0.016
 seeking+0.016
 office+0.015
 seeker+0.014
 smartest+0.014
Neg logit contributions
le-0.025
les-0.023
of-0.022
bs-0.022
war-0.021
Token to
Token ID284
Feature activation+2.34e-03

Pos logit contributions
ich+0.335
 neighborhood+0.325
 office+0.323
 seeker+0.320
 playing+0.316
Neg logit contributions
war-0.270
of-0.263
umbs-0.259
le-0.259
les-0.254
Token walk
Token ID2513
Feature activation-2.49e+00

Pos logit contributions
 playing+0.000
ich+0.000
 office+0.000
 neighborhood+0.000
 living+0.000
Neg logit contributions
le-0.000
of-0.000
war-0.000
umbs-0.000
les-0.000
Token around
Token ID1088
Feature activation+2.59e+00

Pos logit contributions
le+0.209
of+0.198
war+0.197
les+0.196
umbs+0.192
Neg logit contributions
 playing-0.119
ich-0.115
 office-0.113
 neighborhood-0.110
 living-0.106
Token his
Token ID465
Feature activation+1.66e+00

Pos logit contributions
ich+0.192
 office+0.179
 smartest+0.178
 happiest+0.178
 neighborhood+0.173
Neg logit contributions
le-0.146
les-0.135
of-0.130
war-0.123
 sip-0.117
Token neighborhood
Token ID6232
Feature activation+1.40e+01

Pos logit contributions
 smartest+0.069
 oh+0.064
 happiest+0.063
 Sy+0.063
 saving+0.061
Neg logit contributions
les-0.122
of-0.121
le-0.114
bs-0.110
ushes-0.108
Token and
Token ID290
Feature activation-3.32e+00

Pos logit contributions
ich+1.064
 seeker+1.008
 smartest+0.943
 happiest+0.938
 Polic+0.935
Neg logit contributions
le-0.717
les-0.651
of-0.647
bs-0.591
umbs-0.583
Token look
Token ID804
Feature activation-3.69e+00

Pos logit contributions
of+0.183
bs+0.183
le+0.182
 sip+0.178
war+0.173
Neg logit contributions
ich-0.239
 office-0.215
 neighborhood-0.208
 neighbourhood-0.193
 factory-0.192
Token at
Token ID379
Feature activation+2.49e+00

Pos logit contributions
le+0.260
les+0.248
of+0.239
bs+0.233
war+0.233
Neg logit contributions
ich-0.225
 playing-0.216
 office-0.215
 neighborhood-0.212
 living-0.198
Token all
Token ID477
Feature activation+2.88e+00

Pos logit contributions
 playing+0.151
ich+0.145
 office+0.141
 neighborhood+0.140
 living+0.135
Neg logit contributions
le-0.172
of-0.166
les-0.164
war-0.162
umbs-0.161
Token the
Token ID262
Feature activation-8.22e-02

Pos logit contributions
 playing+0.174
 office+0.159
 living+0.158
 neighborhood+0.156
ich+0.153
Neg logit contributions
le-0.208
umbs-0.200
war-0.199
of-0.199
les-0.192
Token loved
Token ID6151
Feature activation+8.31e+00

Pos logit contributions
ich+0.061
 swamp+0.058
 office+0.057
 factory+0.057
 seeker+0.055
Neg logit contributions
of-0.083
les-0.081
bs-0.080
le-0.079
war-0.078
Token to
Token ID284
Feature activation-9.37e-02

Pos logit contributions
ich+0.581
 seeker+0.559
 neighborhood+0.553
 happiest+0.552
 smartest+0.543
Neg logit contributions
of-0.424
war-0.422
les-0.401
le-0.396
bs-0.380
Token ride
Token ID6594
Feature activation-6.06e-01

Pos logit contributions
le+0.007
of+0.006
umbs+0.006
war+0.006
les+0.006
Neg logit contributions
 playing-0.006
 office-0.005
ich-0.005
 neighborhood-0.005
 living-0.005
Token around
Token ID1088
Feature activation+2.57e+00

Pos logit contributions
le+0.040
war+0.037
of+0.037
les+0.037
bs+0.035
Neg logit contributions
ich-0.039
 playing-0.038
 neighborhood-0.038
 office-0.038
 living-0.036
Token his
Token ID465
Feature activation+1.53e+00

Pos logit contributions
ich+0.181
 office+0.173
 happiest+0.168
 smartest+0.167
 neighborhood+0.166
Neg logit contributions
le-0.153
les-0.141
of-0.136
war-0.132
umbs-0.121
Token neighborhood
Token ID6232
Feature activation+1.40e+01

Pos logit contributions
 seeking+0.059
 smartest+0.058
 saving+0.057
 oh+0.056
 Sy+0.056
Neg logit contributions
les-0.113
of-0.112
le-0.105
war-0.102
ushes-0.101
Token.
Token ID13
Feature activation-7.94e-01

Pos logit contributions
ich+1.137
 seeker+1.087
 playing+1.055
 neighborhood+1.054
 office+1.042
Neg logit contributions
le-0.636
of-0.580
les-0.575
war-0.545
umbs-0.532
Token One
Token ID1881
Feature activation-4.84e+00

Pos logit contributions
of+0.045
umbs+0.045
war+0.044
le+0.043
les+0.041
Neg logit contributions
ich-0.057
 playing-0.056
 office-0.053
 neighborhood-0.052
 living-0.052
Token day
Token ID1110
Feature activation-1.09e-01

Pos logit contributions
le+0.390
of+0.379
war+0.372
les+0.363
umbs+0.360
Neg logit contributions
 playing-0.234
ich-0.226
 office-0.218
 neighborhood-0.215
 seeking-0.212
Token,
Token ID11
Feature activation+2.70e+00

Pos logit contributions
bs+0.007
le+0.007
of+0.007
war+0.007
les+0.007
Neg logit contributions
 office-0.007
ich-0.007
 neighborhood-0.007
 happiest-0.006
 smartest-0.006
Token Tim
Token ID5045
Feature activation+4.29e+00

Pos logit contributions
 office+0.161
 neighborhood+0.160
ich+0.158
 living+0.149
 neighbourhood+0.147
Neg logit contributions
le-0.179
of-0.178
war-0.177
bs-0.176
 sip-0.173
Token loved
Token ID6151
Feature activation+8.41e+00

Pos logit contributions
 seeking+0.013
ich+0.013
 office+0.013
 seeker+0.012
 factory+0.012
Neg logit contributions
le-0.023
les-0.021
of-0.020
bs-0.020
war-0.019
Token to
Token ID284
Feature activation-6.36e-02

Pos logit contributions
ich+0.541
 neighborhood+0.526
 seeker+0.522
 office+0.518
 happiest+0.513
Neg logit contributions
war-0.475
of-0.470
le-0.463
les-0.462
umbs-0.454
Token wander
Token ID27776
Feature activation-8.36e-01

Pos logit contributions
le+0.005
of+0.005
umbs+0.005
war+0.005
les+0.005
Neg logit contributions
 playing-0.003
 office-0.003
ich-0.003
 neighborhood-0.003
 living-0.003
Token around
Token ID1088
Feature activation+2.59e+00

Pos logit contributions
le+0.067
umbs+0.066
of+0.065
war+0.065
les+0.063
Neg logit contributions
 playing-0.044
 office-0.039
 neighborhood-0.038
 living-0.038
ich-0.038
Token his
Token ID465
Feature activation+1.61e+00

Pos logit contributions
ich+0.183
 office+0.171
 smartest+0.169
 happiest+0.169
 neighborhood+0.167
Neg logit contributions
le-0.153
les-0.142
of-0.138
war-0.131
 sip-0.125
Token neighborhood
Token ID6232
Feature activation+1.40e+01

Pos logit contributions
 smartest+0.069
 oh+0.068
 happiest+0.064
 Sy+0.063
 saving+0.061
Neg logit contributions
les-0.116
of-0.115
le-0.109
ushes-0.104
bs-0.104
Token and
Token ID290
Feature activation-3.39e+00

Pos logit contributions
ich+1.091
 seeker+1.024
 Polic+0.974
 smartest+0.959
 happiest+0.956
Neg logit contributions
le-0.708
of-0.657
les-0.640
bs-0.587
umbs-0.569
Token explore
Token ID7301
Feature activation+5.10e+00

Pos logit contributions
of+0.196
bs+0.190
le+0.187
 sip+0.185
war+0.177
Neg logit contributions
ich-0.241
 office-0.215
 neighborhood-0.205
 factory-0.195
 playing-0.193
Token new
Token ID649
Feature activation-5.91e-01

Pos logit contributions
 playing+0.306
ich+0.289
 office+0.287
 neighborhood+0.281
 living+0.274
Neg logit contributions
le-0.362
of-0.347
war-0.341
les-0.340
umbs-0.340
Token things
Token ID1243
Feature activation+2.22e+00

Pos logit contributions
le+0.039
les+0.037
of+0.036
war+0.034
bs+0.033
Neg logit contributions
ich-0.039
 playing-0.038
 office-0.036
 living-0.034
 happiest-0.034
Token.
Token ID13
Feature activation-8.37e-01

Pos logit contributions
 playing+0.150
ich+0.141
 office+0.141
 neighborhood+0.139
 living+0.136
Neg logit contributions
le-0.141
of-0.134
war-0.133
umbs-0.133
les-0.131
Token liked
Token ID8288
Feature activation+4.88e+00

Pos logit contributions
le+0.192
of+0.188
les+0.182
war+0.181
bs+0.179
Neg logit contributions
 playing-0.137
ich-0.134
 office-0.132
 neighborhood-0.126
 living-0.123
Token to
Token ID284
Feature activation+6.37e-02

Pos logit contributions
ich+0.317
 office+0.314
 neighborhood+0.307
 playing+0.301
 seeker+0.295
Neg logit contributions
le-0.292
of-0.290
war-0.283
les-0.277
bs-0.267
Token jog
Token ID48342
Feature activation+2.97e+00

Pos logit contributions
 playing+0.004
ich+0.004
 office+0.004
 neighborhood+0.004
 living+0.003
Neg logit contributions
le-0.005
of-0.004
war-0.004
les-0.004
umbs-0.004
Token around
Token ID1088
Feature activation+2.63e+00

Pos logit contributions
ich+0.183
 office+0.182
 playing+0.181
 neighborhood+0.173
 living+0.167
Neg logit contributions
le-0.204
of-0.194
les-0.192
war-0.188
bs-0.183
Token their
Token ID511
Feature activation-3.51e+00

Pos logit contributions
ich+0.178
 office+0.174
 playing+0.169
 neighborhood+0.168
 smartest+0.162
Neg logit contributions
le-0.165
les-0.153
of-0.153
war-0.144
bs-0.139
Token neighborhood
Token ID6232
Feature activation+1.40e+01

Pos logit contributions
of+0.239
les+0.238
le+0.232
war+0.212
bs+0.207
Neg logit contributions
ich-0.176
 office-0.174
 saving-0.173
 seeker-0.169
ter-0.168
Token every
Token ID790
Feature activation+6.20e-01

Pos logit contributions
ich+1.027
 office+0.977
 playing+0.960
 neighborhood+0.957
 seeker+0.954
Neg logit contributions
le-0.757
of-0.721
les-0.709
war-0.654
bs-0.652
Token morning
Token ID3329
Feature activation-3.00e+00

Pos logit contributions
 playing+0.028
 office+0.025
ich+0.024
 neighborhood+0.024
 living+0.024
Neg logit contributions
le-0.054
umbs-0.052
of-0.052
war-0.052
les-0.051
Token and
Token ID290
Feature activation-3.29e+00

Pos logit contributions
le+0.189
of+0.179
umbs+0.177
war+0.176
les+0.175
Neg logit contributions
 playing-0.202
ich-0.194
 office-0.192
 neighborhood-0.189
 living-0.184
Token it
Token ID340
Feature activation-6.64e+00

Pos logit contributions
le+0.217
of+0.213
bs+0.208
war+0.206
les+0.200
Neg logit contributions
ich-0.208
 office-0.194
 neighborhood-0.190
 playing-0.189
 factory-0.177
Token was
Token ID373
Feature activation+5.03e+00

Pos logit contributions
le+0.538
les+0.520
of+0.511
war+0.492
umbs+0.482
Neg logit contributions
ich-0.334
 playing-0.330
 office-0.317
 neighborhood-0.297
 seeker-0.289
Token Bob
Token ID5811
Feature activation+2.76e-01

Pos logit contributions
le+0.004
of+0.004
les+0.004
umbs+0.003
war+0.003
Neg logit contributions
 office-0.003
 playing-0.003
ich-0.003
 neighborhood-0.003
 living-0.003
Token continued
Token ID3767
Feature activation+3.49e+00

Pos logit contributions
 playing+0.016
 office+0.016
ich+0.016
 neighborhood+0.015
 living+0.015
Neg logit contributions
le-0.020
of-0.019
les-0.019
umbs-0.018
war-0.018
Token to
Token ID284
Feature activation-3.96e-01

Pos logit contributions
ich+0.193
 office+0.185
 playing+0.183
 neighborhood+0.182
 seeker+0.173
Neg logit contributions
le-0.249
of-0.240
les-0.238
war-0.236
bs-0.233
Token examine
Token ID10716
Feature activation+1.50e+00

Pos logit contributions
le+0.027
of+0.026
war+0.026
les+0.025
bs+0.025
Neg logit contributions
ich-0.024
 playing-0.023
 office-0.022
 neighborhood-0.022
 living-0.021
Token the
Token ID262
Feature activation+1.66e-02

Pos logit contributions
 playing+0.091
 office+0.082
 living+0.082
 neighborhood+0.081
ich+0.080
Neg logit contributions
le-0.105
umbs-0.105
war-0.104
of-0.103
les-0.099
Token neighborhood
Token ID6232
Feature activation+1.40e+01

Pos logit contributions
 playing+0.001
ich+0.001
 office+0.001
 neighborhood+0.001
 living+0.001
Neg logit contributions
le-0.001
of-0.001
les-0.001
war-0.001
umbs-0.001
Token looking
Token ID2045
Feature activation+4.81e+00

Pos logit contributions
 playing+0.893
ich+0.842
 office+0.834
 neighborhood+0.826
 living+0.807
Neg logit contributions
le-0.933
of-0.890
umbs-0.883
war-0.880
les-0.872
Token for
Token ID329
Feature activation-2.28e-01

Pos logit contributions
 playing+0.270
 office+0.244
 neighborhood+0.241
 living+0.240
ich+0.239
Neg logit contributions
le-0.362
umbs-0.352
of-0.352
war-0.350
les-0.341
Token the
Token ID262
Feature activation-1.72e-01

Pos logit contributions
le+0.016
umbs+0.016
war+0.016
of+0.016
les+0.016
Neg logit contributions
 playing-0.014
 office-0.012
 neighborhood-0.012
 living-0.012
ich-0.012
Token surprise
Token ID5975
Feature activation-1.95e+00

Pos logit contributions
le+0.012
of+0.012
les+0.012
war+0.012
umbs+0.012
Neg logit contributions
 playing-0.009
ich-0.009
 office-0.008
 neighborhood-0.008
 living-0.008
Token.
Token ID13
Feature activation-5.86e-01

Pos logit contributions
le+0.125
of+0.119
umbs+0.118
war+0.117
les+0.116
Neg logit contributions
 playing-0.130
ich-0.123
 office-0.122
 neighborhood-0.121
 living-0.118
Token rode
Token ID22075
Feature activation+2.52e+00

Pos logit contributions
le+0.337
of+0.323
bs+0.319
les+0.318
war+0.317
Neg logit contributions
ich-0.299
 playing-0.293
 office-0.287
 neighborhood-0.285
 living-0.269
Token it
Token ID340
Feature activation-7.08e+00

Pos logit contributions
 playing+0.160
 office+0.144
 neighborhood+0.143
 living+0.143
ich+0.142
Neg logit contributions
le-0.171
umbs-0.170
of-0.164
war-0.164
les-0.160
Token all
Token ID477
Feature activation+2.59e+00

Pos logit contributions
le+0.503
of+0.486
war+0.480
les+0.476
umbs+0.467
Neg logit contributions
 playing-0.414
ich-0.402
 office-0.391
 neighborhood-0.386
 living-0.371
Token around
Token ID1088
Feature activation+2.39e+00

Pos logit contributions
ich+0.137
 playing+0.136
 office+0.132
 neighborhood+0.127
 factory+0.120
Neg logit contributions
le-0.200
of-0.193
les-0.190
war-0.188
umbs-0.181
Token the
Token ID262
Feature activation-1.86e-01

Pos logit contributions
ich+0.168
 playing+0.167
 office+0.162
 neighborhood+0.159
 living+0.153
Neg logit contributions
le-0.143
of-0.132
les-0.130
war-0.128
umbs-0.125
Token neighborhood
Token ID6232
Feature activation+1.40e+01

Pos logit contributions
le+0.016
les+0.015
of+0.015
war+0.015
umbs+0.014
Neg logit contributions
 playing-0.008
ich-0.007
 office-0.007
 exploring-0.006
 seeking-0.006
Token.
Token ID13
Feature activation-4.92e-01

Pos logit contributions
ich+0.945
 playing+0.921
 office+0.903
 neighborhood+0.898
 seeker+0.882
Neg logit contributions
le-0.835
of-0.787
les-0.779
umbs-0.762
war-0.751
Token He
Token ID679
Feature activation+1.69e+00

Pos logit contributions
umbs+0.032
of+0.031
le+0.031
war+0.031
bs+0.030
Neg logit contributions
 playing-0.031
ich-0.030
 office-0.030
 neighborhood-0.029
 living-0.028
Token even
Token ID772
Feature activation+4.05e+00

Pos logit contributions
ich+0.086
 playing+0.086
 office+0.084
 neighborhood+0.079
 living+0.078
Neg logit contributions
le-0.132
of-0.128
les-0.126
war-0.120
bs-0.119
Token rode
Token ID22075
Feature activation+3.39e+00

Pos logit contributions
 playing+0.225
 office+0.205
ich+0.203
 neighborhood+0.202
 living+0.199
Neg logit contributions
le-0.307
umbs-0.299
of-0.295
war-0.295
les-0.289
Token it
Token ID340
Feature activation-6.49e+00

Pos logit contributions
 playing+0.211
 office+0.192
ich+0.192
 neighborhood+0.190
 living+0.188
Neg logit contributions
le-0.236
umbs-0.228
of-0.226
war-0.224
les-0.221
Token girl
Token ID2576
Feature activation+1.75e+00

Pos logit contributions
le+0.046
les+0.044
of+0.044
bs+0.042
war+0.042
Neg logit contributions
 playing-0.034
ich-0.033
 office-0.032
 neighborhood-0.031
 living-0.030
Token wanted
Token ID2227
Feature activation+3.43e+00

Pos logit contributions
ich+0.100
 playing+0.096
 office+0.094
 seeker+0.091
 factory+0.090
Neg logit contributions
le-0.128
of-0.124
les-0.120
bs-0.117
war-0.112
Token to
Token ID284
Feature activation+1.02e-01

Pos logit contributions
 playing+0.212
ich+0.205
 office+0.199
 neighborhood+0.199
 living+0.191
Neg logit contributions
le-0.233
of-0.226
war-0.222
umbs-0.219
les-0.216
Token explore
Token ID7301
Feature activation+5.21e+00

Pos logit contributions
 playing+0.007
ich+0.006
 office+0.006
 neighborhood+0.006
 living+0.006
Neg logit contributions
le-0.007
of-0.007
war-0.006
umbs-0.006
les-0.006
Token her
Token ID607
Feature activation+6.65e-01

Pos logit contributions
 playing+0.310
 office+0.283
 neighborhood+0.279
ich+0.278
 living+0.277
Neg logit contributions
le-0.376
umbs-0.367
war-0.364
of-0.362
les-0.355
Token neighborhood
Token ID6232
Feature activation+1.40e+01

Pos logit contributions
ich+0.034
 saving+0.033
 playing+0.032
ter+0.032
gy+0.032
Neg logit contributions
of-0.045
les-0.045
le-0.044
bs-0.042
war-0.040
Token,
Token ID11
Feature activation+2.90e+00

Pos logit contributions
ich+0.922
 playing+0.863
 neighborhood+0.839
 office+0.821
 seeker+0.818
Neg logit contributions
le-0.881
of-0.835
les-0.801
umbs-0.794
war-0.788
Token but
Token ID475
Feature activation-1.40e+00

Pos logit contributions
 playing+0.151
ich+0.145
 office+0.142
 neighborhood+0.140
 living+0.135
Neg logit contributions
le-0.223
of-0.218
umbs-0.214
war-0.214
les-0.210
Token it
Token ID340
Feature activation-6.47e+00

Pos logit contributions
le+0.084
bs+0.083
of+0.083
war+0.076
les+0.074
Neg logit contributions
ich-0.092
 neighborhood-0.089
 playing-0.087
 office-0.086
 living-0.085
Token was
Token ID373
Feature activation+5.22e+00

Pos logit contributions
le+0.545
of+0.504
les+0.499
war+0.484
bs+0.479
Neg logit contributions
 playing-0.320
ich-0.307
 office-0.290
 neighborhood-0.282
 living-0.278
Token a
Token ID257
Feature activation-9.39e-02

Pos logit contributions
 playing+0.323
ich+0.322
 office+0.311
 neighborhood+0.306
 living+0.293
Neg logit contributions
le-0.343
umbs-0.325
of-0.324
war-0.322
les-0.314
Token loved
Token ID6151
Feature activation+8.29e+00

Pos logit contributions
ich+0.011
 seeking+0.011
 seeker+0.010
 office+0.010
 smartest+0.010
Neg logit contributions
le-0.017
les-0.016
bs-0.015
of-0.015
war-0.014
Token to
Token ID284
Feature activation-1.40e-01

Pos logit contributions
ich+0.535
 neighborhood+0.518
 seeker+0.514
 office+0.510
 playing+0.502
Neg logit contributions
war-0.473
of-0.467
le-0.464
les-0.462
umbs-0.455
Token walk
Token ID2513
Feature activation-2.62e+00

Pos logit contributions
le+0.011
of+0.011
umbs+0.011
war+0.011
les+0.011
Neg logit contributions
 playing-0.007
 office-0.006
ich-0.006
 neighborhood-0.006
 living-0.006
Token around
Token ID1088
Feature activation+2.52e+00

Pos logit contributions
le+0.221
of+0.211
war+0.210
les+0.208
umbs+0.207
Neg logit contributions
 playing-0.123
ich-0.117
 office-0.115
 neighborhood-0.112
 living-0.108
Token his
Token ID465
Feature activation+1.56e+00

Pos logit contributions
ich+0.184
 office+0.171
 smartest+0.171
 happiest+0.171
 neighborhood+0.165
Neg logit contributions
le-0.143
les-0.133
of-0.128
war-0.121
 sip-0.114
Token neighborhood
Token ID6232
Feature activation+1.39e+01

Pos logit contributions
 smartest+0.067
 oh+0.063
 happiest+0.061
 seeker+0.060
 seeking+0.059
Neg logit contributions
les-0.112
of-0.111
le-0.104
bs-0.102
ushes-0.101
Token and
Token ID290
Feature activation-3.39e+00

Pos logit contributions
ich+1.045
 seeker+0.999
 smartest+0.929
 happiest+0.919
 Polic+0.919
Neg logit contributions
le-0.723
of-0.667
les-0.662
bs-0.601
umbs-0.594
Token look
Token ID804
Feature activation-3.82e+00

Pos logit contributions
of+0.195
bs+0.191
le+0.191
 sip+0.188
war+0.181
Neg logit contributions
ich-0.236
 office-0.212
 neighborhood-0.204
 smartest-0.192
 factory-0.189
Token at
Token ID379
Feature activation+2.40e+00

Pos logit contributions
le+0.268
les+0.255
of+0.247
bs+0.240
war+0.239
Neg logit contributions
ich-0.234
 playing-0.224
 office-0.223
 neighborhood-0.220
 seeking-0.206
Token all
Token ID477
Feature activation+2.84e+00

Pos logit contributions
 playing+0.144
ich+0.138
 office+0.135
 neighborhood+0.133
 living+0.128
Neg logit contributions
le-0.167
of-0.160
les-0.159
war-0.157
umbs-0.156
Token the
Token ID262
Feature activation-1.98e-01

Pos logit contributions
 playing+0.173
 office+0.158
 living+0.157
 neighborhood+0.156
ich+0.154
Neg logit contributions
le-0.203
umbs-0.195
of-0.194
war-0.194
les-0.188
Token liked
Token ID8288
Feature activation+4.94e+00

Pos logit contributions
 seeking+0.013
ich+0.013
 office+0.012
 seeker+0.012
 smartest+0.011
Neg logit contributions
le-0.020
les-0.019
of-0.018
bs-0.018
war-0.017
Token to
Token ID284
Feature activation-2.17e-02

Pos logit contributions
ich+0.328
 neighborhood+0.319
 office+0.317
 playing+0.312
 seeker+0.312
Neg logit contributions
war-0.273
of-0.268
umbs-0.265
le-0.264
les-0.258
Token walk
Token ID2513
Feature activation-2.59e+00

Pos logit contributions
le+0.002
of+0.002
war+0.002
umbs+0.002
les+0.002
Neg logit contributions
 playing-0.001
ich-0.001
 office-0.001
 neighborhood-0.001
 living-0.001
Token around
Token ID1088
Feature activation+2.56e+00

Pos logit contributions
le+0.215
of+0.204
war+0.203
les+0.202
umbs+0.199
Neg logit contributions
 playing-0.125
ich-0.121
 office-0.119
 neighborhood-0.115
 living-0.111
Token his
Token ID465
Feature activation+1.61e+00

Pos logit contributions
ich+0.188
 office+0.176
 smartest+0.174
 happiest+0.173
 neighborhood+0.169
Neg logit contributions
le-0.145
les-0.134
of-0.130
war-0.123
 sip-0.116
Token neighborhood
Token ID6232
Feature activation+1.39e+01

Pos logit contributions
 smartest+0.067
 oh+0.063
 happiest+0.061
 Sy+0.061
 saving+0.059
Neg logit contributions
les-0.118
of-0.118
le-0.110
bs-0.106
ushes-0.106
Token with
Token ID351
Feature activation+4.71e+00

Pos logit contributions
ich+1.067
 seeker+1.013
 smartest+0.949
 happiest+0.944
 Polic+0.939
Neg logit contributions
le-0.704
of-0.639
les-0.637
bs-0.580
umbs-0.573
Token his
Token ID465
Feature activation+1.58e+00

Pos logit contributions
 playing+0.271
 office+0.244
 neighborhood+0.240
ich+0.239
 living+0.238
Neg logit contributions
le-0.362
umbs-0.346
of-0.345
war-0.345
les-0.340
Token mom
Token ID1995
Feature activation+2.68e+00

Pos logit contributions
 office+0.080
 seeking+0.076
 playing+0.073
ich+0.072
 smartest+0.072
Neg logit contributions
bs-0.105
of-0.097
forts-0.096
war-0.094
les-0.093
Tokenmy
Token ID1820
Feature activation+7.19e-02

Pos logit contributions
 playing+0.146
ich+0.135
 office+0.134
 neighborhood+0.132
 living+0.129
Neg logit contributions
le-0.206
of-0.198
umbs-0.197
war-0.196
les-0.194
Token.
Token ID13
Feature activation-7.73e-01

Pos logit contributions
 playing+0.005
ich+0.005
 office+0.005
 neighborhood+0.005
 living+0.004
Neg logit contributions
le-0.005
of-0.004
umbs-0.004
war-0.004
les-0.004
Token go
Token ID467
Feature activation-1.12e+00

Pos logit contributions
 playing+0.008
ich+0.008
 office+0.008
 neighborhood+0.008
 living+0.008
Neg logit contributions
le-0.009
of-0.008
umbs-0.008
war-0.008
les-0.008
Token outside
Token ID2354
Feature activation+7.12e+00

Pos logit contributions
le+0.080
of+0.077
war+0.074
les+0.074
umbs+0.074
Neg logit contributions
 playing-0.064
ich-0.063
 neighborhood-0.061
 office-0.061
 living-0.060
Token and
Token ID290
Feature activation-3.17e+00

Pos logit contributions
 playing+0.479
ich+0.462
 office+0.452
 neighborhood+0.451
 living+0.440
Neg logit contributions
le-0.444
of-0.421
umbs-0.416
war-0.412
les-0.409
Token explore
Token ID7301
Feature activation+5.09e+00

Pos logit contributions
le+0.206
of+0.201
bs+0.197
 sip+0.194
war+0.194
Neg logit contributions
ich-0.206
 neighborhood-0.188
 office-0.185
 playing-0.180
 seeker-0.176
Token the
Token ID262
Feature activation+1.47e-02

Pos logit contributions
 playing+0.321
 office+0.297
ich+0.296
 neighborhood+0.293
 living+0.289
Neg logit contributions
le-0.348
of-0.333
umbs-0.333
war-0.332
les-0.326
Token neighborhood
Token ID6232
Feature activation+1.39e+01

Pos logit contributions
 playing+0.001
ich+0.001
 office+0.001
 saving+0.001
 seeking+0.000
Neg logit contributions
le-0.001
les-0.001
of-0.001
war-0.001
umbs-0.001
Token.
Token ID13
Feature activation-4.15e-01

Pos logit contributions
ich+0.911
 playing+0.909
 neighborhood+0.871
 office+0.868
 seeker+0.844
Neg logit contributions
le-0.882
of-0.831
les-0.817
umbs-0.803
war-0.803
Token His
Token ID2399
Feature activation+6.43e-01

Pos logit contributions
le+0.028
of+0.027
umbs+0.027
war+0.027
les+0.026
Neg logit contributions
 playing-0.026
ich-0.025
 office-0.024
 neighborhood-0.024
 living-0.024
Token parents
Token ID3397
Feature activation+7.44e+00

Pos logit contributions
ich+0.034
 playing+0.032
 Sy+0.030
 seeking+0.029
 seeker+0.029
Neg logit contributions
le-0.044
les-0.044
of-0.042
bs-0.041
forts-0.040
Token were
Token ID547
Feature activation-1.06e+00

Pos logit contributions
 playing+0.491
 office+0.450
 neighborhood+0.445
 living+0.441
ich+0.439
Neg logit contributions
le-0.483
war-0.477
umbs-0.475
of-0.459
les-0.454
Token hesitant
Token ID32848
Feature activation+1.56e-01

Pos logit contributions
le+0.083
umbs+0.080
of+0.078
les+0.077
war+0.077
Neg logit contributions
ich-0.050
 playing-0.049
 neighborhood-0.047
 office-0.047
 factory-0.045
Token all
Token ID477
Feature activation+2.97e+00

Pos logit contributions
le+0.193
umbs+0.186
of+0.185
war+0.185
les+0.182
Neg logit contributions
 playing-0.139
 office-0.127
ich-0.126
 neighborhood-0.125
 living-0.123
Token rode
Token ID22075
Feature activation+3.38e+00

Pos logit contributions
 playing+0.177
 office+0.158
 neighborhood+0.158
 living+0.157
ich+0.155
Neg logit contributions
le-0.214
umbs-0.210
war-0.206
of-0.204
les-0.201
Token together
Token ID1978
Feature activation-3.88e+00

Pos logit contributions
 playing+0.205
 neighborhood+0.179
 living+0.178
 office+0.177
 exploring+0.175
Neg logit contributions
umbs-0.243
le-0.242
war-0.231
of-0.230
les-0.227
Token through
Token ID832
Feature activation+1.32e+00

Pos logit contributions
le+0.273
umbs+0.264
war+0.261
of+0.260
les+0.256
Neg logit contributions
 playing-0.240
 office-0.217
ich-0.215
 neighborhood-0.215
 living-0.214
Token the
Token ID262
Feature activation+2.30e-01

Pos logit contributions
 playing+0.080
ich+0.076
 office+0.075
 neighborhood+0.074
 living+0.072
Neg logit contributions
le-0.092
of-0.088
war-0.087
umbs-0.086
les-0.086
Token neighborhood
Token ID6232
Feature activation+1.39e+01

Pos logit contributions
 playing+0.012
ich+0.011
 office+0.011
 neighborhood+0.011
 living+0.011
Neg logit contributions
le-0.018
of-0.018
war-0.017
les-0.017
umbs-0.017
Token.
Token ID13
Feature activation-6.25e-01

Pos logit contributions
 playing+0.866
ich+0.852
 office+0.828
 neighborhood+0.814
 seeker+0.792
Neg logit contributions
le-0.908
of-0.879
les-0.861
war-0.850
umbs-0.847
Token They
Token ID1119
Feature activation-2.57e+00

Pos logit contributions
le+0.041
of+0.040
umbs+0.040
war+0.039
les+0.038
Neg logit contributions
 playing-0.041
ich-0.039
 office-0.038
 neighborhood-0.037
 living-0.037
Token went
Token ID1816
Feature activation+2.05e+00

Pos logit contributions
le+0.189
of+0.186
les+0.180
war+0.179
umbs+0.177
Neg logit contributions
 playing-0.144
 office-0.139
ich-0.139
 neighborhood-0.133
 living-0.131
Token up
Token ID510
Feature activation+3.30e+00

Pos logit contributions
ich+0.124
 office+0.121
 playing+0.120
 neighborhood+0.117
 living+0.113
Neg logit contributions
of-0.134
le-0.134
les-0.131
war-0.129
bs-0.125
Token and
Token ID290
Feature activation-3.18e+00

Pos logit contributions
 playing+0.202
ich+0.198
 office+0.194
 neighborhood+0.190
 living+0.187
Neg logit contributions
le-0.222
of-0.216
war-0.212
les-0.209
umbs-0.208
Token"
Token ID1
Feature activation-5.52e-03

Pos logit contributions
le+0.004
les+0.003
umbs+0.003
war+0.003
of+0.003
Neg logit contributions
 playing-0.004
 neighborhood-0.003
ich-0.003
 office-0.003
 living-0.003
TokenLe
Token ID3123
Feature activation-5.93e+00

Pos logit contributions
le+0.000
umbs+0.000
les+0.000
of+0.000
war+0.000
Neg logit contributions
 playing-0.000
ich-0.000
 neighborhood-0.000
 living-0.000
 office-0.000
Token-
Token ID12
Feature activation-8.45e-01

Pos logit contributions
le+0.514
of+0.497
umbs+0.495
war+0.494
les+0.489
Neg logit contributions
 playing-0.265
 office-0.238
ich-0.238
 neighborhood-0.234
 living-0.228
Tokenle
Token ID293
Feature activation-1.75e+01

Pos logit contributions
le+0.053
umbs+0.052
war+0.050
of+0.050
les+0.050
Neg logit contributions
 playing-0.057
 office-0.054
ich-0.054
 living-0.052
 neighborhood-0.052
Token-
Token ID12
Feature activation-9.91e-01

Pos logit contributions
le+1.376
of+1.319
les+1.298
war+1.290
umbs+1.288
Neg logit contributions
 playing-0.913
ich-0.856
 neighborhood-0.848
 office-0.845
 living-0.810
Tokenle
Token ID293
Feature activation-1.76e+01

Pos logit contributions
le+0.059
umbs+0.058
war+0.056
les+0.056
of+0.055
Neg logit contributions
 playing-0.070
 office-0.066
ich-0.066
 living-0.064
 neighborhood-0.063
Tokenopard
Token ID15478
Feature activation+6.28e+00

Pos logit contributions
le+1.372
of+1.318
umbs+1.303
war+1.302
les+1.295
Neg logit contributions
 playing-0.936
ich-0.864
 office-0.861
 neighborhood-0.854
 living-0.828
Token,"
Token ID553
Feature activation-3.05e+00

Pos logit contributions
 playing+0.375
ich+0.351
 neighborhood+0.350
 office+0.350
 living+0.337
Neg logit contributions
le-0.448
of-0.427
les-0.421
war-0.421
umbs-0.418
Token she
Token ID673
Feature activation+9.87e-01

Pos logit contributions
le+0.212
umbs+0.205
war+0.204
of+0.202
les+0.201
Neg logit contributions
 playing-0.189
 office-0.175
ich-0.173
 neighborhood-0.170
 living-0.170
Token says
Token ID1139
Feature activation-8.41e+00

Pos logit contributions
 playing+0.085
 office+0.079
 living+0.078
 neighborhood+0.078
ich+0.076
Neg logit contributions
umbs-0.044
war-0.042
le-0.042
of-0.041
les-0.038
Token.
Token ID13
Feature activation-8.92e-01

Pos logit contributions
le+0.580
umbs+0.572
war+0.569
of+0.569
les+0.554
Neg logit contributions
 playing-0.538
 office-0.480
 living-0.475
ich-0.470
 neighborhood-0.455
Token One
Token ID1881
Feature activation-5.06e+00

Pos logit contributions
le+0.058
of+0.058
umbs+0.057
war+0.056
les+0.055
Neg logit contributions
 playing-0.059
ich-0.057
 office-0.055
 neighborhood-0.054
 living-0.053
Token day
Token ID1110
Feature activation-3.74e-01

Pos logit contributions
le+0.412
of+0.395
war+0.382
umbs+0.377
les+0.376
Neg logit contributions
 playing-0.244
ich-0.242
 neighborhood-0.227
 office-0.226
 seeking-0.215
Token,
Token ID11
Feature activation+2.57e+00

Pos logit contributions
le+0.027
of+0.027
bs+0.026
war+0.025
les+0.025
Neg logit contributions
ich-0.021
 neighborhood-0.021
 office-0.020
 playing-0.020
 happiest-0.019
Token Ann
Token ID5506
Feature activation-3.90e-01

Pos logit contributions
 playing+0.157
ich+0.152
 neighborhood+0.151
 office+0.150
 living+0.144
Neg logit contributions
le-0.176
of-0.171
war-0.166
umbs-0.165
bs-0.164
Tokenabel
Token ID9608
Feature activation-7.16e+00

Pos logit contributions
le+0.023
of+0.022
les+0.021
umbs+0.021
war+0.021
Neg logit contributions
 playing-0.029
ich-0.027
 office-0.027
 neighborhood-0.027
 living-0.026
Tokenle
Token ID293
Feature activation-1.76e+01

Pos logit contributions
le+0.339
of+0.322
umbs+0.322
war+0.321
les+0.310
Neg logit contributions
 playing-0.598
 office-0.565
ich-0.564
 neighborhood-0.560
 living-0.553
Token's
Token ID338
Feature activation-7.53e-01

Pos logit contributions
le+1.379
les+1.290
of+1.276
umbs+1.224
bs+1.215
Neg logit contributions
 playing-0.951
ich-0.948
 office-0.911
 neighborhood-0.886
 living-0.857
Token mom
Token ID1995
Feature activation+2.45e+00

Pos logit contributions
of+0.047
bs+0.047
le+0.046
umbs+0.045
 sip+0.044
Neg logit contributions
 playing-0.040
 seeking-0.039
 finding-0.038
ich-0.038
 exploring-0.037
Token asked
Token ID1965
Feature activation-3.28e-01

Pos logit contributions
 playing+0.142
ich+0.141
 office+0.135
 neighborhood+0.134
 seeker+0.130
Neg logit contributions
le-0.172
of-0.166
les-0.164
umbs-0.161
bs-0.154
Token her
Token ID607
Feature activation+2.75e-01

Pos logit contributions
le+0.023
of+0.022
war+0.022
umbs+0.022
les+0.022
Neg logit contributions
 playing-0.020
ich-0.019
 office-0.019
 neighborhood-0.018
 living-0.018
Token to
Token ID284
Feature activation-2.65e-01

Pos logit contributions
ich+0.019
 neighborhood+0.019
 playing+0.018
 neighbourhood+0.017
 seeker+0.017
Neg logit contributions
le-0.016
of-0.015
bs-0.015
war-0.015
les-0.014
Tokenable
Token ID540
Feature activation-1.06e+01

Pos logit contributions
 playing+0.869
 office+0.831
 living+0.819
ich+0.818
 neighborhood+0.816
Neg logit contributions
umbs-0.324
le-0.320
of-0.303
war-0.303
les-0.287
Token little
Token ID1310
Feature activation-1.02e+00

Pos logit contributions
le+0.718
les+0.681
of+0.657
 sip+0.644
bs+0.625
Neg logit contributions
 playing-0.558
 saving-0.547
 seeking-0.543
 smartest-0.528
ich-0.523
Token girl
Token ID2576
Feature activation+1.35e+00

Pos logit contributions
le+0.075
of+0.072
les+0.072
bs+0.070
 sip+0.068
Neg logit contributions
 playing-0.054
ich-0.052
 office-0.051
 neighborhood-0.051
 exploring-0.050
Token called
Token ID1444
Feature activation+4.10e+00

Pos logit contributions
ich+0.056
 happiest+0.052
 neighborhood+0.051
 smartest+0.050
 seeker+0.048
Neg logit contributions
le-0.120
bs-0.115
of-0.113
les-0.106
 sip-0.105
Token El
Token ID2574
Feature activation+3.24e+00

Pos logit contributions
ich+0.245
 playing+0.238
 neighborhood+0.237
 office+0.231
 living+0.218
Neg logit contributions
of-0.284
le-0.281
bs-0.272
war-0.271
umbs-0.270
Tokenle
Token ID293
Feature activation-1.76e+01

Pos logit contributions
 playing+0.269
 office+0.255
ich+0.253
 living+0.253
 neighborhood+0.253
Neg logit contributions
umbs-0.152
war-0.145
of-0.139
le-0.139
forts-0.135
Token.
Token ID13
Feature activation-9.21e-01

Pos logit contributions
le+1.017
of+0.909
les+0.889
bs+0.823
 sip+0.753
Neg logit contributions
 seeker-1.262
ich-1.233
 seeking-1.225
 office-1.213
 smartest-1.212
Token One
Token ID1881
Feature activation-5.05e+00

Pos logit contributions
le+0.071
of+0.069
umbs+0.068
war+0.068
les+0.067
Neg logit contributions
 playing-0.050
ich-0.046
 office-0.046
 neighborhood-0.045
 living-0.043
Token day
Token ID1110
Feature activation-3.08e-01

Pos logit contributions
le+0.409
of+0.386
war+0.381
umbs+0.375
bs+0.374
Neg logit contributions
 playing-0.248
ich-0.243
 office-0.235
 neighborhood-0.233
 seeking-0.219
Token,
Token ID11
Feature activation+2.58e+00

Pos logit contributions
le+0.021
of+0.021
bs+0.020
les+0.019
 sip+0.018
Neg logit contributions
 office-0.020
 neighborhood-0.019
ich-0.019
 neighbourhood-0.017
 factory-0.017
Token El
Token ID2574
Feature activation+3.24e+00

Pos logit contributions
 office+0.158
ich+0.157
 neighborhood+0.156
 playing+0.155
 factory+0.142
Neg logit contributions
le-0.175
of-0.172
bs-0.162
les-0.159
war-0.157
Token.
Token ID13
Feature activation-9.21e-01

Pos logit contributions
le+1.017
of+0.909
les+0.889
bs+0.823
 sip+0.753
Neg logit contributions
 seeker-1.262
ich-1.233
 seeking-1.225
 office-1.213
 smartest-1.212
Token One
Token ID1881
Feature activation-5.05e+00

Pos logit contributions
le+0.071
of+0.069
umbs+0.068
war+0.068
les+0.067
Neg logit contributions
 playing-0.050
ich-0.046
 office-0.046
 neighborhood-0.045
 living-0.043
Token day
Token ID1110
Feature activation-3.08e-01

Pos logit contributions
le+0.409
of+0.386
war+0.381
umbs+0.375
bs+0.374
Neg logit contributions
 playing-0.248
ich-0.243
 office-0.235
 neighborhood-0.233
 seeking-0.219
Token,
Token ID11
Feature activation+2.58e+00

Pos logit contributions
le+0.021
of+0.021
bs+0.020
les+0.019
 sip+0.018
Neg logit contributions
 office-0.020
 neighborhood-0.019
ich-0.019
 neighbourhood-0.017
 factory-0.017
Token El
Token ID2574
Feature activation+3.24e+00

Pos logit contributions
 office+0.158
ich+0.157
 neighborhood+0.156
 playing+0.155
 factory+0.142
Neg logit contributions
le-0.175
of-0.172
bs-0.162
les-0.159
war-0.157
Tokenle
Token ID293
Feature activation-1.76e+01

Pos logit contributions
 playing+0.371
 neighborhood+0.357
 living+0.356
 office+0.356
ich+0.354
Neg logit contributions
umbs-0.049
war-0.043
of-0.037
forts-0.034
bs-0.027
Token was
Token ID373
Feature activation+5.00e+00

Pos logit contributions
le+1.461
les+1.378
of+1.361
umbs+1.313
bs+1.310
Neg logit contributions
ich-0.862
 playing-0.857
 office-0.816
 neighborhood-0.777
 smartest-0.752
Token feeling
Token ID4203
Feature activation+5.68e+00

Pos logit contributions
 Driver+0.211
itor+0.202
ich+0.189
stack+0.183
 mall+0.183
Neg logit contributions
bs-0.366
 sip-0.340
 harder-0.327
le-0.327
 drift-0.324
Token brave
Token ID14802
Feature activation+3.33e+00

Pos logit contributions
ich+0.308
 neighborhood+0.287
 office+0.264
 backyard+0.254
 seeker+0.253
Neg logit contributions
le-0.433
bs-0.419
les-0.402
 sip-0.388
umbs-0.384
Token so
Token ID523
Feature activation+3.30e+00

Pos logit contributions
 playing+0.242
ich+0.231
 office+0.229
 neighborhood+0.227
 living+0.221
Neg logit contributions
le-0.194
of-0.182
umbs-0.181
les-0.180
war-0.180
Token she
Token ID673
Feature activation+1.04e+00

Pos logit contributions
ich+0.217
 office+0.211
 neighborhood+0.206
 factory+0.202
 playing+0.201
Neg logit contributions
bs-0.212
of-0.199
le-0.191
les-0.186
war-0.178
Token,
Token ID11
Feature activation+2.51e+00

Pos logit contributions
 playing+0.063
 office+0.058
 living+0.057
 neighborhood+0.056
ich+0.056
Neg logit contributions
le-0.078
of-0.075
war-0.075
umbs-0.075
les-0.073
Token and
Token ID290
Feature activation-3.58e+00

Pos logit contributions
 playing+0.167
 office+0.154
 living+0.151
 neighborhood+0.149
ich+0.149
Neg logit contributions
le-0.168
war-0.161
of-0.161
umbs-0.161
les-0.156
Token make
Token ID787
Feature activation-2.60e+00

Pos logit contributions
le+0.252
of+0.241
war+0.238
umbs+0.237
les+0.235
Neg logit contributions
 playing-0.214
ich-0.205
 office-0.199
 neighborhood-0.199
 living-0.192
Token everything
Token ID2279
Feature activation-1.49e-01

Pos logit contributions
umbs+0.176
le+0.173
of+0.172
war+0.171
forts+0.170
Neg logit contributions
 playing-0.170
 office-0.157
 living-0.153
 neighborhood-0.148
ich-0.144
Token spark
Token ID9009
Feature activation-4.96e+00

Pos logit contributions
le+0.011
umbs+0.011
war+0.011
of+0.011
les+0.011
Neg logit contributions
 playing-0.008
 office-0.007
 living-0.007
 neighborhood-0.007
ich-0.007
Tokenle
Token ID293
Feature activation-1.76e+01

Pos logit contributions
umbs+0.137
war+0.122
forts+0.119
of+0.107
 sip+0.099
Neg logit contributions
 playing-0.501
 living-0.470
 office-0.464
 best-0.460
 smartest-0.454
Token.
Token ID13
Feature activation-9.99e-01

Pos logit contributions
war+1.275
umbs+1.273
le+1.259
of+1.250
forts+1.220
Neg logit contributions
 playing-1.025
 living-0.922
 office-0.902
ich-0.887
 neighborhood-0.880
Token
Token ID220
Feature activation-3.47e+00

Pos logit contributions
le+0.071
of+0.068
umbs+0.068
war+0.068
les+0.066
Neg logit contributions
 playing-0.060
ich-0.056
 office-0.056
 neighborhood-0.055
 living-0.054
Token
Token ID198
Feature activation-1.41e-01

Pos logit contributions
le+0.235
of+0.229
umbs+0.229
war+0.226
les+0.221
Neg logit contributions
 playing-0.214
ich-0.201
 office-0.199
 neighborhood-0.195
 living-0.192
Token
Token ID198
Feature activation-1.27e-01

Pos logit contributions
le+0.011
les+0.010
war+0.010
umbs+0.010
of+0.010
Neg logit contributions
 playing-0.008
 office-0.007
 neighborhood-0.007
ich-0.007
 living-0.007
TokenMom
Token ID29252
Feature activation-2.45e+00

Pos logit contributions
le+0.008
umbs+0.007
of+0.007
war+0.007
les+0.007
Neg logit contributions
 playing-0.009
 office-0.008
ich-0.008
 neighborhood-0.008
 living-0.008
Token the
Token ID262
Feature activation-3.69e-01

Pos logit contributions
 playing+0.436
ich+0.416
 office+0.409
 neighborhood+0.407
 living+0.396
Neg logit contributions
le-0.440
of-0.419
umbs-0.416
war-0.413
les-0.409
Token window
Token ID4324
Feature activation-4.15e-01

Pos logit contributions
le+0.027
umbs+0.026
of+0.026
war+0.026
les+0.026
Neg logit contributions
 playing-0.020
ich-0.019
 office-0.017
 seeking-0.017
 exploring-0.017
Token.
Token ID13
Feature activation-8.62e-01

Pos logit contributions
le+0.027
of+0.026
war+0.026
umbs+0.026
les+0.025
Neg logit contributions
 playing-0.027
 office-0.026
ich-0.025
 neighborhood-0.025
 living-0.025
Token Ann
Token ID5506
Feature activation-3.97e-01

Pos logit contributions
le+0.065
umbs+0.061
of+0.061
war+0.061
les+0.061
Neg logit contributions
 playing-0.049
 office-0.045
 neighborhood-0.045
ich-0.044
 living-0.044
Tokenabel
Token ID9608
Feature activation-7.17e+00

Pos logit contributions
le+0.022
umbs+0.021
war+0.021
of+0.021
les+0.020
Neg logit contributions
 playing-0.030
 office-0.028
ich-0.028
 neighborhood-0.028
 living-0.028
Tokenle
Token ID293
Feature activation-1.76e+01

Pos logit contributions
umbs+0.140
war+0.136
of+0.115
forts+0.107
bs+0.099
Neg logit contributions
 playing-0.791
 office-0.752
 neighborhood-0.748
ich-0.745
 living-0.744
Token wanted
Token ID2227
Feature activation+3.23e+00

Pos logit contributions
le+1.431
of+1.370
umbs+1.356
war+1.352
les+1.351
Neg logit contributions
 playing-0.882
ich-0.818
 office-0.812
 neighborhood-0.800
 living-0.777
Token to
Token ID284
Feature activation-2.50e-01

Pos logit contributions
 playing+0.189
 office+0.174
ich+0.170
 neighborhood+0.169
 living+0.169
Neg logit contributions
le-0.238
umbs-0.229
of-0.228
war-0.228
les-0.225
Token do
Token ID466
Feature activation+3.55e-01

Pos logit contributions
le+0.019
of+0.018
war+0.018
umbs+0.018
les+0.018
Neg logit contributions
 playing-0.014
ich-0.013
 office-0.013
 neighborhood-0.013
 living-0.012
Token something
Token ID1223
Feature activation+5.30e-01

Pos logit contributions
ich+0.024
 playing+0.024
 neighborhood+0.023
 office+0.023
 living+0.022
Neg logit contributions
le-0.022
of-0.021
les-0.020
war-0.020
umbs-0.020
Token else
Token ID2073
Feature activation+1.51e-01

Pos logit contributions
 playing+0.031
ich+0.030
 office+0.029
 neighborhood+0.029
 living+0.028
Neg logit contributions
le-0.038
of-0.037
war-0.036
umbs-0.035
les-0.035
Token football
Token ID4346
Feature activation+5.05e+00

Pos logit contributions
ich+0.087
 playing+0.086
 office+0.084
 neighborhood+0.081
 living+0.078
Neg logit contributions
le-0.115
war-0.111
les-0.109
umbs-0.108
of-0.108
Token.
Token ID13
Feature activation-8.93e-01

Pos logit contributions
 playing+0.337
ich+0.334
 neighborhood+0.323
 office+0.318
 living+0.309
Neg logit contributions
le-0.312
war-0.298
of-0.296
umbs-0.294
les-0.289
Token She
Token ID1375
Feature activation+1.36e+00

Pos logit contributions
le+0.058
umbs+0.057
of+0.057
war+0.056
les+0.055
Neg logit contributions
 playing-0.056
ich-0.056
 office-0.053
 neighborhood-0.052
 living-0.051
Token would
Token ID561
Feature activation-1.54e+00

Pos logit contributions
ich+0.075
 playing+0.074
 office+0.071
 neighborhood+0.070
 living+0.066
Neg logit contributions
le-0.102
of-0.097
les-0.097
bs-0.095
war-0.093
Token tick
Token ID4378
Feature activation-4.97e+00

Pos logit contributions
le+0.106
umbs+0.101
of+0.101
war+0.101
les+0.100
Neg logit contributions
 playing-0.096
 office-0.089
ich-0.088
 neighborhood-0.087
 living-0.086
Tokenle
Token ID293
Feature activation-1.76e+01

Pos logit contributions
umbs+0.225
forts+0.203
of+0.197
war+0.192
bs+0.182
Neg logit contributions
 playing-0.447
 office-0.389
 living-0.386
ich-0.385
 exploring-0.383
Token him
Token ID683
Feature activation-1.98e+00

Pos logit contributions
umbs+1.293
le+1.282
war+1.259
of+1.245
forts+1.226
Neg logit contributions
 playing-1.036
 office-0.903
 living-0.895
ich-0.872
 neighborhood-0.852
Token as
Token ID355
Feature activation-1.57e+00

Pos logit contributions
le+0.128
of+0.121
war+0.121
umbs+0.119
les+0.119
Neg logit contributions
 playing-0.130
ich-0.124
 neighborhood-0.123
 office-0.123
 living-0.119
Token he
Token ID339
Feature activation+1.02e+00

Pos logit contributions
le+0.117
umbs+0.114
war+0.113
of+0.113
les+0.110
Neg logit contributions
 playing-0.091
 office-0.082
ich-0.082
 neighborhood-0.081
 living-0.080
Token put
Token ID1234
Feature activation-1.31e+00

Pos logit contributions
 playing+0.056
ich+0.053
 office+0.053
 neighborhood+0.051
 living+0.050
Neg logit contributions
le-0.079
of-0.075
les-0.075
war-0.073
umbs-0.073
Token it
Token ID340
Feature activation-6.77e+00

Pos logit contributions
umbs+0.098
of+0.096
le+0.094
war+0.094
forts+0.094
Neg logit contributions
 playing-0.079
 living-0.067
 office-0.063
 exploring-0.063
 neighborhood-0.060
Token Gold
Token ID3561
Feature activation+2.68e+00

Pos logit contributions
le+0.041
les+0.039
of+0.039
bs+0.037
umbs+0.037
Neg logit contributions
 playing-0.027
ich-0.026
 office-0.025
 neighborhood-0.025
 seeker-0.023
Token Land
Token ID6379
Feature activation-3.59e+00

Pos logit contributions
 playing+0.184
 office+0.173
ich+0.173
 neighborhood+0.171
 living+0.166
Neg logit contributions
le-0.170
of-0.162
les-0.160
umbs-0.156
war-0.155
Token started
Token ID2067
Feature activation+1.38e+00

Pos logit contributions
le+0.216
of+0.206
umbs+0.203
les+0.202
war+0.199
Neg logit contributions
 playing-0.253
ich-0.242
 office-0.239
 neighborhood-0.237
 living-0.228
Token to
Token ID284
Feature activation-2.07e-01

Pos logit contributions
 playing+0.081
ich+0.077
 office+0.076
 neighborhood+0.076
 living+0.073
Neg logit contributions
le-0.097
of-0.093
umbs-0.092
war-0.092
les-0.091
Token spark
Token ID9009
Feature activation-4.91e+00

Pos logit contributions
le+0.014
of+0.013
umbs+0.013
war+0.013
les+0.013
Neg logit contributions
 playing-0.013
ich-0.012
 office-0.012
 neighborhood-0.012
 living-0.012
Tokenle
Token ID293
Feature activation-1.76e+01

Pos logit contributions
umbs+0.146
war+0.122
forts+0.117
 sip+0.104
of+0.100
Neg logit contributions
 playing-0.493
 living-0.462
 office-0.458
ich-0.454
 neighborhood-0.450
Token and
Token ID290
Feature activation-3.55e+00

Pos logit contributions
le+1.268
of+1.212
umbs+1.199
war+1.195
les+1.191
Neg logit contributions
 playing-1.040
ich-0.972
 office-0.971
 neighborhood-0.959
 living-0.929
Token glitter
Token ID31133
Feature activation-8.61e+00

Pos logit contributions
le+0.212
of+0.209
bs+0.198
les+0.198
war+0.194
Neg logit contributions
ich-0.238
 office-0.233
 neighborhood-0.230
 playing-0.229
 factory-0.214
Token.
Token ID13
Feature activation-9.26e-01

Pos logit contributions
le+0.562
war+0.559
umbs+0.552
of+0.546
forts+0.532
Neg logit contributions
 playing-0.555
ich-0.506
 living-0.506
 office-0.503
 neighborhood-0.495
Token
Token ID220
Feature activation-3.38e+00

Pos logit contributions
of+0.058
umbs+0.057
le+0.057
war+0.056
les+0.054
Neg logit contributions
 playing-0.060
ich-0.059
 office-0.057
 neighborhood-0.055
 living-0.055
Token
Token ID198
Feature activation-8.01e-02

Pos logit contributions
le+0.243
of+0.233
umbs+0.232
war+0.232
les+0.229
Neg logit contributions
 playing-0.200
 office-0.185
ich-0.185
 neighborhood-0.182
 living-0.179
Token them
Token ID606
Feature activation-7.26e+00

Pos logit contributions
 playing+0.423
 living+0.386
 office+0.366
 exploring+0.335
 neighborhood+0.335
Neg logit contributions
umbs-0.500
war-0.428
ited-0.420
forts-0.415
olt-0.412
Token its
Token ID663
Feature activation-7.77e+00

Pos logit contributions
umbs+0.537
le+0.519
war+0.512
of+0.498
les+0.495
Neg logit contributions
 playing-0.431
 living-0.397
 office-0.383
 neighborhood-0.361
ich-0.360
Token light
Token ID1657
Feature activation-5.65e+00

Pos logit contributions
le+0.559
umbs+0.548
of+0.541
war+0.539
les+0.526
Neg logit contributions
 playing-0.477
 office-0.434
 living-0.433
 neighborhood-0.427
ich-0.426
Token and
Token ID290
Feature activation-3.58e+00

Pos logit contributions
war+0.411
umbs+0.411
of+0.401
le+0.397
bs+0.383
Neg logit contributions
 playing-0.331
 office-0.307
 living-0.305
 neighborhood-0.290
ich-0.286
Token spark
Token ID9009
Feature activation-4.86e+00

Pos logit contributions
le+0.236
of+0.226
war+0.224
umbs+0.223
les+0.221
Neg logit contributions
 playing-0.230
ich-0.217
 office-0.216
 neighborhood-0.214
 living-0.208
Tokenle
Token ID293
Feature activation-1.76e+01

Pos logit contributions
umbs+0.229
war+0.215
raw+0.191
 sip+0.182
irs+0.178
Neg logit contributions
 playing-0.400
 living-0.382
 best-0.358
 office-0.353
ich-0.345
Token.
Token ID13
Feature activation-9.66e-01

Pos logit contributions
war+1.217
umbs+1.212
le+1.191
of+1.179
les+1.135
Neg logit contributions
 playing-1.115
 living-1.005
 office-0.977
ich-0.977
 neighborhood-0.951
Token They
Token ID1119
Feature activation-3.01e+00

Pos logit contributions
le+0.065
war+0.061
of+0.061
les+0.060
umbs+0.060
Neg logit contributions
 playing-0.064
 office-0.059
ich-0.059
 neighborhood-0.059
 living-0.059
Token gave
Token ID2921
Feature activation-1.61e+00

Pos logit contributions
war+0.225
le+0.225
umbs+0.220
les+0.213
of+0.211
Neg logit contributions
 playing-0.170
 living-0.151
 neighborhood-0.151
 office-0.149
ich-0.148
Token it
Token ID340
Feature activation-6.75e+00

Pos logit contributions
umbs+0.118
war+0.108
forts+0.106
le+0.099
hell+0.099
Neg logit contributions
 playing-0.105
 living-0.093
ich-0.091
 office-0.090
 exploring-0.086
Token back
Token ID736
Feature activation+3.20e+00

Pos logit contributions
le+0.530
umbs+0.511
war+0.502
les+0.500
of+0.496
Neg logit contributions
 playing-0.372
 living-0.335
 office-0.334
ich-0.321
 neighborhood-0.320
Token the
Token ID262
Feature activation-3.47e-01

Pos logit contributions
ich+0.171
 neighborhood+0.167
 office+0.166
 playing+0.166
 living+0.159
Neg logit contributions
le-0.164
of-0.153
war-0.152
bs-0.149
les-0.149
Token toy
Token ID13373
Feature activation+3.14e+00

Pos logit contributions
le+0.026
les+0.025
of+0.024
war+0.024
umbs+0.024
Neg logit contributions
 playing-0.018
ich-0.018
 office-0.017
 neighborhood-0.017
 exploring-0.016
Token began
Token ID2540
Feature activation+2.05e-01

Pos logit contributions
 playing+0.159
ich+0.154
 exploring+0.153
 neighborhood+0.149
 office+0.146
Neg logit contributions
le-0.233
les-0.229
of-0.223
bs-0.208
sel-0.207
Token to
Token ID284
Feature activation-2.32e-01

Pos logit contributions
 playing+0.012
 office+0.011
ich+0.011
 neighborhood+0.011
 living+0.011
Neg logit contributions
le-0.015
of-0.014
umbs-0.014
war-0.014
les-0.014
Token spark
Token ID9009
Feature activation-4.91e+00

Pos logit contributions
umbs+0.016
war+0.016
le+0.016
of+0.015
les+0.015
Neg logit contributions
 playing-0.015
 office-0.014
 living-0.013
 neighborhood-0.013
ich-0.013
Tokenle
Token ID293
Feature activation-1.76e+01

Pos logit contributions
umbs+0.154
forts+0.133
war+0.122
bs+0.102
 sip+0.101
Neg logit contributions
 playing-0.492
 living-0.456
 office-0.453
 smartest-0.443
ich-0.443
Token and
Token ID290
Feature activation-3.52e+00

Pos logit contributions
le+1.268
war+1.259
umbs+1.256
of+1.240
forts+1.213
Neg logit contributions
 playing-1.036
 living-0.917
 office-0.907
ich-0.900
 neighborhood-0.892
Token tw
Token ID665
Feature activation-2.45e-01

Pos logit contributions
le+0.204
of+0.196
les+0.190
war+0.189
bs+0.187
Neg logit contributions
 playing-0.244
ich-0.244
 office-0.241
 neighborhood-0.239
 living-0.225
Tokeninkle
Token ID19894
Feature activation-1.05e+01

Pos logit contributions
umbs+0.014
war+0.014
le+0.014
forts+0.014
of+0.014
Neg logit contributions
 playing-0.018
ich-0.017
 office-0.016
 neighborhood-0.016
 living-0.016
Token like
Token ID588
Feature activation+1.53e+00

Pos logit contributions
war+0.803
umbs+0.794
of+0.793
le+0.793
forts+0.768
Neg logit contributions
 playing-0.577
 living-0.505
ich-0.499
 office-0.495
 neighborhood-0.488
Token stars
Token ID5788
Feature activation-3.94e+00

Pos logit contributions
 playing+0.089
ich+0.086
 office+0.085
 neighborhood+0.085
 living+0.081
Neg logit contributions
le-0.107
of-0.104
umbs-0.102
war-0.102
les-0.101
Token's
Token ID338
Feature activation-7.78e-01

Pos logit contributions
le+0.233
of+0.223
les+0.221
war+0.213
umbs+0.205
Neg logit contributions
 playing-0.220
ich-0.212
 neighborhood-0.203
 office-0.200
 seeking-0.199
Token nice
Token ID3621
Feature activation-1.78e+00

Pos logit contributions
le+0.049
of+0.048
war+0.046
fulness+0.043
bs+0.043
Neg logit contributions
ich-0.052
 neighborhood-0.049
 playing-0.047
 office-0.046
 living-0.045
Token of
Token ID286
Feature activation-7.77e+00

Pos logit contributions
le+0.122
of+0.119
umbs+0.116
les+0.116
war+0.115
Neg logit contributions
 playing-0.111
ich-0.106
 neighborhood-0.103
 office-0.103
 living-0.099
Token you
Token ID345
Feature activation+2.11e+00

Pos logit contributions
of+0.543
le+0.510
war+0.487
umbs+0.482
les+0.472
Neg logit contributions
ich-0.440
 neighborhood-0.432
 playing-0.418
 office-0.415
 seeker-0.389
Token El
Token ID2574
Feature activation+3.17e+00

Pos logit contributions
 playing+0.116
ich+0.114
 neighborhood+0.112
 office+0.111
 living+0.105
Neg logit contributions
le-0.155
of-0.152
les-0.146
umbs-0.146
war-0.145
Tokenle
Token ID293
Feature activation-1.76e+01

Pos logit contributions
 playing+0.273
 office+0.262
ich+0.258
 neighborhood+0.254
 living+0.253
Neg logit contributions
umbs-0.135
war-0.133
le-0.132
of-0.132
les-0.123
Token,
Token ID11
Feature activation+2.65e+00

Pos logit contributions
le+1.218
of+1.142
les+1.137
bs+1.082
umbs+1.078
Neg logit contributions
ich-1.061
 neighborhood-1.033
 playing-1.021
 office-1.003
 seeker-0.937
Token but
Token ID475
Feature activation-1.68e+00

Pos logit contributions
 playing+0.150
 office+0.138
ich+0.138
 neighborhood+0.136
 living+0.134
Neg logit contributions
le-0.198
of-0.190
umbs-0.190
war-0.189
les-0.187
Token you
Token ID345
Feature activation+2.12e+00

Pos logit contributions
le+0.115
of+0.113
bs+0.103
 sip+0.101
les+0.099
Neg logit contributions
 neighborhood-0.097
ich-0.096
 playing-0.092
 office-0.092
 neighbourhood-0.087
Token know
Token ID760
Feature activation-7.45e+00

Pos logit contributions
 playing+0.126
 office+0.115
 neighborhood+0.114
ich+0.113
 living+0.112
Neg logit contributions
le-0.152
war-0.147
of-0.147
umbs-0.146
les-0.144
Token he
Token ID339
Feature activation+1.01e+00

Pos logit contributions
le+0.502
of+0.500
les+0.474
bs+0.463
war+0.441
Neg logit contributions
ich-0.432
 neighborhood-0.426
 office-0.410
 playing-0.409
 seeker-0.403
Token sound
Token ID2128
Feature activation-3.17e+00

Pos logit contributions
le+0.005
of+0.005
les+0.005
war+0.005
umbs+0.005
Neg logit contributions
 playing-0.006
ich-0.005
 office-0.005
 seeking-0.005
 neighborhood-0.005
Token of
Token ID286
Feature activation-7.74e+00

Pos logit contributions
war+0.205
umbs+0.204
forts+0.197
of+0.196
ws+0.192
Neg logit contributions
 playing-0.219
 living-0.196
 office-0.194
 neighborhood-0.187
 exploring-0.185
Token hurry
Token ID23290
Feature activation+1.96e+00

Pos logit contributions
le+0.551
of+0.528
umbs+0.526
war+0.525
les+0.518
Neg logit contributions
 playing-0.451
ich-0.424
 office-0.421
 neighborhood-0.416
 living-0.406
Token and
Token ID290
Feature activation-3.50e+00

Pos logit contributions
 playing+0.130
 office+0.121
ich+0.121
 neighborhood+0.119
 living+0.118
Neg logit contributions
le-0.127
of-0.122
umbs-0.122
war-0.121
les-0.119
Token bust
Token ID13076
Feature activation-3.71e-01

Pos logit contributions
le+0.215
of+0.210
bs+0.203
war+0.199
les+0.196
Neg logit contributions
ich-0.233
 office-0.216
 playing-0.215
 neighborhood-0.210
 living-0.205
Tokenle
Token ID293
Feature activation-1.76e+01

Pos logit contributions
umbs+0.008
of+0.007
war+0.007
forts+0.006
bs+0.006
Neg logit contributions
 playing-0.041
 office-0.039
 neighborhood-0.039
 living-0.038
ich-0.038
Token.
Token ID13
Feature activation-8.61e-01

Pos logit contributions
le+1.257
of+1.210
umbs+1.206
war+1.204
les+1.181
Neg logit contributions
 playing-1.052
 office-0.966
ich-0.964
 neighborhood-0.952
 living-0.939
Token
Token ID198
Feature activation-1.14e-01

Pos logit contributions
le+0.061
of+0.059
umbs+0.058
war+0.058
les+0.058
Neg logit contributions
 playing-0.052
ich-0.048
 office-0.048
 neighborhood-0.048
 living-0.047
Token
Token ID198
Feature activation-1.08e-01

Pos logit contributions
le+0.010
les+0.010
umbs+0.010
forts+0.010
war+0.010
Neg logit contributions
 playing-0.005
 neighborhood-0.004
ich-0.004
 living-0.004
 office-0.004
TokenG
Token ID38
Feature activation-3.70e+00

Pos logit contributions
le+0.007
umbs+0.007
les+0.007
war+0.007
of+0.006
Neg logit contributions
 playing-0.007
ich-0.007
 neighborhood-0.007
 office-0.006
 living-0.006
Tokenilly
Token ID6548
Feature activation-6.49e+00

Pos logit contributions
war+0.226
le+0.223
 sip+0.217
umbs+0.217
bs+0.212
Neg logit contributions
ich-0.241
 office-0.241
 playing-0.241
 neighborhood-0.232
 living-0.225
Token It
Token ID632
Feature activation-6.15e+00

Pos logit contributions
le+0.052
of+0.051
umbs+0.051
war+0.050
les+0.049
Neg logit contributions
 playing-0.045
ich-0.044
 office-0.043
 neighborhood-0.041
 factory-0.040
Token had
Token ID550
Feature activation+1.63e+00

Pos logit contributions
le+0.480
of+0.460
les+0.455
bs+0.423
 sip+0.417
Neg logit contributions
 office-0.353
ich-0.328
 neighborhood-0.314
 playing-0.311
 seeker-0.310
Token a
Token ID257
Feature activation-3.31e-01

Pos logit contributions
 playing+0.087
 living+0.078
 neighborhood+0.074
ich+0.074
 office+0.074
Neg logit contributions
le-0.135
war-0.129
of-0.126
umbs-0.125
les-0.124
Token blue
Token ID4171
Feature activation-3.28e+00

Pos logit contributions
le+0.021
of+0.020
les+0.020
war+0.020
umbs+0.019
Neg logit contributions
 playing-0.021
 office-0.021
ich-0.020
 neighborhood-0.020
 living-0.019
Token spark
Token ID9009
Feature activation-4.90e+00

Pos logit contributions
war+0.219
umbs+0.216
le+0.211
forts+0.206
of+0.203
Neg logit contributions
 playing-0.219
ich-0.202
 living-0.197
 office-0.193
 neighborhood-0.192
Tokenle
Token ID293
Feature activation-1.76e+01

Pos logit contributions
umbs+0.280
war+0.277
of+0.267
forts+0.257
bs+0.244
Neg logit contributions
 playing-0.375
 living-0.339
ich-0.333
 office-0.332
 exploring-0.319
Token in
Token ID287
Feature activation+5.44e+00

Pos logit contributions
le+1.423
les+1.348
of+1.298
umbs+1.275
bs+1.255
Neg logit contributions
 office-0.861
 neighborhood-0.842
 playing-0.838
ich-0.810
 factory-0.775
Token the
Token ID262
Feature activation-2.14e-01

Pos logit contributions
 playing+0.334
 living+0.297
 exploring+0.273
 backyard+0.266
 forest+0.266
Neg logit contributions
war-0.416
forts-0.412
le-0.407
umbs-0.406
bs-0.400
Token sun
Token ID4252
Feature activation-2.63e+00

Pos logit contributions
le+0.013
of+0.012
les+0.012
war+0.012
umbs+0.012
Neg logit contributions
 playing-0.015
 office-0.014
ich-0.014
 neighborhood-0.013
 exploring-0.013
Token and
Token ID290
Feature activation-3.43e+00

Pos logit contributions
le+0.174
war+0.167
of+0.167
umbs+0.166
les+0.162
Neg logit contributions
 playing-0.173
ich-0.160
 office-0.159
 neighborhood-0.157
 living-0.156
Token she
Token ID673
Feature activation+1.04e+00

Pos logit contributions
le+0.184
les+0.178
of+0.174
bs+0.167
war+0.157
Neg logit contributions
ich-0.254
 office-0.252
 neighborhood-0.248
 playing-0.242
 factory-0.239
Token giving
Token ID3501
Feature activation+2.78e+00

Pos logit contributions
 playing+0.262
ich+0.240
 office+0.240
 neighborhood+0.236
 living+0.231
Neg logit contributions
le-0.403
of-0.388
umbs-0.386
war-0.385
les-0.381
Token off
Token ID572
Feature activation-6.42e+00

Pos logit contributions
 playing+0.159
 office+0.144
 neighborhood+0.142
 living+0.142
ich+0.140
Neg logit contributions
le-0.204
umbs-0.202
war-0.197
of-0.197
les-0.192
Token a
Token ID257
Feature activation-2.70e-01

Pos logit contributions
le+0.457
of+0.439
umbs+0.435
war+0.435
les+0.429
Neg logit contributions
 playing-0.384
ich-0.358
 office-0.356
 neighborhood-0.352
 living-0.344
Token special
Token ID2041
Feature activation-1.01e-01

Pos logit contributions
le+0.021
umbs+0.021
les+0.020
war+0.020
of+0.020
Neg logit contributions
 playing-0.015
 living-0.014
 office-0.013
 neighborhood-0.013
ich-0.013
Token spark
Token ID9009
Feature activation-4.84e+00

Pos logit contributions
le+0.007
umbs+0.007
of+0.007
war+0.007
les+0.006
Neg logit contributions
 playing-0.007
 office-0.006
 neighborhood-0.006
 living-0.006
ich-0.006
Tokenle
Token ID293
Feature activation-1.76e+01

Pos logit contributions
umbs+0.308
war+0.299
of+0.287
forts+0.278
ws+0.274
Neg logit contributions
 playing-0.327
 living-0.303
 office-0.299
 exploring-0.282
 factory-0.276
Token that
Token ID326
Feature activation+1.03e+00

Pos logit contributions
le+1.302
of+1.257
umbs+1.254
war+1.252
les+1.229
Neg logit contributions
 playing-1.001
 office-0.916
ich-0.913
 neighborhood-0.902
 living-0.891
Token made
Token ID925
Feature activation-1.50e+00

Pos logit contributions
 playing+0.059
ich+0.056
 office+0.055
 neighborhood+0.054
 living+0.052
Neg logit contributions
le-0.075
of-0.072
war-0.071
umbs-0.071
les-0.070
Token the
Token ID262
Feature activation-3.54e-02

Pos logit contributions
war+0.114
umbs+0.113
le+0.111
forts+0.111
of+0.109
Neg logit contributions
 playing-0.086
 office-0.075
 living-0.075
 exploring-0.071
 neighborhood-0.069
Token princess
Token ID21752
Feature activation+3.67e+00

Pos logit contributions
le+0.003
les+0.003
of+0.003
umbs+0.002
war+0.002
Neg logit contributions
 playing-0.002
ich-0.002
 office-0.002
 neighborhood-0.002
 exploring-0.002
Token feel
Token ID1254
Feature activation+8.14e-01

Pos logit contributions
 playing+0.215
 office+0.194
 living+0.192
ich+0.190
 neighborhood+0.189
Neg logit contributions
le-0.264
war-0.263
umbs-0.260
of-0.258
les-0.251
Token,
Token ID11
Feature activation+2.59e+00

Pos logit contributions
le+0.331
of+0.320
les+0.315
war+0.314
bs+0.310
Neg logit contributions
 playing-0.274
ich-0.268
 neighborhood-0.259
 office-0.259
 living-0.249
Token it
Token ID340
Feature activation-6.84e+00

Pos logit contributions
ich+0.166
 neighborhood+0.162
 playing+0.160
 office+0.156
 living+0.150
Neg logit contributions
le-0.163
of-0.153
bs-0.152
les-0.151
war-0.145
Token began
Token ID2540
Feature activation+1.76e-01

Pos logit contributions
le+0.491
les+0.463
of+0.461
war+0.455
umbs+0.449
Neg logit contributions
 playing-0.414
ich-0.389
 office-0.388
 neighborhood-0.380
 living-0.371
Token to
Token ID284
Feature activation-2.55e-01

Pos logit contributions
 playing+0.010
 office+0.010
ich+0.010
 neighborhood+0.010
 living+0.009
Neg logit contributions
le-0.013
of-0.012
umbs-0.012
war-0.012
les-0.012
Token spark
Token ID9009
Feature activation-4.94e+00

Pos logit contributions
le+0.017
war+0.017
umbs+0.017
of+0.017
les+0.017
Neg logit contributions
 playing-0.016
 office-0.015
 living-0.015
 neighborhood-0.014
ich-0.014
Tokenle
Token ID293
Feature activation-1.76e+01

Pos logit contributions
umbs+0.180
forts+0.157
war+0.150
 sip+0.127
uits+0.126
Neg logit contributions
 playing-0.462
 living-0.430
 office-0.430
ich-0.427
 neighborhood-0.421
Token and
Token ID290
Feature activation-3.56e+00

Pos logit contributions
le+1.327
war+1.303
umbs+1.303
of+1.291
les+1.257
Neg logit contributions
 playing-0.974
 office-0.864
ich-0.860
 living-0.860
 neighborhood-0.850
Token shine
Token ID18340
Feature activation-3.92e+00

Pos logit contributions
le+0.210
of+0.202
war+0.195
les+0.194
bs+0.193
Neg logit contributions
 playing-0.244
ich-0.243
 office-0.240
 neighborhood-0.239
 living-0.223
Token in
Token ID287
Feature activation+5.35e+00

Pos logit contributions
war+0.285
umbs+0.283
of+0.278
forts+0.277
le+0.276
Neg logit contributions
 playing-0.224
 living-0.202
 office-0.197
ich-0.196
 neighborhood-0.195
Token the
Token ID262
Feature activation-1.93e-01

Pos logit contributions
 playing+0.284
 living+0.251
 neighborhood+0.239
 office+0.238
 exploring+0.231
Neg logit contributions
war-0.442
le-0.439
umbs-0.436
bs-0.429
forts-0.428
Token sunlight
Token ID19606
Feature activation-8.29e+00

Pos logit contributions
le+0.012
of+0.012
umbs+0.012
war+0.012
les+0.012
Neg logit contributions
 playing-0.013
 office-0.012
ich-0.012
 neighborhood-0.012
 living-0.012
Token heard
Token ID2982
Feature activation-4.25e+00

Pos logit contributions
le+0.228
of+0.213
les+0.208
bs+0.205
war+0.204
Neg logit contributions
ich-0.218
 office-0.207
 playing-0.206
 neighborhood-0.204
 seeker-0.192
Token his
Token ID465
Feature activation+1.59e+00

Pos logit contributions
le+0.323
umbs+0.319
war+0.317
of+0.312
forts+0.305
Neg logit contributions
 playing-0.244
 office-0.215
 living-0.213
 neighborhood-0.204
ich-0.202
Token stomach
Token ID11384
Feature activation-1.13e+00

Pos logit contributions
 playing+0.097
 office+0.090
ich+0.089
 neighborhood+0.088
 living+0.086
Neg logit contributions
le-0.114
umbs-0.109
of-0.109
war-0.109
les-0.107
Token g
Token ID308
Feature activation-2.71e+00

Pos logit contributions
war+0.084
le+0.083
of+0.081
forts+0.081
umbs+0.080
Neg logit contributions
 playing-0.064
 living-0.056
ich-0.056
 neighborhood-0.054
 office-0.054
Tokenurg
Token ID3686
Feature activation-6.48e+00

Pos logit contributions
war+0.129
le+0.127
of+0.125
umbs+0.120
bs+0.117
Neg logit contributions
 playing-0.226
ich-0.212
 office-0.208
 neighborhood-0.207
 living-0.202
Tokenle
Token ID293
Feature activation-1.76e+01

Pos logit contributions
bs+0.224
war+0.217
of+0.212
forts+0.210
ushes+0.210
Neg logit contributions
 playing-0.600
 living-0.572
 office-0.548
 exploring-0.534
ter-0.530
Token after
Token ID706
Feature activation-2.19e+00

Pos logit contributions
war+1.239
of+1.221
le+1.218
umbs+1.218
bs+1.184
Neg logit contributions
 playing-1.085
 office-0.951
 living-0.950
ich-0.931
 neighborhood-0.913
Token he
Token ID339
Feature activation+1.13e+00

Pos logit contributions
le+0.151
war+0.148
of+0.147
umbs+0.146
bs+0.142
Neg logit contributions
 playing-0.143
 office-0.127
ich-0.125
 living-0.124
 neighborhood-0.122
Token ate
Token ID15063
Feature activation-1.87e+00

Pos logit contributions
 playing+0.063
 office+0.058
ich+0.058
 neighborhood+0.057
 living+0.056
Neg logit contributions
le-0.086
of-0.083
umbs-0.082
war-0.082
les-0.081
Token it
Token ID340
Feature activation-6.70e+00

Pos logit contributions
le+0.130
war+0.130
umbs+0.129
of+0.129
bs+0.127
Neg logit contributions
 playing-0.118
 office-0.103
 living-0.103
 neighborhood-0.099
 exploring-0.099
Token.
Token ID13
Feature activation-7.77e-01

Pos logit contributions
le+0.465
war+0.459
umbs+0.456
of+0.455
les+0.451
Neg logit contributions
 playing-0.414
 office-0.366
 living-0.360
ich-0.355
 neighborhood-0.353
Token the
Token ID262
Feature activation-2.27e-01

Pos logit contributions
 playing+0.165
 office+0.153
ich+0.153
 neighborhood+0.151
 living+0.149
Neg logit contributions
le-0.180
of-0.173
umbs-0.172
war-0.172
les-0.169
Token basket
Token ID7988
Feature activation-9.75e-01

Pos logit contributions
le+0.017
les+0.016
of+0.015
bs+0.015
war+0.015
Neg logit contributions
 playing-0.013
ich-0.011
 neighborhood-0.011
 living-0.011
 exploring-0.011
Token started
Token ID2067
Feature activation+1.43e+00

Pos logit contributions
le+0.074
war+0.072
of+0.071
umbs+0.070
les+0.069
Neg logit contributions
 playing-0.054
 office-0.049
ich-0.048
 neighborhood-0.048
 living-0.048
Token to
Token ID284
Feature activation-2.16e-01

Pos logit contributions
 playing+0.081
 office+0.072
 living+0.071
 neighborhood+0.070
ich+0.070
Neg logit contributions
le-0.113
of-0.108
war-0.108
umbs-0.107
les-0.106
Token spark
Token ID9009
Feature activation-4.87e+00

Pos logit contributions
le+0.015
war+0.015
umbs+0.014
of+0.014
les+0.014
Neg logit contributions
 playing-0.014
 office-0.013
 living-0.012
 neighborhood-0.012
ich-0.012
Tokenle
Token ID293
Feature activation-1.76e+01

Pos logit contributions
umbs+0.167
forts+0.146
war+0.135
of+0.115
uits+0.114
Neg logit contributions
 playing-0.460
 living-0.431
 office-0.422
 smartest-0.417
ich-0.417
Token and
Token ID290
Feature activation-3.48e+00

Pos logit contributions
war+1.336
umbs+1.336
le+1.331
of+1.313
forts+1.289
Neg logit contributions
 playing-0.961
 living-0.843
ich-0.824
 office-0.823
 neighborhood-0.811
Token shine
Token ID18340
Feature activation-3.89e+00

Pos logit contributions
le+0.218
of+0.209
war+0.205
les+0.203
umbs+0.202
Neg logit contributions
 playing-0.230
ich-0.223
 office-0.220
 neighborhood-0.218
 living-0.208
Token.
Token ID13
Feature activation-9.47e-01

Pos logit contributions
war+0.274
umbs+0.271
forts+0.267
of+0.263
bs+0.259
Neg logit contributions
 playing-0.234
 living-0.213
ich-0.201
 office-0.200
 neighborhood-0.199
Token
Token ID220
Feature activation-3.45e+00

Pos logit contributions
le+0.071
of+0.068
war+0.067
umbs+0.067
les+0.067
Neg logit contributions
 playing-0.054
 office-0.050
 neighborhood-0.049
ich-0.049
 living-0.048
Token
Token ID198
Feature activation-1.39e-01

Pos logit contributions
le+0.247
of+0.237
umbs+0.236
war+0.236
les+0.232
Neg logit contributions
 playing-0.205
 office-0.189
ich-0.189
 neighborhood-0.187
 living-0.183
Token
Token ID198
Feature activation-2.43e-01

Pos logit contributions
of+0.054
umbs+0.054
war+0.053
bs+0.052
le+0.050
Neg logit contributions
 playing-0.058
ich-0.057
 office-0.054
 neighborhood-0.053
 smartest-0.052
Token
Token ID198
Feature activation-2.16e-01

Pos logit contributions
le+0.020
les+0.019
umbs+0.019
war+0.018
of+0.018
Neg logit contributions
 playing-0.013
 living-0.011
 neighborhood-0.011
 office-0.011
ich-0.011
TokenA
Token ID32
Feature activation-2.57e+00

Pos logit contributions
le+0.013
of+0.012
war+0.012
umbs+0.012
les+0.012
Neg logit contributions
 playing-0.016
 office-0.015
ich-0.015
 neighborhood-0.015
 living-0.014
Tokenire
Token ID557
Feature activation-4.51e+00

Pos logit contributions
le+0.161
of+0.158
war+0.153
umbs+0.151
les+0.150
Neg logit contributions
 playing-0.173
 office-0.161
ich-0.160
 neighborhood-0.159
 living-0.155
Tokenl
Token ID75
Feature activation-1.20e+01

Pos logit contributions
umbs+0.240
war+0.237
le+0.231
of+0.228
forts+0.221
Neg logit contributions
 playing-0.349
ich-0.328
 office-0.322
 living-0.322
 neighborhood-0.316
Tokenle
Token ID293
Feature activation-1.76e+01

Pos logit contributions
le+0.781
les+0.710
of+0.688
umbs+0.670
war+0.622
Neg logit contributions
 playing-0.849
 office-0.776
ich-0.775
 exploring-0.762
 neighborhood-0.762
Token ran
Token ID4966
Feature activation+1.67e+00

Pos logit contributions
le+1.198
of+1.128
les+1.111
umbs+1.094
war+1.086
Neg logit contributions
 playing-1.118
 office-1.069
ich-1.062
 neighborhood-1.050
 living-1.018
Token back
Token ID736
Feature activation+3.34e+00

Pos logit contributions
 playing+0.103
ich+0.099
 office+0.097
 neighborhood+0.097
 living+0.094
Neg logit contributions
le-0.115
of-0.111
war-0.111
les-0.108
umbs-0.108
Token to
Token ID284
Feature activation-2.71e-01

Pos logit contributions
ich+0.202
 playing+0.196
 happiest+0.193
 smartest+0.192
 office+0.189
Neg logit contributions
le-0.218
of-0.217
war-0.209
les-0.208
bs-0.203
Token the
Token ID262
Feature activation-2.13e-01

Pos logit contributions
le+0.018
of+0.017
war+0.017
umbs+0.017
les+0.016
Neg logit contributions
 playing-0.017
ich-0.017
 neighborhood-0.016
 office-0.016
 living-0.016
Token door
Token ID3420
Feature activation-1.16e+00

Pos logit contributions
le+0.015
 sip+0.015
les+0.014
war+0.014
of+0.013
Neg logit contributions
ich-0.010
 playing-0.010
 Sy-0.010
gy-0.010
 smartest-0.009
Token g
Token ID308
Feature activation-2.64e+00

Pos logit contributions
umbs+0.103
war+0.102
le+0.099
fulness+0.094
les+0.094
Neg logit contributions
 playing-0.101
 living-0.090
 neighborhood-0.085
 happiest-0.082
 exploring-0.081
Tokeniggle
Token ID24082
Feature activation-8.21e+00

Pos logit contributions
war+0.166
of+0.152
le+0.146
umbs+0.146
bread+0.142
Neg logit contributions
 playing-0.192
 neighborhood-0.168
ich-0.162
 office-0.162
 living-0.158
Token.
Token ID13
Feature activation-9.33e-01

Pos logit contributions
umbs+0.653
war+0.652
of+0.641
le+0.608
forts+0.605
Neg logit contributions
 playing-0.488
 living-0.414
 office-0.363
 exploring-0.357
 neighborhood-0.354
Token They
Token ID1119
Feature activation-3.01e+00

Pos logit contributions
le+0.069
les+0.065
of+0.065
war+0.064
umbs+0.064
Neg logit contributions
 playing-0.057
 living-0.052
 neighborhood-0.051
 office-0.049
ich-0.049
Token tick
Token ID4378
Feature activation-4.82e+00

Pos logit contributions
umbs+0.197
war+0.196
le+0.195
les+0.184
of+0.182
Neg logit contributions
 playing-0.199
 living-0.177
 neighborhood-0.175
ich-0.168
 seeker-0.167
Tokenle
Token ID293
Feature activation-1.76e+01

Pos logit contributions
umbs+0.135
uits+0.105
forts+0.102
war+0.102
olt+0.091
Neg logit contributions
 playing-0.494
 living-0.435
 best-0.417
 neighborhood-0.411
ich-0.408
Token the
Token ID262
Feature activation-2.53e-01

Pos logit contributions
umbs+1.249
war+1.218
forts+1.175
le+1.162
of+1.137
Neg logit contributions
 playing-1.156
 living-1.016
ich-0.931
 office-0.920
 exploring-0.903
Token fish
Token ID5916
Feature activation-2.86e+00

Pos logit contributions
le+0.016
umbs+0.016
of+0.016
war+0.016
bs+0.015
Neg logit contributions
 playing-0.017
ich-0.016
 living-0.016
 neighborhood-0.015
 office-0.015
Token back
Token ID736
Feature activation+3.23e+00

Pos logit contributions
war+0.205
umbs+0.203
forts+0.197
le+0.196
of+0.196
Neg logit contributions
 playing-0.172
 living-0.154
ich-0.140
 office-0.136
 exploring-0.133
Token.
Token ID13
Feature activation-9.29e-01

Pos logit contributions
 playing+0.194
 living+0.177
 office+0.168
ich+0.164
 neighborhood+0.160
Neg logit contributions
umbs-0.237
war-0.232
le-0.229
forts-0.225
of-0.224
Token They
Token ID1119
Feature activation-3.02e+00

Pos logit contributions
le+0.072
les+0.067
of+0.067
war+0.066
umbs+0.066
Neg logit contributions
 playing-0.055
 living-0.050
 neighborhood-0.048
 exploring-0.046
 office-0.045
Token special
Token ID2041
Feature activation-9.48e-02

Pos logit contributions
le+0.018
of+0.017
les+0.017
umbs+0.017
war+0.017
Neg logit contributions
 playing-0.015
ich-0.014
 office-0.014
 neighborhood-0.014
 living-0.013
Token find
Token ID1064
Feature activation+4.97e+00

Pos logit contributions
le+0.007
umbs+0.007
of+0.007
war+0.007
les+0.006
Neg logit contributions
 playing-0.006
 office-0.005
ich-0.005
 neighborhood-0.005
 living-0.005
Token.
Token ID13
Feature activation-7.75e-01

Pos logit contributions
 playing+0.303
 living+0.280
 office+0.279
 neighborhood+0.262
ich+0.262
Neg logit contributions
war-0.343
umbs-0.342
le-0.333
of-0.331
forts-0.324
Token Ann
Token ID5506
Feature activation-2.65e-01

Pos logit contributions
le+0.055
of+0.052
umbs+0.052
war+0.052
les+0.051
Neg logit contributions
 playing-0.047
 office-0.044
ich-0.044
 neighborhood-0.043
 living-0.042
Tokenabel
Token ID9608
Feature activation-7.14e+00

Pos logit contributions
war+0.016
umbs+0.016
le+0.016
of+0.016
les+0.015
Neg logit contributions
 playing-0.018
 office-0.017
 neighborhood-0.017
 living-0.017
ich-0.016
Tokenle
Token ID293
Feature activation-1.76e+01

Pos logit contributions
umbs+0.299
war+0.298
of+0.269
forts+0.263
le+0.256
Neg logit contributions
 playing-0.637
 office-0.593
 living-0.592
ich-0.592
 neighborhood-0.591
Token taught
Token ID7817
Feature activation-4.00e+00

Pos logit contributions
le+1.448
of+1.393
umbs+1.375
les+1.371
war+1.369
Neg logit contributions
 playing-0.855
ich-0.790
 office-0.787
 neighborhood-0.770
 living-0.748
Token them
Token ID606
Feature activation-7.17e+00

Pos logit contributions
umbs+0.327
war+0.321
le+0.315
forts+0.315
bs+0.310
Neg logit contributions
 playing-0.219
 office-0.190
 living-0.185
 exploring-0.175
 neighborhood-0.171
Token the
Token ID262
Feature activation-2.13e-01

Pos logit contributions
le+0.550
umbs+0.535
war+0.532
of+0.527
les+0.517
Neg logit contributions
 playing-0.404
 office-0.362
 living-0.362
 neighborhood-0.349
ich-0.345
Token word
Token ID1573
Feature activation-7.94e+00

Pos logit contributions
le+0.015
of+0.014
war+0.014
umbs+0.014
les+0.014
Neg logit contributions
 playing-0.013
ich-0.012
 office-0.012
 neighborhood-0.012
 living-0.012
Token �
Token ID6184
Feature activation+4.81e+00

Pos logit contributions
le+0.578
of+0.556
umbs+0.551
war+0.548
les+0.543
Neg logit contributions
 playing-0.457
ich-0.427
 office-0.423
 neighborhood-0.419
 living-0.407
Token a
Token ID257
Feature activation-1.50e-01

Pos logit contributions
 playing+0.230
ich+0.228
 neighborhood+0.226
 office+0.225
 factory+0.214
Neg logit contributions
le-0.244
war-0.236
of-0.233
les-0.230
umbs-0.228
Token spray
Token ID11662
Feature activation-5.90e-01

Pos logit contributions
of+0.008
 sip+0.008
le+0.008
war+0.008
les+0.007
Neg logit contributions
 playing-0.010
 office-0.009
ich-0.009
 seeking-0.009
 saving-0.009
Token bottle
Token ID9294
Feature activation-2.50e+00

Pos logit contributions
le+0.041
of+0.039
les+0.039
umbs+0.039
war+0.038
Neg logit contributions
 playing-0.035
ich-0.033
 neighborhood-0.033
 office-0.033
 exploring-0.031
Token in
Token ID287
Feature activation+5.61e+00

Pos logit contributions
le+0.184
of+0.177
war+0.176
umbs+0.176
les+0.173
Neg logit contributions
 playing-0.146
 office-0.134
ich-0.133
 neighborhood-0.131
 living-0.130
Token the
Token ID262
Feature activation+4.80e-02

Pos logit contributions
 playing+0.289
 office+0.251
 living+0.251
 neighborhood+0.241
 exploring+0.230
Neg logit contributions
le-0.480
bs-0.469
umbs-0.469
war-0.463
of-0.461
Token garden
Token ID11376
Feature activation+8.55e+00

Pos logit contributions
 playing+0.003
ich+0.002
 exploring+0.002
 office+0.002
 seeking+0.002
Neg logit contributions
war-0.003
le-0.003
 sip-0.003
les-0.003
of-0.003
Token.
Token ID13
Feature activation-6.39e-01

Pos logit contributions
ich+0.631
 playing+0.612
 seeker+0.594
 neighborhood+0.593
 office+0.581
Neg logit contributions
le-0.449
umbs-0.424
of-0.422
les-0.415
war-0.395
Token It
Token ID632
Feature activation-5.95e+00

Pos logit contributions
le+0.043
war+0.043
of+0.043
umbs+0.043
les+0.042
Neg logit contributions
 playing-0.039
ich-0.038
 office-0.037
 neighborhood-0.036
 living-0.035
Token was
Token ID373
Feature activation+5.22e+00

Pos logit contributions
le+0.483
of+0.458
les+0.455
 sip+0.449
bs+0.445
Neg logit contributions
 office-0.303
ich-0.293
 playing-0.285
 neighborhood-0.280
 seeker-0.267
Token yellow
Token ID7872
Feature activation-2.88e+00

Pos logit contributions
 playing+0.272
ich+0.266
 office+0.265
 neighborhood+0.263
 living+0.243
Neg logit contributions
le-0.390
umbs-0.377
of-0.372
war-0.371
les-0.366
Token and
Token ID290
Feature activation-3.33e+00

Pos logit contributions
le+0.159
umbs+0.149
of+0.148
les+0.147
war+0.142
Neg logit contributions
 playing-0.215
ich-0.209
 office-0.207
 neighborhood-0.206
 living-0.198
Token looking
Token ID2045
Feature activation+4.99e+00

Pos logit contributions
le+0.076
les+0.071
of+0.069
war+0.069
bs+0.068
Neg logit contributions
 neighborhood-0.072
ich-0.072
 office-0.068
 playing-0.068
 living-0.067
Token.
Token ID13
Feature activation-7.70e-01

Pos logit contributions
 playing+0.284
 office+0.254
 living+0.253
 neighborhood+0.247
ich+0.242
Neg logit contributions
le-0.373
umbs-0.372
war-0.367
of-0.366
les-0.350
Token Soon
Token ID15894
Feature activation-2.58e+00

Pos logit contributions
war+0.045
umbs+0.045
of+0.045
le+0.042
bs+0.042
Neg logit contributions
 playing-0.053
ich-0.052
 office-0.050
 living-0.049
 seeker-0.048
Token,
Token ID11
Feature activation+2.95e+00

Pos logit contributions
le+0.170
of+0.163
war+0.159
les+0.159
bs+0.158
Neg logit contributions
ich-0.163
 neighborhood-0.155
 playing-0.154
 office-0.152
 living-0.152
Token he
Token ID339
Feature activation+1.16e+00

Pos logit contributions
ich+0.186
 neighborhood+0.180
 playing+0.179
 office+0.178
 living+0.176
Neg logit contributions
le-0.182
of-0.176
war-0.172
bs-0.170
umbs-0.169
Token met
Token ID1138
Feature activation+8.04e+00

Pos logit contributions
 playing+0.063
ich+0.060
 office+0.059
 living+0.057
 neighborhood+0.056
Neg logit contributions
le-0.091
of-0.086
les-0.086
umbs-0.082
war-0.082
Token an
Token ID281
Feature activation+2.66e+00

Pos logit contributions
ich+0.483
 playing+0.483
 office+0.461
 neighborhood+0.457
 living+0.444
Neg logit contributions
le-0.555
of-0.529
war-0.524
les-0.518
umbs-0.512
Token old
Token ID1468
Feature activation-2.77e-02

Pos logit contributions
 playing+0.152
ich+0.149
 neighborhood+0.141
 living+0.139
 exploring+0.132
Neg logit contributions
le-0.195
les-0.176
umbs-0.165
ck-0.165
of-0.164
Token man
Token ID582
Feature activation+9.29e-01

Pos logit contributions
le+0.001
les+0.001
of+0.001
 sip+0.001
forts+0.001
Neg logit contributions
 playing-0.002
ich-0.002
 happiest-0.002
 smartest-0.002
 seeking-0.002
Token who
Token ID508
Feature activation+4.60e+00

Pos logit contributions
ich+0.047
 playing+0.046
 neighborhood+0.046
 office+0.044
 seeker+0.043
Neg logit contributions
le-0.072
of-0.069
les-0.066
war-0.065
umbs-0.065
Token was
Token ID373
Feature activation+5.16e+00

Pos logit contributions
ich+0.241
 playing+0.238
 office+0.229
 neighborhood+0.223
 seeker+0.214
Neg logit contributions
le-0.359
of-0.342
les-0.338
bs-0.323
war-0.319
TokenI
Token ID40
Feature activation+8.27e+00

Pos logit contributions
 playing+0.250
ich+0.236
 neighborhood+0.235
 office+0.227
 living+0.225
Neg logit contributions
le-0.454
umbs-0.448
war-0.438
les-0.436
of-0.432
Token'm
Token ID1101
Feature activation+7.07e+00

Pos logit contributions
 playing+0.475
 office+0.427
 neighborhood+0.425
 living+0.422
ich+0.409
Neg logit contributions
umbs-0.606
le-0.600
war-0.593
of-0.581
les-0.580
Token sorry
Token ID7926
Feature activation+2.73e+00

Pos logit contributions
ich+0.336
 neighborhood+0.336
 playing+0.318
 office+0.315
 factory+0.298
Neg logit contributions
le-0.549
bs-0.516
umbs-0.514
les-0.507
of-0.500
Token Steve
Token ID6542
Feature activation+1.42e+00

Pos logit contributions
 neighborhood+0.156
ich+0.154
 playing+0.152
 office+0.151
 seeking+0.143
Neg logit contributions
le-0.196
of-0.188
les-0.182
fulness-0.176
bs-0.175
Token.
Token ID13
Feature activation-5.01e-01

Pos logit contributions
 playing+0.092
ich+0.088
 neighborhood+0.088
 office+0.088
 factory+0.083
Neg logit contributions
le-0.093
of-0.088
les-0.087
umbs-0.087
war-0.085
Token I
Token ID314
Feature activation+7.49e+00

Pos logit contributions
of+0.036
bs+0.034
war+0.034
umbs+0.034
forts+0.034
Neg logit contributions
 playing-0.028
ich-0.027
 office-0.026
 neighborhood-0.026
 backyard-0.024
Token hid
Token ID24519
Feature activation+3.27e+00

Pos logit contributions
 playing+0.398
 neighborhood+0.356
 office+0.355
 living+0.350
ich+0.350
Neg logit contributions
le-0.578
umbs-0.573
war-0.566
of-0.562
les-0.553
Token the
Token ID262
Feature activation+6.27e-02

Pos logit contributions
 playing+0.179
 office+0.169
ich+0.169
 neighborhood+0.169
 living+0.160
Neg logit contributions
le-0.248
of-0.236
war-0.232
les-0.232
umbs-0.230
Token er
Token ID1931
Feature activation+5.82e+00

Pos logit contributions
 playing+0.003
ich+0.003
 neighborhood+0.003
 office+0.003
 exploring+0.003
Neg logit contributions
le-0.005
war-0.004
of-0.004
les-0.004
umbs-0.004
Tokenaser
Token ID6005
Feature activation-3.03e+00

Pos logit contributions
 playing+0.423
 neighborhood+0.394
 office+0.393
ich+0.393
 living+0.385
Neg logit contributions
le-0.344
of-0.328
umbs-0.324
war-0.322
les-0.320
Token and
Token ID290
Feature activation-3.15e+00

Pos logit contributions
le+0.229
of+0.216
umbs+0.214
war+0.214
les+0.213
Neg logit contributions
 playing-0.166
ich-0.157
 neighborhood-0.157
 office-0.154
 living-0.145
Token my
Token ID616
Feature activation+4.20e+00

Pos logit contributions
 neighborhood+0.141
 office+0.137
ich+0.134
 playing+0.133
 neighbourhood+0.127
Neg logit contributions
of-0.201
le-0.192
les-0.181
 sip-0.177
umbs-0.173
Token name
Token ID1438
Feature activation+4.56e-01

Pos logit contributions
 playing+0.251
 office+0.247
ich+0.239
 neighborhood+0.238
 exploring+0.234
Neg logit contributions
le-0.284
of-0.275
les-0.271
bs-0.261
umbs-0.256
Token is
Token ID318
Feature activation+1.46e+00

Pos logit contributions
 playing+0.026
ich+0.025
 office+0.025
 neighborhood+0.025
 exploring+0.023
Neg logit contributions
le-0.034
of-0.033
les-0.032
umbs-0.031
war-0.031
Token Lily
Token ID20037
Feature activation+2.26e+00

Pos logit contributions
 neighborhood+0.092
 office+0.090
ich+0.088
 backyard+0.085
 neighbourhood+0.083
Neg logit contributions
of-0.100
le-0.097
war-0.086
les-0.085
bs-0.084
Token and
Token ID290
Feature activation-3.49e+00

Pos logit contributions
 playing+0.168
 neighborhood+0.167
ich+0.166
 office+0.162
 factory+0.161
Neg logit contributions
le-0.124
of-0.113
les-0.111
umbs-0.109
war-0.102
Token I
Token ID314
Feature activation+7.40e+00

Pos logit contributions
of+0.243
bs+0.231
le+0.226
les+0.215
 sip+0.209
Neg logit contributions
 neighborhood-0.211
ich-0.198
 office-0.197
 neighbourhood-0.191
 backyard-0.184
Token like
Token ID588
Feature activation+1.72e+00

Pos logit contributions
 playing+0.380
ich+0.362
 office+0.362
 neighborhood+0.349
 living+0.338
Neg logit contributions
le-0.596
of-0.568
war-0.558
les-0.556
umbs-0.555
Token cats
Token ID11875
Feature activation+3.52e+00

Pos logit contributions
ich+0.101
 neighborhood+0.101
 office+0.100
 playing+0.098
 seeker+0.094
Neg logit contributions
of-0.113
le-0.112
war-0.107
les-0.107
umbs-0.105
Token.
Token ID13
Feature activation-8.79e-01

Pos logit contributions
 playing+0.243
ich+0.235
 office+0.233
 neighborhood+0.231
 living+0.221
Neg logit contributions
le-0.216
of-0.203
umbs-0.201
les-0.200
war-0.198
Token Cats
Token ID28997
Feature activation-3.00e-01

Pos logit contributions
of+0.063
le+0.063
umbs+0.062
bs+0.060
war+0.060
Neg logit contributions
ich-0.049
 playing-0.049
 office-0.048
 neighborhood-0.047
 factory-0.043
Token are
Token ID389
Feature activation-3.31e+00

Pos logit contributions
le+0.019
umbs+0.018
war+0.018
of+0.018
les+0.017
Neg logit contributions
 playing-0.021
 office-0.019
ich-0.019
 living-0.019
 neighborhood-0.019
Token.
Token ID13
Feature activation-8.36e-01

Pos logit contributions
le+0.428
war+0.413
umbs+0.412
of+0.411
les+0.402
Neg logit contributions
 playing-0.402
 office-0.361
 living-0.357
ich-0.353
 neighborhood-0.350
Token He
Token ID679
Feature activation+1.44e+00

Pos logit contributions
le+0.062
of+0.060
umbs+0.060
war+0.060
les+0.059
Neg logit contributions
 playing-0.048
ich-0.044
 office-0.044
 neighborhood-0.044
 living-0.043
Token sat
Token ID3332
Feature activation-5.09e-01

Pos logit contributions
 playing+0.091
ich+0.087
 office+0.085
 neighborhood+0.083
 living+0.082
Neg logit contributions
le-0.097
of-0.094
les-0.091
war-0.091
umbs-0.090
Token on
Token ID319
Feature activation-2.26e-01

Pos logit contributions
le+0.035
les+0.034
of+0.031
bs+0.031
umbs+0.030
Neg logit contributions
 playing-0.030
ich-0.028
 neighborhood-0.028
 factory-0.028
 office-0.028
Token the
Token ID262
Feature activation+1.09e-01

Pos logit contributions
le+0.018
umbs+0.017
war+0.017
of+0.017
les+0.016
Neg logit contributions
 playing-0.013
 office-0.011
 living-0.011
 neighborhood-0.011
ich-0.011
Token toilet
Token ID16146
Feature activation+8.24e+00

Pos logit contributions
 playing+0.006
ich+0.006
 neighborhood+0.006
 office+0.006
 exploring+0.005
Neg logit contributions
le-0.008
of-0.008
les-0.008
war-0.008
umbs-0.008
Token and
Token ID290
Feature activation-3.28e+00

Pos logit contributions
 playing+0.556
 office+0.530
 living+0.510
 neighborhood+0.504
ich+0.501
Neg logit contributions
le-0.526
umbs-0.511
war-0.510
of-0.509
les-0.489
Token did
Token ID750
Feature activation-2.60e+00

Pos logit contributions
le+0.216
of+0.204
les+0.201
war+0.199
bs+0.198
Neg logit contributions
 playing-0.209
ich-0.208
 office-0.201
 neighborhood-0.200
 factory-0.189
Token his
Token ID465
Feature activation+1.58e+00

Pos logit contributions
le+0.244
of+0.236
war+0.235
umbs+0.234
les+0.233
Neg logit contributions
 playing-0.096
ich-0.088
 office-0.086
 neighborhood-0.086
 living-0.081
Token business
Token ID1597
Feature activation+4.17e+00

Pos logit contributions
ich+0.085
 playing+0.084
 neighborhood+0.081
 living+0.080
 office+0.076
Neg logit contributions
le-0.115
war-0.109
of-0.107
les-0.105
bs-0.103
Token.
Token ID13
Feature activation-8.44e-01

Pos logit contributions
 playing+0.288
ich+0.280
 neighborhood+0.272
 office+0.268
 living+0.265
Neg logit contributions
le-0.255
of-0.239
war-0.237
les-0.235
umbs-0.235
Token will
Token ID481
Feature activation-4.48e+00

Pos logit contributions
 playing+0.091
ich+0.087
 office+0.086
 neighborhood+0.084
 living+0.082
Neg logit contributions
le-0.094
of-0.089
les-0.088
umbs-0.088
war-0.087
Token not
Token ID407
Feature activation+4.14e+00

Pos logit contributions
le+0.344
of+0.331
umbs+0.329
war+0.329
les+0.324
Neg logit contributions
 playing-0.242
 office-0.223
ich-0.223
 neighborhood-0.220
 living-0.215
Token hurt
Token ID5938
Feature activation+1.71e+00

Pos logit contributions
 playing+0.292
 office+0.271
ich+0.271
 neighborhood+0.268
 living+0.265
Neg logit contributions
le-0.252
of-0.241
umbs-0.240
war-0.240
les-0.236
Token us
Token ID514
Feature activation-3.89e+00

Pos logit contributions
 playing+0.119
 office+0.114
 neighborhood+0.113
ich+0.112
 living+0.109
Neg logit contributions
le-0.106
of-0.100
umbs-0.097
war-0.097
les-0.097
Token,"
Token ID553
Feature activation-2.85e+00

Pos logit contributions
le+0.242
of+0.228
umbs+0.226
war+0.224
les+0.221
Neg logit contributions
 playing-0.270
ich-0.255
 office-0.255
 neighborhood-0.255
 living-0.247
Token Lily
Token ID20037
Feature activation+2.61e+00

Pos logit contributions
le+0.200
of+0.193
war+0.190
umbs+0.188
bs+0.186
Neg logit contributions
 playing-0.170
ich-0.162
 office-0.159
 neighborhood-0.159
 living-0.154
Token said
Token ID531
Feature activation-6.13e+00

Pos logit contributions
 playing+0.209
 office+0.197
 neighborhood+0.194
 living+0.192
ich+0.182
Neg logit contributions
umbs-0.134
war-0.129
le-0.128
of-0.124
les-0.118
Token.
Token ID13
Feature activation-5.86e-01

Pos logit contributions
le+0.426
umbs+0.415
of+0.413
war+0.412
les+0.402
Neg logit contributions
 playing-0.391
 office-0.352
 living-0.348
ich-0.347
 neighborhood-0.340
Token
Token ID198
Feature activation+4.66e-02

Pos logit contributions
le+0.044
of+0.042
umbs+0.042
war+0.042
les+0.041
Neg logit contributions
 playing-0.033
 office-0.031
ich-0.031
 neighborhood-0.030
 living-0.030
Token
Token ID198
Feature activation+7.66e-02

Pos logit contributions
 playing+0.002
 living+0.002
 neighborhood+0.002
 office+0.002
ich+0.002
Neg logit contributions
le-0.004
umbs-0.004
les-0.004
war-0.004
of-0.004
Token"
Token ID1
Feature activation+1.24e-01

Pos logit contributions
 playing+0.005
 neighborhood+0.005
ich+0.005
 office+0.005
 living+0.005
Neg logit contributions
le-0.005
umbs-0.005
les-0.005
war-0.005
of-0.005
Token.
Token ID13
Feature activation-1.03e+00

Pos logit contributions
le+0.315
of+0.297
umbs+0.294
war+0.293
les+0.290
Neg logit contributions
 playing-0.354
ich-0.335
 office-0.333
 neighborhood-0.332
 living-0.321
Token She
Token ID1375
Feature activation+1.31e+00

Pos logit contributions
le+0.082
of+0.079
war+0.078
umbs+0.078
les+0.078
Neg logit contributions
 playing-0.054
 office-0.049
 neighborhood-0.048
ich-0.048
 living-0.047
Token tries
Token ID8404
Feature activation-3.48e+00

Pos logit contributions
 playing+0.093
 office+0.087
ich+0.087
 neighborhood+0.086
 living+0.085
Neg logit contributions
le-0.079
of-0.075
umbs-0.075
war-0.075
les-0.073
Token to
Token ID284
Feature activation-2.02e-01

Pos logit contributions
le+0.263
umbs+0.253
of+0.249
war+0.249
les+0.247
Neg logit contributions
 playing-0.203
 office-0.184
ich-0.181
 neighborhood-0.181
 living-0.180
Token reach
Token ID3151
Feature activation-2.89e+00

Pos logit contributions
umbs+0.014
les+0.014
war+0.013
le+0.013
of+0.013
Neg logit contributions
 playing-0.013
 office-0.012
ich-0.011
 living-0.011
 neighborhood-0.011
Token for
Token ID329
Feature activation-1.47e-01

Pos logit contributions
umbs+0.212
war+0.206
le+0.206
forts+0.201
of+0.200
Neg logit contributions
 playing-0.176
 office-0.154
 living-0.152
 exploring-0.148
ich-0.143
Token the
Token ID262
Feature activation-6.57e-02

Pos logit contributions
le+0.011
war+0.011
umbs+0.011
les+0.011
of+0.011
Neg logit contributions
 playing-0.009
 office-0.007
 living-0.007
 exploring-0.007
ich-0.007
Token doll
Token ID3654
Feature activation+5.02e+00

Pos logit contributions
le+0.004
les+0.004
umbs+0.004
of+0.004
war+0.004
Neg logit contributions
 playing-0.004
ich-0.004
 neighborhood-0.004
 office-0.004
 exploring-0.004
Token,
Token ID11
Feature activation+2.58e+00

Pos logit contributions
 playing+0.275
ich+0.261
 neighborhood+0.260
 office+0.252
 living+0.246
Neg logit contributions
le-0.378
of-0.366
umbs-0.361
war-0.357
les-0.350
Token but
Token ID475
Feature activation-1.49e+00

Pos logit contributions
 playing+0.157
ich+0.150
 neighborhood+0.149
 office+0.148
 living+0.141
Neg logit contributions
le-0.177
of-0.170
war-0.168
umbs-0.167
les-0.165
Token she
Token ID673
Feature activation+1.01e+00

Pos logit contributions
le+0.081
of+0.080
bs+0.080
war+0.077
les+0.073
Neg logit contributions
 neighborhood-0.109
ich-0.109
 playing-0.107
 office-0.105
 living-0.101
Token
Token ID198
Feature activation-1.48e-01

Pos logit contributions
 playing+0.043
 neighborhood+0.038
 office+0.037
 living+0.037
ich+0.036
Neg logit contributions
le-0.061
war-0.057
umbs-0.057
of-0.057
les-0.057
Token
Token ID198
Feature activation-1.36e-01

Pos logit contributions
le+0.012
umbs+0.012
war+0.012
les+0.012
of+0.012
Neg logit contributions
 playing-0.008
 living-0.007
 neighborhood-0.007
ich-0.007
 office-0.007
TokenMom
Token ID29252
Feature activation-2.30e+00

Pos logit contributions
le+0.007
umbs+0.006
of+0.006
war+0.006
les+0.006
Neg logit contributions
 playing-0.011
ich-0.011
 office-0.011
 neighborhood-0.011
 living-0.010
Token came
Token ID1625
Feature activation-4.06e-01

Pos logit contributions
war+0.185
umbs+0.179
raw+0.175
le+0.174
bs+0.170
Neg logit contributions
 playing-0.119
 living-0.114
 office-0.104
 neighborhood-0.100
 walking-0.092
Token and
Token ID290
Feature activation-3.41e+00

Pos logit contributions
le+0.029
of+0.028
umbs+0.028
war+0.028
les+0.027
Neg logit contributions
 playing-0.023
ich-0.022
 office-0.022
 neighborhood-0.021
 living-0.021
Token hugged
Token ID41130
Feature activation-1.00e+00

Pos logit contributions
le+0.276
of+0.269
war+0.265
umbs+0.264
les+0.263
Neg logit contributions
 playing-0.168
ich-0.155
 office-0.154
 neighborhood-0.152
 living-0.146
Token them
Token ID606
Feature activation-7.04e+00

Pos logit contributions
umbs+0.065
war+0.059
bread+0.058
le+0.055
bs+0.053
Neg logit contributions
 playing-0.076
 office-0.067
 living-0.067
 neighborhood-0.066
 mall-0.063
Token.
Token ID13
Feature activation-8.91e-01

Pos logit contributions
le+0.523
war+0.518
umbs+0.503
les+0.496
of+0.496
Neg logit contributions
 playing-0.430
 living-0.378
 office-0.360
 neighborhood-0.355
 exploring-0.334
Token She
Token ID1375
Feature activation+1.47e+00

Pos logit contributions
le+0.065
of+0.064
umbs+0.064
war+0.063
les+0.063
Neg logit contributions
 playing-0.049
ich-0.047
 office-0.045
 neighborhood-0.044
 living-0.043
Token said
Token ID531
Feature activation-6.43e+00

Pos logit contributions
 playing+0.107
ich+0.102
 office+0.101
 neighborhood+0.100
 living+0.097
Neg logit contributions
le-0.086
of-0.082
les-0.080
umbs-0.079
war-0.079
Token,
Token ID11
Feature activation+2.76e+00

Pos logit contributions
le+0.507
of+0.495
bs+0.486
les+0.483
war+0.479
Neg logit contributions
 playing-0.315
ich-0.313
 neighborhood-0.309
 office-0.306
 living-0.286
Token they
Token ID484
Feature activation-3.55e+00

Pos logit contributions
ich+0.206
 office+0.204
 neighborhood+0.198
 playing+0.192
 living+0.189
Neg logit contributions
of-0.152
le-0.150
war-0.144
bs-0.143
les-0.134
Token encountered
Token ID12956
Feature activation+2.03e+00

Pos logit contributions
le+0.278
of+0.270
les+0.262
bs+0.259
war+0.259
Neg logit contributions
 playing-0.181
ich-0.175
 office-0.172
 living-0.165
 neighborhood-0.164
Token a
Token ID257
Feature activation-1.99e-01

Pos logit contributions
 playing+0.118
ich+0.110
 office+0.109
 neighborhood+0.108
 living+0.105
Neg logit contributions
le-0.148
of-0.142
umbs-0.141
war-0.141
les-0.139
Token heavy
Token ID4334
Feature activation-5.68e+00

Pos logit contributions
le+0.012
of+0.012
war+0.012
 sip+0.012
les+0.012
Neg logit contributions
 playing-0.012
ich-0.011
 office-0.011
 exploring-0.010
 seeking-0.010
Token storm
Token ID6388
Feature activation-3.75e+00

Pos logit contributions
le+0.449
les+0.430
of+0.419
 sip+0.402
war+0.400
Neg logit contributions
 playing-0.266
ich-0.261
 living-0.237
 neighborhood-0.236
 smartest-0.236
Token.
Token ID13
Feature activation-7.78e-01

Pos logit contributions
le+0.248
of+0.235
war+0.233
umbs+0.233
les+0.231
Neg logit contributions
 playing-0.244
ich-0.229
 office-0.228
 neighborhood-0.226
 living-0.220
Token Sam
Token ID3409
Feature activation-3.29e-01

Pos logit contributions
war+0.046
of+0.045
umbs+0.044
le+0.044
bs+0.043
Neg logit contributions
ich-0.055
 playing-0.051
 office-0.051
 neighborhood-0.048
 living-0.048
Token reassured
Token ID48171
Feature activation-4.71e+00

Pos logit contributions
le+0.023
les+0.022
of+0.022
war+0.021
umbs+0.021
Neg logit contributions
 playing-0.020
 office-0.019
ich-0.019
 neighborhood-0.018
 living-0.018
Token Zoe
Token ID32280
Feature activation+2.53e+00

Pos logit contributions
le+0.292
of+0.279
war+0.274
umbs+0.270
les+0.269
Neg logit contributions
 playing-0.328
ich-0.310
 office-0.308
 neighborhood-0.307
 living-0.301
Token that
Token ID326
Feature activation+1.17e+00

Pos logit contributions
 playing+0.149
ich+0.138
 office+0.138
 neighborhood+0.137
 living+0.133
Neg logit contributions
le-0.184
of-0.176
umbs-0.174
war-0.174
les-0.172
Token they
Token ID484
Feature activation-3.51e+00

Pos logit contributions
ich+0.081
 neighborhood+0.078
 playing+0.077
 office+0.075
 seeker+0.072
Neg logit contributions
le-0.067
of-0.064
war-0.061
bs-0.061
les-0.060
Token a
Token ID257
Feature activation-3.94e-01

Pos logit contributions
ich+0.279
 playing+0.267
 office+0.260
 neighborhood+0.259
 factory+0.237
Neg logit contributions
le-0.470
umbs-0.445
les-0.442
bs-0.441
war-0.436
Token bit
Token ID1643
Feature activation-1.73e+00

Pos logit contributions
le+0.029
of+0.028
umbs+0.027
war+0.027
les+0.027
Neg logit contributions
 playing-0.023
 office-0.021
ich-0.021
 neighborhood-0.021
 living-0.021
Token tired
Token ID10032
Feature activation-8.04e-01

Pos logit contributions
of+0.170
umbs+0.169
war+0.169
le+0.162
fulness+0.157
Neg logit contributions
 playing-0.061
 living-0.046
 neighborhood-0.045
 office-0.045
 happiest-0.044
Token but
Token ID475
Feature activation-1.62e+00

Pos logit contributions
le+0.060
war+0.060
umbs+0.059
of+0.059
les+0.058
Neg logit contributions
 playing-0.045
 office-0.041
 living-0.040
 neighborhood-0.039
ich-0.039
Token he
Token ID339
Feature activation+1.01e+00

Pos logit contributions
le+0.108
of+0.102
bs+0.101
war+0.100
les+0.097
Neg logit contributions
ich-0.101
 office-0.093
 neighborhood-0.091
 factory-0.088
 playing-0.088
Token didn
Token ID1422
Feature activation+1.64e+00

Pos logit contributions
 playing+0.056
ich+0.056
 office+0.052
 factory+0.051
 neighborhood+0.051
Neg logit contributions
le-0.078
of-0.072
les-0.071
bs-0.068
war-0.068
Token't
Token ID470
Feature activation-1.61e-01

Pos logit contributions
 playing+0.063
 office+0.054
ich+0.051
 living+0.049
 neighborhood+0.049
Neg logit contributions
le-0.154
les-0.149
war-0.149
of-0.147
bs-0.146
Token want
Token ID765
Feature activation-1.87e+00

Pos logit contributions
le+0.010
umbs+0.010
war+0.009
of+0.009
les+0.009
Neg logit contributions
 playing-0.011
 office-0.011
 neighborhood-0.010
ich-0.010
 living-0.010
Token to
Token ID284
Feature activation-2.32e-01

Pos logit contributions
le+0.146
umbs+0.142
of+0.140
war+0.139
les+0.138
Neg logit contributions
 playing-0.105
 office-0.091
 living-0.091
 neighborhood-0.089
 exploring-0.088
Token leave
Token ID2666
Feature activation-1.36e+00

Pos logit contributions
le+0.014
of+0.013
war+0.013
umbs+0.013
les+0.012
Neg logit contributions
 playing-0.016
ich-0.016
 living-0.016
 neighborhood-0.015
 office-0.015
Token just
Token ID655
Feature activation+1.22e+00

Pos logit contributions
le+0.080
of+0.071
les+0.071
war+0.071
umbs+0.069
Neg logit contributions
ich-0.097
 playing-0.093
 office-0.092
 neighborhood-0.092
 living-0.090
Token birthday
Token ID10955
Feature activation+2.29e+00

Pos logit contributions
 sip+0.020
le+0.019
bs+0.018
of+0.018
war+0.018
Neg logit contributions
ich-0.021
 playing-0.020
 saving-0.019
 office-0.019
 seeking-0.019
Token party
Token ID2151
Feature activation+2.91e+00

Pos logit contributions
ich+0.124
 playing+0.117
 neighborhood+0.114
 smartest+0.109
 office+0.108
Neg logit contributions
le-0.167
of-0.159
umbs-0.156
bs-0.154
war-0.152
Token and
Token ID290
Feature activation-3.46e+00

Pos logit contributions
ich+0.162
 neighborhood+0.156
 playing+0.153
 factory+0.147
 swamp+0.146
Neg logit contributions
le-0.209
of-0.195
umbs-0.191
les-0.185
war-0.184
Token her
Token ID607
Feature activation+4.10e-01

Pos logit contributions
bs+0.171
of+0.170
le+0.161
les+0.160
 sip+0.150
Neg logit contributions
ich-0.262
 neighborhood-0.240
gy-0.229
 factory-0.227
 swamp-0.226
Token parents
Token ID3397
Feature activation+7.37e+00

Pos logit contributions
ich+0.024
 office+0.023
 exploring+0.022
 seeking+0.022
 finding+0.022
Neg logit contributions
bs-0.023
 sip-0.021
les-0.021
forts-0.021
umbs-0.021
Token were
Token ID547
Feature activation-1.31e+00

Pos logit contributions
ich+0.392
 office+0.360
 playing+0.354
 neighborhood+0.344
 seeking+0.335
Neg logit contributions
le-0.571
of-0.555
les-0.538
bs-0.522
umbs-0.518
Token very
Token ID845
Feature activation+5.97e+00

Pos logit contributions
umbs+0.087
le+0.087
bs+0.083
les+0.083
ck+0.081
Neg logit contributions
ich-0.063
 neighborhood-0.057
 seeker-0.056
gy-0.055
 Squirrel-0.054
Token wealthy
Token ID11574
Feature activation+3.93e+00

Pos logit contributions
ich+0.353
 playing+0.341
 neighborhood+0.340
 office+0.340
 backyard+0.315
Neg logit contributions
le-0.397
bs-0.394
of-0.388
umbs-0.383
war-0.373
Token.
Token ID13
Feature activation-8.27e-01

Pos logit contributions
ich+0.312
 playing+0.297
 neighborhood+0.294
 smartest+0.291
 happiest+0.288
Neg logit contributions
le-0.187
les-0.162
umbs-0.159
of-0.156
bs-0.147
Token They
Token ID1119
Feature activation-2.87e+00

Pos logit contributions
le+0.057
of+0.055
umbs+0.055
war+0.054
les+0.054
Neg logit contributions
 playing-0.050
ich-0.050
 office-0.048
 neighborhood-0.046
 living-0.045
Token offered
Token ID4438
Feature activation-1.42e+00

Pos logit contributions
le+0.195
of+0.188
bs+0.186
les+0.182
forts+0.174
Neg logit contributions
ich-0.181
 office-0.160
 neighborhood-0.160
 playing-0.157
 neighbourhood-0.146
Token and
Token ID290
Feature activation-3.56e+00

Pos logit contributions
 playing+0.152
 living+0.133
 neighborhood+0.128
 office+0.125
 exploring+0.125
Neg logit contributions
le-0.185
umbs-0.183
of-0.181
war-0.178
les-0.175
Token bumped
Token ID38034
Feature activation+2.56e+00

Pos logit contributions
le+0.270
umbs+0.259
of+0.256
war+0.255
les+0.254
Neg logit contributions
 playing-0.205
 living-0.180
ich-0.180
 office-0.180
 neighborhood-0.177
Token.
Token ID13
Feature activation-1.09e+00

Pos logit contributions
 playing+0.158
 living+0.136
ich+0.135
 neighborhood+0.130
 exploring+0.127
Neg logit contributions
le-0.185
of-0.185
war-0.182
umbs-0.181
forts-0.178
Token They
Token ID1119
Feature activation-2.94e+00

Pos logit contributions
le+0.084
les+0.079
of+0.078
war+0.077
umbs+0.076
Neg logit contributions
 playing-0.065
 neighborhood-0.059
 living-0.058
 office-0.057
ich-0.056
Token cried
Token ID16896
Feature activation-3.86e+00

Pos logit contributions
le+0.228
umbs+0.225
war+0.224
of+0.221
les+0.217
Neg logit contributions
 playing-0.159
 living-0.138
 neighborhood-0.135
ich-0.135
 office-0.133
Token and
Token ID290
Feature activation-3.56e+00

Pos logit contributions
war+0.296
of+0.291
umbs+0.291
le+0.288
les+0.278
Neg logit contributions
 playing-0.231
 living-0.205
 office-0.180
 neighborhood-0.178
 exploring-0.177
Token shouted
Token ID17293
Feature activation-3.10e+00

Pos logit contributions
le+0.240
of+0.230
umbs+0.230
war+0.228
les+0.225
Neg logit contributions
 playing-0.233
 office-0.213
ich-0.212
 neighborhood-0.210
 living-0.209
Token.
Token ID13
Feature activation-1.05e+00

Pos logit contributions
le+0.233
umbs+0.227
of+0.227
war+0.227
les+0.219
Neg logit contributions
 playing-0.180
 living-0.159
 office-0.157
ich-0.155
 neighborhood-0.154
Token They
Token ID1119
Feature activation-2.90e+00

Pos logit contributions
le+0.073
of+0.070
umbs+0.070
war+0.070
les+0.069
Neg logit contributions
 playing-0.064
 office-0.060
ich-0.060
 neighborhood-0.059
 living-0.058
Token wanted
Token ID2227
Feature activation+3.22e+00

Pos logit contributions
le+0.225
umbs+0.218
war+0.218
of+0.217
les+0.213
Neg logit contributions
 playing-0.157
ich-0.140
 office-0.140
 neighborhood-0.139
 living-0.138
Token to
Token ID284
Feature activation-2.09e-01

Pos logit contributions
 playing+0.195
 office+0.177
 living+0.174
 neighborhood+0.173
ich+0.173
Neg logit contributions
le-0.238
umbs-0.229
of-0.226
war-0.226
les-0.225
Token learned
Token ID4499
Feature activation-1.58e+00

Pos logit contributions
 office+0.071
ich+0.071
 playing+0.069
 factory+0.066
 neighborhood+0.065
Neg logit contributions
le-0.080
les-0.076
of-0.073
bs-0.069
war-0.069
Token something
Token ID1223
Feature activation+8.54e-01

Pos logit contributions
le+0.119
of+0.114
war+0.114
umbs+0.113
les+0.112
Neg logit contributions
 playing-0.089
ich-0.082
 office-0.081
 neighborhood-0.081
 living-0.079
Token new
Token ID649
Feature activation-5.19e-01

Pos logit contributions
ich+0.052
 playing+0.052
 neighborhood+0.051
 seeker+0.050
 office+0.050
Neg logit contributions
le-0.057
les-0.055
of-0.054
war-0.052
bs-0.051
Token.
Token ID13
Feature activation-6.89e-01

Pos logit contributions
le+0.028
of+0.026
les+0.025
war+0.025
bs+0.024
Neg logit contributions
ich-0.039
 playing-0.038
 neighborhood-0.037
 office-0.036
 seeker-0.035
Token He
Token ID679
Feature activation+1.38e+00

Pos logit contributions
umbs+0.042
of+0.041
war+0.041
le+0.040
les+0.039
Neg logit contributions
ich-0.046
 playing-0.046
 office-0.043
 neighborhood-0.042
 living-0.041
Token told
Token ID1297
Feature activation-2.33e+00

Pos logit contributions
 playing+0.089
ich+0.089
 office+0.084
 neighborhood+0.081
 living+0.081
Neg logit contributions
le-0.089
of-0.086
les-0.084
bs-0.080
war-0.080
Token his
Token ID465
Feature activation+1.56e+00

Pos logit contributions
le+0.173
of+0.166
umbs+0.166
war+0.166
les+0.163
Neg logit contributions
 playing-0.135
 office-0.124
ich-0.123
 neighborhood-0.122
 living-0.120
Token friends
Token ID2460
Feature activation+8.54e+00

Pos logit contributions
 playing+0.072
ich+0.069
 neighborhood+0.067
 office+0.065
 living+0.063
Neg logit contributions
le-0.124
of-0.123
bs-0.120
umbs-0.120
war-0.119
Token about
Token ID546
Feature activation-1.59e+00

Pos logit contributions
 playing+0.534
ich+0.529
 neighborhood+0.510
 office+0.508
 living+0.482
Neg logit contributions
le-0.567
of-0.549
les-0.533
war-0.531
umbs-0.525
Token snakes
Token ID26042
Feature activation+1.35e+00

Pos logit contributions
le+0.091
of+0.088
les+0.085
war+0.085
umbs+0.083
Neg logit contributions
ich-0.103
 neighborhood-0.101
 seeker-0.099
 office-0.098
 playing-0.096
Token and
Token ID290
Feature activation-3.36e+00

Pos logit contributions
 playing+0.090
ich+0.089
 office+0.087
 neighborhood+0.086
 living+0.081
Neg logit contributions
le-0.085
of-0.080
les-0.079
umbs-0.078
war-0.077
Token of
Token ID286
Feature activation-7.73e+00

Pos logit contributions
 playing+0.152
 living+0.134
 office+0.129
ich+0.121
 exploring+0.121
Neg logit contributions
war-0.219
le-0.219
umbs-0.217
of-0.215
forts-0.212
Token the
Token ID262
Feature activation-1.59e-01

Pos logit contributions
le+0.568
umbs+0.544
of+0.544
war+0.544
les+0.532
Neg logit contributions
 playing-0.462
 office-0.421
 living-0.414
ich-0.410
 neighborhood-0.407
Token bear
Token ID6842
Feature activation+4.95e+00

Pos logit contributions
le+0.011
war+0.010
les+0.010
of+0.010
umbs+0.010
Neg logit contributions
 playing-0.010
ich-0.009
 neighborhood-0.009
 exploring-0.008
 seeking-0.008
Token on
Token ID319
Feature activation-3.81e-01

Pos logit contributions
 playing+0.289
ich+0.269
 office+0.268
 neighborhood+0.265
 living+0.259
Neg logit contributions
le-0.358
of-0.344
umbs-0.342
war-0.341
les-0.337
Token a
Token ID257
Feature activation-3.77e-01

Pos logit contributions
le+0.031
umbs+0.030
forts+0.030
war+0.030
of+0.030
Neg logit contributions
 playing-0.019
 office-0.018
 living-0.017
 neighborhood-0.016
 factory-0.016
Token screen
Token ID3159
Feature activation-2.89e+00

Pos logit contributions
le+0.027
of+0.026
war+0.026
umbs+0.026
les+0.026
Neg logit contributions
 playing-0.022
ich-0.021
 office-0.021
 neighborhood-0.020
 living-0.020
Token.
Token ID13
Feature activation-8.57e-01

Pos logit contributions
le+0.186
of+0.178
war+0.178
umbs+0.177
les+0.174
Neg logit contributions
 playing-0.193
 office-0.179
ich-0.178
 neighborhood-0.177
 living-0.175
Token He
Token ID679
Feature activation+1.34e+00

Pos logit contributions
le+0.076
of+0.071
les+0.070
war+0.070
umbs+0.069
Neg logit contributions
 playing-0.040
 office-0.036
 living-0.036
 neighborhood-0.035
ich-0.033
Token smiled
Token ID13541
Feature activation-3.34e+00

Pos logit contributions
 playing+0.076
ich+0.072
 office+0.071
 neighborhood+0.070
 living+0.068
Neg logit contributions
le-0.100
of-0.096
les-0.094
war-0.094
umbs-0.094
Token.
Token ID13
Feature activation-8.52e-01

Pos logit contributions
le+0.191
of+0.178
les+0.176
umbs+0.176
war+0.173
Neg logit contributions
 playing-0.245
 office-0.234
ich-0.234
 neighborhood-0.231
 living-0.226
Token "
Token ID366
Feature activation+5.38e+00

Pos logit contributions
le+0.070
of+0.066
war+0.065
umbs+0.065
les+0.065
Neg logit contributions
 playing-0.044
 office-0.040
 neighborhood-0.040
 living-0.040
ich-0.039
Token did
Token ID750
Feature activation-2.67e+00

Pos logit contributions
 playing+0.077
ich+0.077
 office+0.077
 neighborhood+0.071
 factory+0.071
Neg logit contributions
le-0.111
of-0.105
les-0.104
bs-0.102
war-0.099
Token not
Token ID407
Feature activation+4.17e+00

Pos logit contributions
le+0.263
of+0.256
war+0.254
umbs+0.254
les+0.252
Neg logit contributions
 playing-0.088
ich-0.077
 office-0.076
 neighborhood-0.074
 living-0.072
Token understand
Token ID1833
Feature activation-1.01e+01

Pos logit contributions
 playing+0.267
 seeker+0.236
 office+0.226
 living+0.223
 smartest+0.220
Neg logit contributions
umbs-0.290
war-0.269
of-0.265
les-0.259
rily-0.249
Token why
Token ID1521
Feature activation+4.76e+00

Pos logit contributions
le+0.784
of+0.755
umbs+0.753
war+0.751
les+0.741
Neg logit contributions
 playing-0.549
 office-0.503
ich-0.501
 neighborhood-0.494
 living-0.485
Token Max
Token ID5436
Feature activation+2.37e+00

Pos logit contributions
 playing+0.228
ich+0.225
 neighborhood+0.220
 office+0.220
 living+0.208
Neg logit contributions
le-0.378
of-0.370
bs-0.364
war-0.361
les-0.361
Token did
Token ID750
Feature activation-2.68e+00

Pos logit contributions
ich+0.117
 office+0.113
 playing+0.113
 neighborhood+0.108
 living+0.106
Neg logit contributions
le-0.196
of-0.183
les-0.181
war-0.176
bs-0.173
Token not
Token ID407
Feature activation+4.16e+00

Pos logit contributions
umbs+0.259
le+0.258
war+0.253
of+0.253
les+0.250
Neg logit contributions
 playing-0.099
 office-0.080
 living-0.078
ich-0.077
 neighborhood-0.075
Token want
Token ID765
Feature activation-1.73e+00

Pos logit contributions
 playing+0.284
 office+0.243
 seeker+0.241
 living+0.238
 neighborhood+0.236
Neg logit contributions
umbs-0.276
war-0.263
les-0.253
of-0.248
le-0.247
Token to
Token ID284
Feature activation-2.75e-01

Pos logit contributions
le+0.135
umbs+0.133
of+0.129
war+0.129
les+0.128
Neg logit contributions
 playing-0.098
 living-0.085
 office-0.083
 neighborhood-0.081
 exploring-0.079
Token marry
Token ID12479
Feature activation+2.71e+00

Pos logit contributions
umbs+0.020
war+0.020
le+0.020
les+0.020
of+0.019
Neg logit contributions
 playing-0.016
ich-0.014
 office-0.014
 neighborhood-0.014
 living-0.014
Token her
Token ID607
Feature activation+3.58e-01

Pos logit contributions
 playing+0.179
ich+0.177
 office+0.174
 neighborhood+0.171
 living+0.166
Neg logit contributions
le-0.177
of-0.166
war-0.162
les-0.162
bs-0.158
Token "
Token ID366
Feature activation+5.34e+00

Pos logit contributions
 playing+0.106
 office+0.104
 neighborhood+0.102
ich+0.102
 smartest+0.093
Neg logit contributions
of-0.225
bs-0.223
umbs-0.223
le-0.222
war-0.222
TokenWe
Token ID1135
Feature activation+2.70e+00

Pos logit contributions
ich+0.223
 playing+0.220
 neighborhood+0.209
 living+0.203
 office+0.191
Neg logit contributions
le-0.477
les-0.458
of-0.444
umbs-0.444
war-0.442
Token used
Token ID973
Feature activation+8.42e-01

Pos logit contributions
 playing+0.176
 neighborhood+0.157
 office+0.156
 living+0.156
ich+0.152
Neg logit contributions
le-0.179
war-0.177
umbs-0.177
of-0.174
les-0.170
Token white
Token ID2330
Feature activation-4.58e+00

Pos logit contributions
 playing+0.046
 living+0.041
 office+0.038
 exploring+0.036
 factory+0.034
Neg logit contributions
le-0.073
umbs-0.070
les-0.067
war-0.067
of-0.066
Token cr
Token ID1067
Feature activation-6.07e+00

Pos logit contributions
le+0.317
umbs+0.311
war+0.309
of+0.301
forts+0.293
Neg logit contributions
 playing-0.298
 office-0.269
 living-0.268
 neighborhood-0.254
 factory-0.250
Tokenay
Token ID323
Feature activation-9.72e+00

Pos logit contributions
le+0.379
 sip+0.370
war+0.365
forts+0.352
ited+0.343
Neg logit contributions
 playing-0.371
 office-0.361
ich-0.356
 living-0.347
 neighborhood-0.332
Tokenons
Token ID684
Feature activation-5.18e+00

Pos logit contributions
le+0.714
umbs+0.692
rily+0.691
war+0.680
 sip+0.666
Neg logit contributions
 playing-0.597
 office-0.509
 living-0.506
 doing-0.452
 fixing-0.442
Token,
Token ID11
Feature activation+2.66e+00

Pos logit contributions
le+0.387
war+0.376
umbs+0.369
les+0.366
of+0.364
Neg logit contributions
 playing-0.301
 living-0.275
 office-0.270
ich-0.258
 neighborhood-0.245
Token Mom
Token ID11254
Feature activation+8.03e-01

Pos logit contributions
 playing+0.156
 office+0.140
 living+0.137
ich+0.134
 neighborhood+0.132
Neg logit contributions
le-0.202
war-0.196
umbs-0.194
les-0.192
of-0.190
Token.
Token ID13
Feature activation-9.12e-01

Pos logit contributions
 playing+0.054
ich+0.051
 office+0.051
 neighborhood+0.050
 living+0.049
Neg logit contributions
le-0.051
of-0.049
umbs-0.049
war-0.048
les-0.048
Token We
Token ID775
Feature activation+1.80e+00

Pos logit contributions
le+0.068
of+0.066
war+0.065
umbs+0.065
les+0.065
Neg logit contributions
 playing-0.052
 office-0.048
ich-0.048
 neighborhood-0.047
 living-0.047
Token fire
Token ID2046
Feature activation-3.38e+00

Pos logit contributions
le+0.019
umbs+0.018
les+0.018
war+0.017
of+0.017
Neg logit contributions
 playing-0.020
ich-0.018
 office-0.018
 neighborhood-0.018
 exploring-0.017
Token wasn
Token ID2492
Feature activation+4.94e+00

Pos logit contributions
le+0.226
les+0.209
umbs+0.207
of+0.202
war+0.197
Neg logit contributions
 playing-0.213
ich-0.202
 neighborhood-0.200
 office-0.200
 seeker-0.196
Token't
Token ID470
Feature activation-2.60e-01

Pos logit contributions
 playing+0.197
ich+0.167
 office+0.160
 living+0.159
 neighborhood+0.150
Neg logit contributions
le-0.457
war-0.440
les-0.440
of-0.436
umbs-0.431
Token coming
Token ID2406
Feature activation+6.14e-01

Pos logit contributions
le+0.021
of+0.020
war+0.020
umbs+0.020
les+0.020
Neg logit contributions
 playing-0.013
 office-0.012
ich-0.012
 neighborhood-0.012
 living-0.012
Token any
Token ID597
Feature activation-3.48e+00

Pos logit contributions
 playing+0.035
ich+0.032
 office+0.032
 living+0.031
 neighborhood+0.031
Neg logit contributions
le-0.046
of-0.044
umbs-0.044
war-0.044
les-0.044
Token closer
Token ID5699
Feature activation-1.09e+01

Pos logit contributions
of+0.225
le+0.223
war+0.222
umbs+0.221
bs+0.214
Neg logit contributions
 playing-0.234
 living-0.215
 office-0.213
 neighborhood-0.213
ich-0.207
Token.
Token ID13
Feature activation-8.09e-01

Pos logit contributions
le+0.746
of+0.724
war+0.724
umbs+0.722
les+0.710
Neg logit contributions
 playing-0.697
 office-0.637
 living-0.628
ich-0.623
 neighborhood-0.622
Token The
Token ID383
Feature activation-5.30e-01

Pos logit contributions
umbs+0.049
war+0.049
of+0.048
le+0.047
bs+0.047
Neg logit contributions
 playing-0.054
ich-0.054
 office-0.051
 neighborhood-0.050
 seeker-0.049
Token 3
Token ID513
Feature activation+2.51e+00

Pos logit contributions
le+0.038
les+0.036
umbs+0.036
 sip+0.035
war+0.034
Neg logit contributions
ich-0.029
 playing-0.028
 exploring-0.025
 neighborhood-0.025
 seeking-0.025
Token-
Token ID12
Feature activation-8.85e-01

Pos logit contributions
 playing+0.154
 office+0.143
ich+0.141
 neighborhood+0.140
 living+0.138
Neg logit contributions
le-0.174
war-0.170
of-0.169
umbs-0.169
les-0.163
Tokenyear
Token ID1941
Feature activation-9.58e+00

Pos logit contributions
le+0.048
umbs+0.048
les+0.046
war+0.045
of+0.044
Neg logit contributions
 playing-0.065
 office-0.064
ich-0.061
 living-0.061
 neighborhood-0.058
Token and
Token ID290
Feature activation-3.42e+00

Pos logit contributions
le+0.092
of+0.084
les+0.084
umbs+0.083
war+0.081
Neg logit contributions
 playing-0.129
ich-0.128
 office-0.127
 neighborhood-0.126
 factory-0.121
Token che
Token ID1125
Feature activation+1.71e+00

Pos logit contributions
bs+0.199
le+0.198
les+0.192
of+0.190
forts+0.186
Neg logit contributions
 office-0.238
ich-0.235
 neighborhood-0.225
 seeker-0.218
 factory-0.216
Tokenwed
Token ID19103
Feature activation-1.12e+01

Pos logit contributions
 playing+0.145
ich+0.138
 office+0.133
gy+0.133
 neighborhood+0.133
Neg logit contributions
umbs-0.067
war-0.065
of-0.063
forts-0.061
 sip-0.058
Token and
Token ID290
Feature activation-3.43e+00

Pos logit contributions
umbs+0.889
le+0.881
war+0.875
forts+0.873
of+0.871
Neg logit contributions
 playing-0.571
 living-0.481
 exploring-0.449
ich-0.429
 office-0.419
Token che
Token ID1125
Feature activation+1.68e+00

Pos logit contributions
le+0.226
of+0.214
les+0.211
war+0.210
umbs+0.210
Neg logit contributions
 playing-0.217
ich-0.216
 office-0.214
 neighborhood-0.210
 living-0.199
Tokenwed
Token ID19103
Feature activation-1.12e+01

Pos logit contributions
 playing+0.135
gy+0.125
 lived+0.122
ich+0.122
 office+0.120
Neg logit contributions
umbs-0.070
war-0.067
of-0.067
forts-0.065
bread-0.063
Token.
Token ID13
Feature activation-8.19e-01

Pos logit contributions
umbs+0.851
le+0.842
war+0.840
of+0.834
forts+0.833
Neg logit contributions
 playing-0.611
 living-0.524
 exploring-0.490
ich-0.471
 office-0.470
Token Lily
Token ID20037
Feature activation+2.29e+00

Pos logit contributions
le+0.063
umbs+0.061
of+0.061
war+0.061
les+0.060
Neg logit contributions
 playing-0.044
ich-0.041
 office-0.041
 neighborhood-0.040
 living-0.039
Token wanted
Token ID2227
Feature activation+3.25e+00

Pos logit contributions
ich+0.129
 playing+0.128
 office+0.127
 neighborhood+0.122
 seeker+0.122
Neg logit contributions
le-0.166
of-0.159
les-0.159
umbs-0.150
war-0.149
Token to
Token ID284
Feature activation-2.06e-01

Pos logit contributions
 playing+0.184
 living+0.160
 office+0.159
 neighborhood+0.156
 exploring+0.153
Neg logit contributions
le-0.256
umbs-0.246
war-0.244
of-0.242
les-0.241
Token try
Token ID1949
Feature activation-6.08e+00

Pos logit contributions
le+0.013
war+0.013
umbs+0.013
les+0.012
of+0.012
Neg logit contributions
 playing-0.014
 office-0.013
 neighborhood-0.013
ich-0.013
 living-0.013
Token Dad
Token ID17415
Feature activation+9.22e-01

Pos logit contributions
le+0.253
umbs+0.239
war+0.239
of+0.238
les+0.235
Neg logit contributions
 playing-0.209
 office-0.186
ich-0.185
 living-0.183
 neighborhood-0.182
Token said
Token ID531
Feature activation-6.35e+00

Pos logit contributions
 playing+0.055
 living+0.047
 office+0.043
 walking+0.042
 neighborhood+0.042
Neg logit contributions
war-0.070
umbs-0.067
le-0.064
its-0.064
ws-0.063
Token.
Token ID13
Feature activation-9.39e-01

Pos logit contributions
le+0.480
war+0.476
umbs+0.472
of+0.466
les+0.453
Neg logit contributions
 playing-0.384
 living-0.336
 office-0.321
 exploring-0.307
ich-0.304
Token "
Token ID366
Feature activation+5.59e+00

Pos logit contributions
le+0.091
of+0.088
umbs+0.088
war+0.088
les+0.087
Neg logit contributions
 playing-0.032
 office-0.028
ich-0.028
 neighborhood-0.027
 living-0.026
TokenWe
Token ID1135
Feature activation+2.85e+00

Pos logit contributions
 playing+0.264
 living+0.249
ich+0.249
 neighborhood+0.242
 office+0.226
Neg logit contributions
le-0.478
les-0.454
of-0.452
umbs-0.449
war-0.446
Token understand
Token ID1833
Feature activation-1.00e+01

Pos logit contributions
 playing+0.201
 neighborhood+0.185
 office+0.184
 living+0.182
ich+0.181
Neg logit contributions
le-0.175
umbs-0.169
war-0.168
of-0.167
les-0.164
Token that
Token ID326
Feature activation+1.24e+00

Pos logit contributions
le+0.691
umbs+0.667
war+0.664
of+0.664
les+0.650
Neg logit contributions
 playing-0.631
 office-0.582
ich-0.578
 neighborhood-0.571
 living-0.569
Token you
Token ID345
Feature activation+2.37e+00

Pos logit contributions
ich+0.073
 neighborhood+0.070
 playing+0.070
 office+0.069
 seeker+0.067
Neg logit contributions
le-0.082
of-0.079
les-0.077
bs-0.076
war-0.076
Token are
Token ID389
Feature activation-3.22e+00

Pos logit contributions
 playing+0.159
 office+0.145
 neighborhood+0.145
ich+0.144
 living+0.143
Neg logit contributions
le-0.153
umbs-0.148
war-0.147
of-0.146
les-0.144
Token excited
Token ID6568
Feature activation-2.48e+00

Pos logit contributions
le+0.222
ck+0.209
bs+0.203
les+0.200
war+0.197
Neg logit contributions
 neighborhood-0.171
ich-0.168
 office-0.165
 playing-0.160
 mall-0.160
Token about
Token ID546
Feature activation-1.58e+00

Pos logit contributions
le+0.168
umbs+0.163
war+0.163
of+0.162
les+0.158
Neg logit contributions
 playing-0.160
 office-0.146
 living-0.145
 neighborhood-0.144
ich-0.143
Token.
Token ID13
Feature activation-9.99e-01

Pos logit contributions
umbs+0.110
war+0.110
le+0.108
of+0.108
forts+0.106
Neg logit contributions
 playing-0.092
 living-0.082
 office-0.077
ich-0.075
 neighborhood-0.074
Token He
Token ID679
Feature activation+1.27e+00

Pos logit contributions
le+0.080
of+0.076
war+0.075
les+0.075
umbs+0.074
Neg logit contributions
 playing-0.054
 living-0.049
 neighborhood-0.048
 office-0.047
ich-0.046
Token laughs
Token ID22051
Feature activation-4.71e+00

Pos logit contributions
 playing+0.090
 office+0.082
 neighborhood+0.082
 living+0.081
ich+0.081
Neg logit contributions
le-0.078
umbs-0.075
war-0.075
of-0.074
les-0.071
Token and
Token ID290
Feature activation-3.47e+00

Pos logit contributions
le+0.303
war+0.294
of+0.293
umbs+0.292
les+0.283
Neg logit contributions
 playing-0.322
ich-0.292
 office-0.291
 living-0.291
 neighborhood-0.288
Token swim
Token ID9422
Feature activation-2.89e+00

Pos logit contributions
le+0.210
of+0.201
war+0.199
umbs+0.199
les+0.197
Neg logit contributions
 playing-0.243
 office-0.231
ich-0.230
 neighborhood-0.228
 living-0.222
Tokens
Token ID82
Feature activation-9.81e+00

Pos logit contributions
le+0.194
umbs+0.190
of+0.186
war+0.185
les+0.180
Neg logit contributions
 playing-0.185
 living-0.167
ich-0.166
 neighborhood-0.166
 office-0.164
Token to
Token ID284
Feature activation-2.54e-01

Pos logit contributions
le+0.701
of+0.673
umbs+0.669
war+0.668
les+0.658
Neg logit contributions
 playing-0.589
 office-0.545
ich-0.545
 neighborhood-0.538
 living-0.528
Token the
Token ID262
Feature activation-2.18e-01

Pos logit contributions
le+0.017
of+0.017
war+0.016
umbs+0.016
les+0.016
Neg logit contributions
 playing-0.016
ich-0.015
 office-0.015
 neighborhood-0.015
 living-0.015
Token edge
Token ID5743
Feature activation+2.01e+00

Pos logit contributions
le+0.017
of+0.016
umbs+0.016
war+0.016
les+0.016
Neg logit contributions
 playing-0.012
 office-0.011
ich-0.011
 neighborhood-0.010
 living-0.010
Token.
Token ID13
Feature activation-1.00e+00

Pos logit contributions
 playing+0.129
 living+0.117
 office+0.117
 neighborhood+0.111
ich+0.105
Neg logit contributions
war-0.134
le-0.133
forts-0.132
umbs-0.131
bs-0.130
Token
Token ID198
Feature activation-2.50e-01

Pos logit contributions
le+0.074
of+0.071
umbs+0.071
war+0.071
les+0.070
Neg logit contributions
 playing-0.058
 office-0.053
ich-0.053
 neighborhood-0.052
 living-0.051