TOP ACTIVATIONS
MAX = -1.000

. They put a band - aid on her knee and
the help of the x - ray , he could see
him . B enny was so surprised he
. He had an x - ray that he shone on
He gave them a band - aid and moved a little
the storm . B enny felt happy that he
. Lily put the band - aid on Ben 's knee
and Tom were playing dress - up in their room .
the lid of the toy - box . He thought for
his toys from his toy - box . He never complained
myself ." B igg ie didn 't listen
need to get a check - up to make sure you
on Jill 's head and bites her hair . Jill screams
knee and put a band - aid on it . She
. " Uh - oh , Tom , we
the girl saw it spark le and she loved it .
nest . B igg ie said , "
veterinarian ?" B obby replied , " I
. She told her three - year - old daughter to
told her three - year - old daughter to attach her

MIN ACTIVATIONS
MIN = -1.000

He found a tiny spider living inside and he laughed as
with their toys in the living room . Sara has a
me here a long time ago . They forgot about me
more freeze tag in the living room . Do you understand
with their blocks in the living room . They liked to
his mom went to the living room . They unp acked
house . He explored the living room and the bedroom .
decided to decor ate the living room . They collected boxes
were both rushing around the living room , laughing and barking
the piano and around the living room . Soon
comes and sees the tidy living room . She smiles and
igo , rush into the living room . Am igo started
with their toys in the living room . But sometimes they
, thick curtain in the living room . She loved it
Mom my coming into the living room . "
with the bell in the living room . She saw a
and I â € ™ ll walk with you around the
noise and came to the living room . She saw the
best friend a long time ago . He was my dog
with their toys in the living room . They liked to

INTERVAL -1.000 - -1.000
CONTAINS 27.324%

got closer , he noticed that the traffic was increasing .
a switch in her mummy 's house and tried to press
back to their home . The ants were very tough and
man told Jon that he wanted him to fly to the
The lake was so big and there were lots
Token.
Token ID13
Feature activation-1.00e+00

Pos logit contributions
 and+0.651
 the+0.650
,+0.522
 a+0.515
.+0.221
Neg logit contributions
umm-8.404
rol-8.347
lor-8.309
bek-8.135
arians-8.112
Token They
Token ID1119
Feature activation-1.00e+00

Pos logit contributions
 and+3.622
,+3.485
 the+3.434
.+3.312
 a+3.242
Neg logit contributions
umm-5.733
lor-5.649
rol-5.560
osity-5.492
arians-5.433
Token put
Token ID1234
Feature activation-1.00e+00

Pos logit contributions
 and+3.884
,+3.710
 the+3.696
 a+3.569
.+3.568
Neg logit contributions
umm-5.332
rol-5.307
lor-5.284
arians-5.154
bek-5.149
Token a
Token ID257
Feature activation-1.00e+00

Pos logit contributions
 and+2.049
,+1.907
.+1.747
 the+1.678
 a+1.588
Neg logit contributions
umm-7.177
rol-7.167
lor-7.069
rob-7.039
bek-6.935
Token band
Token ID4097
Feature activation-1.00e+00

Pos logit contributions
 and+5.075
 the+4.899
,+4.880
.+4.798
 a+4.699
Neg logit contributions
umm-4.202
lor-4.189
bek-4.096
rol-4.076
rob-3.939
Token-
Token ID12
Feature activation-1.00e+00

Pos logit contributions
 and+5.856
 the+5.764
,+5.718
.+5.564
 a+5.554
Neg logit contributions
lor-3.432
rol-3.394
umm-3.377
bek-3.189
rob-3.178
Tokenaid
Token ID1698
Feature activation-1.00e+00

Pos logit contributions
 and+7.257
,+7.090
 the+7.076
.+6.947
 a+6.905
Neg logit contributions
umm-2.136
lor-2.069
rol-2.038
bek-1.871
osity-1.855
Token on
Token ID319
Feature activation-1.00e+00

Pos logit contributions
 and+1.833
 the+1.727
,+1.706
 a+1.573
.+1.547
Neg logit contributions
umm-7.409
lor-7.373
rol-7.334
bek-7.160
rob-7.144
Token her
Token ID607
Feature activation-1.00e+00

Pos logit contributions
 and+2.259
,+2.126
.+1.963
 the+1.949
 a+1.853
Neg logit contributions
lor-6.758
umm-6.688
rol-6.625
arians-6.580
ocol-6.493
Token knee
Token ID10329
Feature activation-1.00e+00

Pos logit contributions
 and+5.171
 the+5.163
,+5.103
 a+4.927
.+4.889
Neg logit contributions
lor-3.857
umm-3.836
arians-3.688
rol-3.633
bek-3.618
Token and
Token ID290
Feature activation-1.00e+00

Pos logit contributions
 the+0.630
 and+0.441
 a+0.441
,+0.364
.+0.149
Neg logit contributions
lor-8.492
rol-8.329
umm-8.292
arians-8.202
rob-8.173
Token the
Token ID262
Feature activation-1.00e+00

Pos logit contributions
 and+2.738
.+2.506
,+2.486
 the+2.236
 a+2.039
Neg logit contributions
umm-6.574
rol-6.459
lor-6.432
ocol-6.386
osity-6.318
Token help
Token ID1037
Feature activation-1.00e+00

Pos logit contributions
 and+5.451
,+5.250
 the+5.225
.+5.108
 a+5.007
Neg logit contributions
umm-3.975
lor-3.892
rol-3.768
arians-3.737
bek-3.701
Token of
Token ID286
Feature activation-1.00e+00

Pos logit contributions
 and+2.905
 the+2.740
 a+2.626
,+2.625
.+2.573
Neg logit contributions
umm-6.348
rol-6.329
lor-6.283
osity-6.173
arians-6.108
Token the
Token ID262
Feature activation-1.00e+00

Pos logit contributions
 and+2.428
,+2.177
.+2.058
 a+1.891
 the+1.882
Neg logit contributions
ocol-6.765
rol-6.683
umm-6.642
lor-6.625
bek-6.607
Token x
Token ID2124
Feature activation-1.00e+00

Pos logit contributions
 and+5.611
,+5.402
 the+5.385
.+5.250
 a+5.185
Neg logit contributions
umm-3.900
lor-3.758
arians-3.698
rol-3.601
rob-3.518
Token-
Token ID12
Feature activation-1.00e+00

Pos logit contributions
 and+3.854
 the+3.766
,+3.635
 a+3.583
.+3.533
Neg logit contributions
lor-5.555
umm-5.436
rol-5.323
arians-5.265
rob-5.263
Tokenray
Token ID2433
Feature activation-1.00e+00

Pos logit contributions
 and+7.139
,+6.988
 the+6.947
.+6.814
 a+6.774
Neg logit contributions
lor-2.275
umm-2.238
rol-2.177
bek-2.062
osity-2.012
Token,
Token ID11
Feature activation-1.00e+00

Pos logit contributions
 and+3.932
 the+3.846
 a+3.755
,+3.680
.+3.624
Neg logit contributions
umm-5.339
lor-5.251
arians-5.086
rol-5.081
bek-5.032
Token he
Token ID339
Feature activation-1.00e+00

Pos logit contributions
 and+2.225
,+2.061
.+2.005
 the+1.788
 a+1.725
Neg logit contributions
umm-6.978
rol-6.848
lor-6.821
arians-6.766
osity-6.738
Token could
Token ID714
Feature activation-1.00e+00

Pos logit contributions
 and+3.950
,+3.929
 the+3.892
.+3.811
 a+3.753
Neg logit contributions
lor-5.159
rol-5.120
arians-5.042
umm-5.002
bek-4.979
Token see
Token ID766
Feature activation-1.00e+00

Pos logit contributions
 and+3.850
,+3.745
 the+3.729
.+3.585
 a+3.512
Neg logit contributions
lor-5.266
rol-5.199
umm-5.131
osity-5.048
arians-5.041
Token him
Token ID683
Feature activation-1.00e+00

Pos logit contributions
 and+3.620
,+3.357
 the+3.190
.+3.108
 a+3.030
Neg logit contributions
umm-5.679
rol-5.578
lor-5.577
osity-5.427
ocol-5.326
Token.
Token ID13
Feature activation-1.00e+00

Pos logit contributions
 and+2.358
 the+2.336
,+2.213
 a+2.125
.+1.976
Neg logit contributions
rol-6.697
lor-6.651
umm-6.602
osity-6.527
ides-6.475
Token
Token ID220
Feature activation-1.00e+00

Pos logit contributions
 and+3.492
 the+3.298
,+3.296
.+3.195
 a+3.166
Neg logit contributions
umm-5.846
lor-5.766
rol-5.702
osity-5.560
arians-5.507
Token
Token ID198
Feature activation-1.00e+00

Pos logit contributions
 and+1.612
,+1.456
 the+1.448
 a+1.285
.+1.240
Neg logit contributions
lor-7.692
rol-7.647
umm-7.589
osity-7.390
ides-7.345
Token
Token ID198
Feature activation-1.00e+00

Pos logit contributions
 and+1.478
 the+1.318
,+1.312
 a+1.172
.+1.168
Neg logit contributions
lor-7.797
umm-7.748
rol-7.724
rob-7.511
bek-7.499
TokenB
Token ID33
Feature activation-1.00e+00

Pos logit contributions
 and+5.365
 the+5.201
,+5.200
.+5.070
 a+5.039
Neg logit contributions
umm-4.034
lor-3.968
rol-3.909
bek-3.771
osity-3.755
Tokenenny
Token ID11870
Feature activation-1.00e+00

Pos logit contributions
 and+7.699
,+7.518
 the+7.510
.+7.384
 a+7.356
Neg logit contributions
umm-1.729
lor-1.647
rol-1.610
bek-1.445
osity-1.437
Token was
Token ID373
Feature activation-1.00e+00

Pos logit contributions
 and+3.170
 the+3.166
,+3.053
 a+3.020
.+3.015
Neg logit contributions
lor-5.916
umm-5.762
arians-5.737
rol-5.721
bek-5.704
Token so
Token ID523
Feature activation-1.00e+00

Pos logit contributions
 and+3.580
,+3.419
.+3.377
 the+3.346
 a+3.096
Neg logit contributions
rol-5.616
umm-5.512
lor-5.500
osity-5.397
beh-5.341
Token surprised
Token ID6655
Feature activation-1.00e+00

Pos logit contributions
 and+3.432
 the+3.245
,+3.235
.+3.179
 a+3.065
Neg logit contributions
umm-5.824
rol-5.718
lor-5.665
osity-5.664
bek-5.537
Token he
Token ID339
Feature activation-1.00e+00

Pos logit contributions
 and+1.491
 the+1.390
,+1.339
 a+1.275
.+1.197
Neg logit contributions
umm-7.765
lor-7.629
rol-7.616
osity-7.483
bek-7.400
Token.
Token ID13
Feature activation-1.00e+00

Pos logit contributions
 the+1.028
 and+0.957
,+0.844
 a+0.810
.+0.649
Neg logit contributions
lor-8.164
rol-8.087
umm-8.057
arians-7.864
osity-7.835
Token He
Token ID679
Feature activation-1.00e+00

Pos logit contributions
 and+3.286
,+3.124
 the+3.071
.+3.002
 a+2.955
Neg logit contributions
umm-6.066
rol-5.913
lor-5.912
arians-5.800
osity-5.796
Token had
Token ID550
Feature activation-1.00e+00

Pos logit contributions
 and+3.752
,+3.682
 the+3.663
.+3.614
 a+3.509
Neg logit contributions
lor-5.307
rol-5.276
arians-5.234
bek-5.231
umm-5.213
Token an
Token ID281
Feature activation-1.00e+00

Pos logit contributions
 and+3.780
,+3.690
.+3.510
 the+3.447
 a+3.141
Neg logit contributions
lor-5.532
rol-5.448
umm-5.188
osity-5.183
arians-5.145
Token x
Token ID2124
Feature activation-1.00e+00

Pos logit contributions
 and+4.798
 the+4.655
,+4.630
.+4.527
 a+4.456
Neg logit contributions
umm-4.559
rol-4.418
lor-4.417
arians-4.252
bek-4.233
Token-
Token ID12
Feature activation-1.00e+00

Pos logit contributions
 and+4.278
 the+4.158
,+4.091
.+3.991
 a+3.960
Neg logit contributions
lor-5.067
umm-5.056
rol-4.967
bek-4.815
rob-4.804
Tokenray
Token ID2433
Feature activation-1.00e+00

Pos logit contributions
 and+6.989
,+6.836
 the+6.813
.+6.680
 a+6.628
Neg logit contributions
umm-2.373
lor-2.354
rol-2.304
bek-2.141
osity-2.123
Token that
Token ID326
Feature activation-1.00e+00

Pos logit contributions
 and+3.503
 the+3.463
,+3.354
 a+3.303
.+3.203
Neg logit contributions
umm-5.685
lor-5.668
rol-5.576
osity-5.436
bek-5.436
Token he
Token ID339
Feature activation-1.00e+00

Pos logit contributions
 and+4.087
,+3.872
.+3.765
 the+3.754
 a+3.649
Neg logit contributions
umm-5.262
rol-5.169
lor-5.140
rob-4.973
bek-4.965
Token shone
Token ID44193
Feature activation-1.00e+00

Pos logit contributions
 and+4.056
,+4.008
 the+3.996
.+3.813
 a+3.792
Neg logit contributions
lor-5.068
rol-5.066
arians-5.024
umm-4.974
bek-4.956
Token on
Token ID319
Feature activation-1.00e+00

Pos logit contributions
 and+2.561
,+2.430
 the+2.372
.+2.253
 a+2.211
Neg logit contributions
umm-6.685
rol-6.652
lor-6.530
osity-6.392
bek-6.388
TokenHe
Token ID1544
Feature activation-1.00e+00

Pos logit contributions
 and+5.020
,+4.876
 the+4.822
.+4.708
 a+4.669
Neg logit contributions
lor-4.397
umm-4.359
rol-4.316
bek-4.239
arians-4.131
Token gave
Token ID2921
Feature activation-1.00e+00

Pos logit contributions
 and+3.743
,+3.723
 the+3.696
.+3.643
 a+3.516
Neg logit contributions
lor-5.296
rol-5.243
bek-5.211
umm-5.172
arians-5.097
Token them
Token ID606
Feature activation-1.00e+00

Pos logit contributions
 and+2.461
,+2.255
.+2.140
 the+1.898
 a+1.878
Neg logit contributions
rol-6.970
umm-6.933
lor-6.807
rob-6.755
ocol-6.671
Token a
Token ID257
Feature activation-1.00e+00

Pos logit contributions
 and+2.331
,+2.177
 the+2.040
.+2.033
 a+1.876
Neg logit contributions
rol-6.901
umm-6.881
lor-6.814
osity-6.677
bek-6.645
Token band
Token ID4097
Feature activation-1.00e+00

Pos logit contributions
 and+4.584
 the+4.450
,+4.422
.+4.347
 a+4.158
Neg logit contributions
lor-4.630
rol-4.549
umm-4.541
bek-4.496
osity-4.437
Token-
Token ID12
Feature activation-1.00e+00

Pos logit contributions
 and+6.348
 the+6.289
,+6.203
.+6.049
 a+6.022
Neg logit contributions
lor-2.905
rol-2.839
umm-2.792
osity-2.631
bek-2.611
Tokenaid
Token ID1698
Feature activation-1.00e+00

Pos logit contributions
 and+7.372
,+7.224
 the+7.188
.+7.057
 a+6.988
Neg logit contributions
umm-2.008
lor-1.972
rol-1.952
bek-1.770
osity-1.739
Token and
Token ID290
Feature activation-1.00e+00

Pos logit contributions
 and+2.677
 the+2.617
,+2.579
 a+2.436
.+2.403
Neg logit contributions
lor-6.435
rol-6.393
umm-6.380
osity-6.217
rob-6.166
Token moved
Token ID3888
Feature activation-1.00e+00

Pos logit contributions
 and+3.822
,+3.771
.+3.635
 the+3.541
 a+3.258
Neg logit contributions
lor-5.135
rob-5.028
umm-4.982
rol-4.950
bek-4.918
Token a
Token ID257
Feature activation-1.00e+00

Pos logit contributions
 and+2.747
,+2.608
 the+2.438
.+2.373
 a+2.306
Neg logit contributions
rol-6.516
lor-6.350
umm-6.316
bek-6.213
ocol-6.187
Token little
Token ID1310
Feature activation-1.00e+00

Pos logit contributions
 and+4.594
 the+4.506
,+4.429
 a+4.339
.+4.298
Neg logit contributions
lor-4.699
umm-4.633
bek-4.556
rol-4.522
osity-4.359
Token the
Token ID262
Feature activation-1.00e+00

Pos logit contributions
 and+3.591
,+3.375
 the+3.141
.+3.093
 a+3.092
Neg logit contributions
rol-5.522
umm-5.481
ocol-5.444
arians-5.406
lor-5.401
Token storm
Token ID6388
Feature activation-1.00e+00

Pos logit contributions
 and+4.975
,+4.788
 the+4.769
 a+4.592
.+4.590
Neg logit contributions
umm-4.496
rol-4.319
lor-4.301
arians-4.265
bek-4.079
Token.
Token ID13
Feature activation-1.00e+00

Pos logit contributions
 and+1.915
 the+1.898
 a+1.864
,+1.743
.+1.486
Neg logit contributions
umm-7.218
rol-7.172
lor-7.155
arians-7.039
osity-6.958
Token
Token ID198
Feature activation-1.00e+00

Pos logit contributions
 and+4.278
,+4.106
 the+4.081
.+3.963
 a+3.940
Neg logit contributions
umm-5.192
lor-5.099
rol-5.086
osity-4.877
pill-4.825
Token
Token ID198
Feature activation-1.00e+00

Pos logit contributions
 and+1.917
,+1.770
 the+1.745
.+1.608
 a+1.582
Neg logit contributions
lor-7.433
umm-7.385
rol-7.376
bek-7.153
rob-7.136
TokenB
Token ID33
Feature activation-1.00e+00

Pos logit contributions
 and+5.447
,+5.304
 the+5.280
.+5.160
 a+5.104
Neg logit contributions
umm-3.973
lor-3.954
rol-3.897
bek-3.750
osity-3.701
Tokenenny
Token ID11870
Feature activation-1.00e+00

Pos logit contributions
 and+7.881
 the+7.770
,+7.721
.+7.577
 a+7.576
Neg logit contributions
umm-1.587
lor-1.523
rol-1.494
osity-1.329
bek-1.328
Token felt
Token ID2936
Feature activation-1.00e+00

Pos logit contributions
 and+3.417
 the+3.403
,+3.315
.+3.299
 a+3.277
Neg logit contributions
lor-5.761
rol-5.565
umm-5.493
arians-5.488
bek-5.466
Token happy
Token ID3772
Feature activation-1.00e+00

Pos logit contributions
 and+3.418
,+3.200
 the+3.121
.+3.102
 a+2.943
Neg logit contributions
rol-5.800
umm-5.796
lor-5.767
osity-5.581
beh-5.523
Token that
Token ID326
Feature activation-1.00e+00

Pos logit contributions
 and+2.781
 the+2.750
,+2.660
 a+2.589
.+2.533
Neg logit contributions
umm-6.331
lor-6.237
rol-6.109
osity-6.088
arians-5.921
Token he
Token ID339
Feature activation-1.00e+00

Pos logit contributions
 and+2.653
,+2.384
.+2.326
 the+2.295
 a+2.204
Neg logit contributions
umm-6.753
lor-6.573
rol-6.532
osity-6.440
bek-6.397
Token.
Token ID13
Feature activation-1.00e+00

Pos logit contributions
 the+1.537
 and+1.490
,+1.368
 a+1.328
.+1.129
Neg logit contributions
umm-7.548
rol-7.532
lor-7.517
bek-7.338
arians-7.315
Token Lily
Token ID20037
Feature activation-1.00e+00

Pos logit contributions
 and+3.606
,+3.509
 the+3.406
.+3.339
 a+3.257
Neg logit contributions
umm-5.709
lor-5.604
rol-5.561
osity-5.523
arians-5.417
Token put
Token ID1234
Feature activation-1.00e+00

Pos logit contributions
 and+3.522
 the+3.511
,+3.469
.+3.416
 a+3.376
Neg logit contributions
lor-5.565
rol-5.450
umm-5.408
arians-5.343
rob-5.308
Token the
Token ID262
Feature activation-1.00e+00

Pos logit contributions
 and+2.485
,+2.343
.+2.204
 the+2.110
 a+1.978
Neg logit contributions
rol-6.845
rob-6.746
umm-6.736
lor-6.733
ocol-6.552
Token band
Token ID4097
Feature activation-1.00e+00

Pos logit contributions
 and+4.544
,+4.440
 the+4.354
.+4.256
 a+4.130
Neg logit contributions
umm-4.761
lor-4.755
rol-4.645
rob-4.475
arians-4.474
Token-
Token ID12
Feature activation-1.00e+00

Pos logit contributions
 and+6.397
 the+6.328
,+6.278
.+6.130
 a+6.084
Neg logit contributions
lor-2.872
rol-2.842
umm-2.717
bek-2.590
osity-2.586
Tokenaid
Token ID1698
Feature activation-1.00e+00

Pos logit contributions
 and+7.351
,+7.196
 the+7.161
.+7.039
 a+6.987
Neg logit contributions
umm-2.048
lor-2.004
rol-1.979
bek-1.802
osity-1.776
Token on
Token ID319
Feature activation-1.00e+00

Pos logit contributions
 and+1.913
 the+1.798
,+1.798
.+1.658
 a+1.649
Neg logit contributions
lor-7.268
umm-7.265
rol-7.236
bek-7.052
osity-7.028
Token Ben
Token ID3932
Feature activation-1.00e+00

Pos logit contributions
 and+2.582
,+2.458
 the+2.349
.+2.312
 a+2.221
Neg logit contributions
lor-6.499
umm-6.480
rol-6.351
arians-6.254
bek-6.214
Token's
Token ID338
Feature activation-1.00e+00

Pos logit contributions
 the+2.433
 and+2.346
 a+2.250
,+2.247
.+2.102
Neg logit contributions
lor-6.611
ides-6.435
rol-6.428
umm-6.414
arians-6.388
Token knee
Token ID10329
Feature activation-1.00e+00

Pos logit contributions
 the+5.282
 and+5.272
,+5.223
 a+5.018
.+4.968
Neg logit contributions
lor-3.907
umm-3.819
arians-3.617
ides-3.596
rol-3.575
Token and
Token ID290
Feature activation-1.00e+00

Pos logit contributions
 and+2.345
 the+2.320
,+2.245
 a+2.186
.+2.155
Neg logit contributions
lor-6.682
umm-6.665
rol-6.590
rob-6.490
arians-6.443
Token Tom
Token ID4186
Feature activation-1.00e+00

Pos logit contributions
 and+2.873
,+2.680
 the+2.580
.+2.579
 a+2.404
Neg logit contributions
umm-6.307
lor-6.288
rol-6.233
bek-6.150
osity-6.142
Token were
Token ID547
Feature activation-1.00e+00

Pos logit contributions
 and+3.141
 the+3.012
,+2.988
.+2.861
 a+2.855
Neg logit contributions
umm-6.030
rol-5.990
lor-5.968
osity-5.793
rob-5.781
Token playing
Token ID2712
Feature activation-1.00e+00

Pos logit contributions
 and+3.606
,+3.422
 the+3.422
.+3.314
 a+3.218
Neg logit contributions
umm-5.724
rol-5.703
lor-5.636
bek-5.456
osity-5.452
Token dress
Token ID6576
Feature activation-1.00e+00

Pos logit contributions
 and+1.993
 the+1.822
,+1.822
.+1.652
 a+1.615
Neg logit contributions
umm-7.117
rol-7.047
lor-7.031
rob-6.890
bek-6.857
Token-
Token ID12
Feature activation-1.00e+00

Pos logit contributions
 and+3.393
 the+3.324
,+3.234
 a+3.138
.+3.048
Neg logit contributions
umm-5.863
lor-5.843
rol-5.830
arians-5.624
rob-5.567
Tokenup
Token ID929
Feature activation-1.00e+00

Pos logit contributions
 and+6.218
 the+6.045
,+6.036
.+5.910
 a+5.892
Neg logit contributions
umm-3.182
lor-3.089
rol-3.036
bek-2.894
osity-2.887
Token in
Token ID287
Feature activation-1.00e+00

Pos logit contributions
 the+1.236
 and+1.223
,+1.087
 a+1.024
.+0.899
Neg logit contributions
umm-7.936
rol-7.852
lor-7.839
arians-7.705
rob-7.633
Token their
Token ID511
Feature activation-1.00e+00

Pos logit contributions
 and+2.114
,+1.915
.+1.715
 the+1.542
 a+1.489
Neg logit contributions
umm-7.082
rol-6.986
lor-6.960
ocol-6.938
rob-6.845
Token room
Token ID2119
Feature activation-1.00e+00

Pos logit contributions
 and+4.399
,+4.178
 the+4.113
.+4.062
 a+4.026
Neg logit contributions
umm-4.967
lor-4.776
rol-4.722
bek-4.641
osity-4.600
Token.
Token ID13
Feature activation-1.00e+00

Pos logit contributions
 the+0.613
 and+0.588
 a+0.420
,+0.378
.+0.138
Neg logit contributions
umm-8.516
lor-8.366
rol-8.328
arians-8.280
ides-8.146
Token the
Token ID262
Feature activation-1.00e+00

Pos logit contributions
 and+1.147
,+1.029
.+0.779
 the+0.744
 a+0.684
Neg logit contributions
umm-8.060
lor-7.967
rol-7.897
bek-7.754
rob-7.728
Token lid
Token ID19789
Feature activation-1.00e+00

Pos logit contributions
 and+4.661
,+4.499
 the+4.465
.+4.325
 a+4.301
Neg logit contributions
umm-4.764
lor-4.628
rol-4.550
bek-4.423
osity-4.384
Token of
Token ID286
Feature activation-1.00e+00

Pos logit contributions
 and+0.671
 the+0.670
,+0.515
 a+0.515
.+0.250
Neg logit contributions
umm-8.438
lor-8.414
rol-8.410
ides-8.140
osity-8.101
Token the
Token ID262
Feature activation-1.00e+00

Pos logit contributions
 and+3.665
,+3.477
.+3.294
 the+3.062
 a+2.962
Neg logit contributions
umm-5.608
ocol-5.561
rol-5.454
lor-5.349
osity-5.270
Token toy
Token ID13373
Feature activation-1.00e+00

Pos logit contributions
 and+4.594
 the+4.436
,+4.429
.+4.260
 a+4.251
Neg logit contributions
umm-4.849
lor-4.672
rol-4.601
bek-4.501
arians-4.462
Token-
Token ID12
Feature activation-1.00e+00

Pos logit contributions
 and+4.138
 the+4.066
,+3.921
 a+3.896
.+3.712
Neg logit contributions
umm-5.109
lor-5.106
rol-4.947
arians-4.864
bek-4.779
Tokenbox
Token ID3524
Feature activation-1.00e+00

Pos logit contributions
 and+6.972
,+6.807
 the+6.778
.+6.644
 a+6.618
Neg logit contributions
umm-2.422
lor-2.392
rol-2.345
bek-2.180
osity-2.165
Token.
Token ID13
Feature activation-1.00e+00

Pos logit contributions
 the+1.212
 and+1.181
 a+1.028
,+1.017
.+0.750
Neg logit contributions
lor-7.943
umm-7.939
rol-7.888
ides-7.721
arians-7.696
Token He
Token ID679
Feature activation-1.00e+00

Pos logit contributions
 and+3.514
,+3.346
 the+3.319
.+3.211
 a+3.167
Neg logit contributions
umm-5.932
lor-5.848
rol-5.762
osity-5.628
bek-5.613
Token thought
Token ID1807
Feature activation-1.00e+00

Pos logit contributions
 and+3.487
 the+3.365
,+3.360
.+3.254
 a+3.174
Neg logit contributions
umm-5.754
lor-5.746
rol-5.671
bek-5.616
arians-5.537
Token for
Token ID329
Feature activation-1.00e+00

Pos logit contributions
 and+1.778
,+1.589
 the+1.583
.+1.551
 a+1.471
Neg logit contributions
umm-7.363
lor-7.302
rol-7.247
osity-7.179
ocol-7.124
Token his
Token ID465
Feature activation-1.00e+00

Pos logit contributions
 and+2.023
,+1.848
 the+1.740
.+1.670
 a+1.625
Neg logit contributions
rol-6.979
umm-6.960
ocol-6.904
lor-6.879
rob-6.787
Token toys
Token ID14958
Feature activation-1.00e+00

Pos logit contributions
 and+4.925
 the+4.714
,+4.674
.+4.567
 a+4.451
Neg logit contributions
umm-4.316
lor-4.215
ocol-4.192
rol-4.101
bek-4.073
Token from
Token ID422
Feature activation-1.00e+00

Pos logit contributions
 and+1.207
 the+1.187
 a+1.066
,+1.043
.+0.880
Neg logit contributions
lor-7.644
umm-7.638
rol-7.613
arians-7.471
bek-7.434
Token his
Token ID465
Feature activation-1.00e+00

Pos logit contributions
 and+2.733
,+2.526
.+2.359
 the+2.298
 a+2.245
Neg logit contributions
umm-6.374
ocol-6.274
rob-6.272
rol-6.253
arians-6.244
Token toy
Token ID13373
Feature activation-1.00e+00

Pos logit contributions
 and+4.727
 the+4.512
,+4.423
 a+4.336
.+4.334
Neg logit contributions
umm-4.657
lor-4.465
arians-4.431
bek-4.371
agna-4.312
Token-
Token ID12
Feature activation-1.00e+00

Pos logit contributions
 and+4.442
 the+4.360
 a+4.198
,+4.159
.+4.049
Neg logit contributions
lor-4.848
umm-4.829
arians-4.703
rol-4.662
rob-4.532
Tokenbox
Token ID3524
Feature activation-1.00e+00

Pos logit contributions
 and+6.848
,+6.679
 the+6.655
.+6.521
 a+6.493
Neg logit contributions
lor-2.525
umm-2.520
rol-2.474
bek-2.312
osity-2.272
Token.
Token ID13
Feature activation-1.00e+00

Pos logit contributions
 the+2.513
 and+2.433
 a+2.308
,+2.231
.+2.059
Neg logit contributions
lor-6.709
umm-6.586
rol-6.577
arians-6.479
ides-6.385
Token He
Token ID679
Feature activation-1.00e+00

Pos logit contributions
 and+3.039
,+2.858
 the+2.851
 a+2.712
.+2.709
Neg logit contributions
umm-6.317
lor-6.247
rol-6.225
osity-6.060
arians-6.014
Token never
Token ID1239
Feature activation-1.00e+00

Pos logit contributions
 and+3.652
 the+3.505
,+3.493
.+3.376
 a+3.324
Neg logit contributions
umm-5.671
lor-5.629
rol-5.593
bek-5.477
arians-5.425
Token complained
Token ID13832
Feature activation-1.00e+00

Pos logit contributions
 and+4.101
 the+3.953
,+3.893
 a+3.789
.+3.766
Neg logit contributions
umm-5.197
rol-5.077
lor-5.073
osity-5.016
bek-5.000
Token myself
Token ID3589
Feature activation-1.00e+00

Pos logit contributions
 and+2.926
 the+2.855
,+2.773
 a+2.656
.+2.644
Neg logit contributions
rol-6.322
lor-6.231
umm-6.219
osity-5.990
arians-5.941
Token."
Token ID526
Feature activation-1.00e+00

Pos logit contributions
 and+3.160
 the+3.146
,+3.010
 a+2.957
.+2.847
Neg logit contributions
lor-6.001
rol-5.965
umm-5.883
osity-5.699
ides-5.632
Token
Token ID220
Feature activation-1.00e+00

Pos logit contributions
 and+3.196
 the+3.062
,+3.030
.+2.923
 a+2.903
Neg logit contributions
lor-6.094
rol-6.038
umm-6.007
arians-5.811
osity-5.797
Token
Token ID198
Feature activation-1.00e+00

Pos logit contributions
 and+1.628
,+1.479
 the+1.459
 a+1.297
.+1.254
Neg logit contributions
lor-7.763
rol-7.715
umm-7.635
osity-7.419
arians-7.408
Token
Token ID198
Feature activation-1.00e+00

Pos logit contributions
 and+1.632
,+1.483
 the+1.462
.+1.334
 a+1.307
Neg logit contributions
lor-7.751
rol-7.662
umm-7.613
bek-7.441
rob-7.437
TokenB
Token ID33
Feature activation-1.00e+00

Pos logit contributions
 and+5.951
,+5.812
 the+5.784
.+5.679
 a+5.611
Neg logit contributions
lor-3.523
umm-3.434
rol-3.403
bek-3.321
arians-3.271
Tokenigg
Token ID6950
Feature activation-1.00e+00

Pos logit contributions
 and+7.251
 the+7.077
,+7.075
.+6.938
 a+6.916
Neg logit contributions
umm-2.179
lor-2.101
rol-2.065
bek-1.900
osity-1.894
Tokenie
Token ID494
Feature activation-1.00e+00

Pos logit contributions
 and+5.767
,+5.591
 the+5.589
.+5.454
 a+5.435
Neg logit contributions
umm-3.622
lor-3.547
rol-3.511
bek-3.345
osity-3.340
Token didn
Token ID1422
Feature activation-1.00e+00

Pos logit contributions
 the+3.482
 and+3.464
,+3.364
 a+3.331
.+3.318
Neg logit contributions
lor-5.631
rol-5.473
umm-5.467
bek-5.429
arians-5.401
Token't
Token ID470
Feature activation-1.00e+00

Pos logit contributions
 and+4.427
,+4.252
 the+4.249
.+4.115
 a+4.086
Neg logit contributions
umm-4.950
lor-4.888
rol-4.845
osity-4.664
bek-4.662
Token listen
Token ID6004
Feature activation-1.00e+00

Pos logit contributions
 and+4.252
 the+4.097
,+4.076
 a+3.924
.+3.922
Neg logit contributions
umm-5.090
lor-5.004
rol-4.987
osity-4.854
bek-4.826
Token need
Token ID761
Feature activation-1.00e+00

Pos logit contributions
 and+4.236
 the+4.105
,+4.080
.+3.988
 a+3.921
Neg logit contributions
lor-5.005
umm-4.962
rol-4.935
arians-4.848
osity-4.821
Token to
Token ID284
Feature activation-1.00e+00

Pos logit contributions
 and+2.677
,+2.502
 the+2.412
.+2.378
 a+2.150
Neg logit contributions
lor-6.511
rol-6.276
umm-6.257
arians-6.228
osity-6.197
Token get
Token ID651
Feature activation-1.00e+00

Pos logit contributions
 and+4.145
 the+4.023
,+3.944
.+3.795
 a+3.782
Neg logit contributions
lor-5.042
ides-4.882
arians-4.842
umm-4.791
vation-4.731
Token a
Token ID257
Feature activation-1.00e+00

Pos logit contributions
 and+3.695
,+3.521
 the+3.383
.+3.332
 a+3.180
Neg logit contributions
rol-5.468
lor-5.453
beh-5.410
umm-5.382
arians-5.265
Token check
Token ID2198
Feature activation-1.00e+00

Pos logit contributions
 and+5.128
 the+5.041
,+4.914
.+4.822
 a+4.745
Neg logit contributions
lor-4.154
umm-4.038
bek-3.944
rol-3.884
arians-3.864
Token-
Token ID12
Feature activation-1.00e+00

Pos logit contributions
 and+3.232
 the+3.148
,+3.038
 a+2.990
.+2.919
Neg logit contributions
lor-5.919
umm-5.868
rol-5.848
arians-5.730
osity-5.712
Tokenup
Token ID929
Feature activation-1.00e+00

Pos logit contributions
 and+6.728
,+6.562
 the+6.519
.+6.406
 a+6.338
Neg logit contributions
umm-2.647
lor-2.625
rol-2.559
bek-2.421
osity-2.405
Token to
Token ID284
Feature activation-1.00e+00

Pos logit contributions
 and+4.554
 the+4.536
,+4.375
 a+4.342
.+4.231
Neg logit contributions
lor-4.641
umm-4.513
arians-4.478
rol-4.413
ides-4.322
Token make
Token ID787
Feature activation-1.00e+00

Pos logit contributions
 and+4.332
,+4.167
.+3.932
 the+3.870
 a+3.749
Neg logit contributions
lor-4.782
arians-4.667
ocol-4.659
umm-4.651
vation-4.574
Token sure
Token ID1654
Feature activation-1.00e+00

Pos logit contributions
 and+3.580
,+3.387
.+3.115
 the+3.085
 a+2.871
Neg logit contributions
lor-5.755
umm-5.572
beh-5.537
rol-5.535
arians-5.495
Token you
Token ID345
Feature activation-1.00e+00

Pos logit contributions
 and+3.352
,+3.136
 the+2.984
.+2.955
 a+2.946
Neg logit contributions
lor-5.764
umm-5.671
rol-5.653
arians-5.620
osity-5.575
Token on
Token ID319
Feature activation-1.00e+00

Pos logit contributions
 and+1.921
 the+1.894
,+1.817
.+1.683
 a+1.680
Neg logit contributions
rol-7.117
umm-7.025
lor-7.009
plets-6.799
arians-6.767
Token Jill
Token ID22772
Feature activation-1.00e+00

Pos logit contributions
 and+1.612
,+1.539
.+1.347
 the+1.295
 a+1.170
Neg logit contributions
lor-7.250
umm-7.244
arians-7.057
rol-7.055
ocol-7.028
Token's
Token ID338
Feature activation-1.00e+00

Pos logit contributions
 the+1.728
 and+1.652
 a+1.614
,+1.556
.+1.375
Neg logit contributions
lor-7.287
rol-7.243
ides-7.086
umm-7.059
osity-7.052
Token head
Token ID1182
Feature activation-1.00e+00

Pos logit contributions
 the+4.881
 and+4.859
,+4.750
 a+4.656
.+4.530
Neg logit contributions
umm-4.229
lor-4.180
rol-4.004
arians-3.947
ides-3.915
Token and
Token ID290
Feature activation-1.00e+00

Pos logit contributions
 the+0.882
 and+0.688
,+0.655
 a+0.600
.+0.371
Neg logit contributions
rol-8.133
lor-8.126
umm-7.961
osity-7.919
ides-7.871
Token bites
Token ID26081
Feature activation-1.00e+00

Pos logit contributions
 and+5.166
,+5.028
 the+4.929
.+4.881
 a+4.788
Neg logit contributions
umm-3.809
lor-3.767
rol-3.742
osity-3.676
arians-3.663
Token her
Token ID607
Feature activation-1.00e+00

Pos logit contributions
 and+1.983
,+1.865
 the+1.727
 a+1.678
.+1.544
Neg logit contributions
rol-6.977
umm-6.953
ocol-6.808
lor-6.740
osity-6.697
Token hair
Token ID4190
Feature activation-1.00e+00

Pos logit contributions
 and+4.979
 the+4.961
,+4.879
 a+4.662
.+4.575
Neg logit contributions
umm-4.029
lor-4.008
rol-3.869
ides-3.816
arians-3.744
Token.
Token ID13
Feature activation-1.00e+00

Pos logit contributions
 the+0.656
 and+0.642
,+0.516
 a+0.478
.+0.165
Neg logit contributions
umm-8.279
lor-8.267
rol-8.156
ides-8.102
plets-7.984
Token Jill
Token ID22772
Feature activation-1.00e+00

Pos logit contributions
 and+3.606
,+3.441
 the+3.413
.+3.282
 a+3.241
Neg logit contributions
umm-5.727
lor-5.636
rol-5.600
osity-5.509
pill-5.354
Token screams
Token ID26557
Feature activation-1.00e+00

Pos logit contributions
 and+4.669
 the+4.579
,+4.552
 a+4.458
.+4.438
Neg logit contributions
lor-4.424
umm-4.401
rol-4.393
arians-4.268
osity-4.219
Token knee
Token ID10329
Feature activation-1.00e+00

Pos logit contributions
 and+4.791
,+4.739
 the+4.635
.+4.557
 a+4.439
Neg logit contributions
lor-4.216
umm-4.121
bek-4.033
rol-4.008
arians-3.986
Token and
Token ID290
Feature activation-1.00e+00

Pos logit contributions
 the+0.379
 and+0.273
,+0.197
 a+0.195
.+0.068
Neg logit contributions
lor-8.600
umm-8.536
rol-8.489
arians-8.350
bek-8.291
Token put
Token ID1234
Feature activation-1.00e+00

Pos logit contributions
 and+4.356
,+4.282
.+4.232
 the+4.066
 a+3.978
Neg logit contributions
lor-4.660
rob-4.612
bek-4.587
umm-4.584
rol-4.569
Token a
Token ID257
Feature activation-1.00e+00

Pos logit contributions
 and+1.923
,+1.756
.+1.537
 the+1.482
 a+1.310
Neg logit contributions
rol-7.365
umm-7.275
lor-7.262
ocol-7.113
rob-7.105
Token band
Token ID4097
Feature activation-1.00e+00

Pos logit contributions
 and+4.173
,+4.025
 the+4.013
.+3.875
 a+3.783
Neg logit contributions
lor-5.154
umm-5.067
rol-5.023
bek-4.922
vation-4.805
Token-
Token ID12
Feature activation-1.00e+00

Pos logit contributions
 and+6.676
 the+6.561
,+6.531
.+6.342
 a+6.339
Neg logit contributions
lor-2.651
rol-2.603
umm-2.593
bek-2.387
rob-2.372
Tokenaid
Token ID1698
Feature activation-1.00e+00

Pos logit contributions
 and+7.409
,+7.261
 the+7.214
.+7.094
 a+7.020
Neg logit contributions
umm-1.983
lor-1.950
rol-1.933
bek-1.741
osity-1.702
Token on
Token ID319
Feature activation-1.00e+00

Pos logit contributions
 and+1.912
 the+1.792
,+1.775
 a+1.631
.+1.587
Neg logit contributions
umm-7.356
lor-7.336
rol-7.293
bek-7.103
osity-7.075
Token it
Token ID340
Feature activation-1.00e+00

Pos logit contributions
 and+1.769
,+1.602
 the+1.408
.+1.373
 a+1.321
Neg logit contributions
lor-7.321
umm-7.249
rol-7.145
ocol-7.019
arians-7.014
Token.
Token ID13
Feature activation-1.00e+00

Pos logit contributions
 the+0.745
 and+0.703
,+0.543
 a+0.537
.+0.255
Neg logit contributions
lor-8.271
umm-8.216
rol-8.192
ides-7.994
arians-7.994
Token She
Token ID1375
Feature activation-1.00e+00

Pos logit contributions
 and+3.449
,+3.296
 the+3.259
.+3.167
 a+3.102
Neg logit contributions
umm-5.875
lor-5.743
rol-5.670
osity-5.615
arians-5.491
Token.
Token ID13
Feature activation-1.00e+00

Pos logit contributions
 and+2.158
 the+2.054
,+1.999
 a+1.951
.+1.771
Neg logit contributions
rol-6.925
umm-6.911
lor-6.770
arians-6.704
osity-6.656
Token
Token ID198
Feature activation-1.00e+00

Pos logit contributions
 and+3.483
,+3.322
 the+3.295
.+3.170
 a+3.127
Neg logit contributions
umm-5.904
lor-5.825
rol-5.786
arians-5.639
osity-5.624
Token
Token ID198
Feature activation-1.00e+00

Pos logit contributions
 and+1.441
,+1.265
 the+1.259
.+1.123
 a+1.106
Neg logit contributions
umm-7.879
lor-7.870
rol-7.815
bek-7.635
osity-7.617
Token"
Token ID1
Feature activation-1.00e+00

Pos logit contributions
 and+3.423
 the+3.246
,+3.243
.+3.113
 a+3.088
Neg logit contributions
umm-6.009
lor-5.947
rol-5.901
bek-5.759
osity-5.729
TokenUh
Token ID34653
Feature activation-1.00e+00

Pos logit contributions
 and+6.281
,+6.165
 the+6.074
.+5.979
 a+5.908
Neg logit contributions
umm-3.246
lor-3.120
rol-3.094
bek-2.983
arians-2.927
Token-
Token ID12
Feature activation-1.00e+00

Pos logit contributions
 and+3.722
 the+3.533
,+3.492
.+3.366
 a+3.349
Neg logit contributions
umm-5.674
lor-5.608
rol-5.586
bek-5.475
osity-5.396
Tokenoh
Token ID1219
Feature activation-1.00e+00

Pos logit contributions
 and+6.133
 the+6.007
,+5.981
.+5.845
 a+5.836
Neg logit contributions
umm-3.242
lor-3.126
rol-3.091
osity-2.913
arians-2.910
Token,
Token ID11
Feature activation-1.00e+00

Pos logit contributions
 and+3.036
 the+2.893
,+2.799
 a+2.717
.+2.691
Neg logit contributions
umm-6.334
lor-6.249
rol-6.201
bek-6.064
arians-6.029
Token Tom
Token ID4186
Feature activation-1.00e+00

Pos logit contributions
 and+3.098
.+2.822
,+2.802
 the+2.737
 a+2.675
Neg logit contributions
umm-6.201
lor-6.069
rol-6.014
arians-5.961
osity-5.914
Token,
Token ID11
Feature activation-1.00e+00

Pos logit contributions
 and+2.546
 the+2.467
 a+2.335
,+2.315
.+2.211
Neg logit contributions
umm-6.638
lor-6.608
rol-6.491
bek-6.384
arians-6.364
Token we
Token ID356
Feature activation-1.00e+00

Pos logit contributions
 and+3.595
,+3.354
.+3.330
 the+3.247
 a+3.177
Neg logit contributions
umm-5.628
lor-5.583
rol-5.479
osity-5.470
bek-5.443
Token the
Token ID262
Feature activation-1.00e+00

Pos logit contributions
 and+2.941
.+2.720
,+2.687
 the+2.548
 a+2.497
Neg logit contributions
umm-6.321
rol-6.215
bek-6.141
osity-6.124
lor-6.122
Token girl
Token ID2576
Feature activation-1.00e+00

Pos logit contributions
 and+5.018
 the+4.849
,+4.840
.+4.675
 a+4.609
Neg logit contributions
umm-4.465
lor-4.275
rol-4.127
arians-4.093
bek-4.088
Token saw
Token ID2497
Feature activation-1.00e+00

Pos logit contributions
 and+3.548
 the+3.537
,+3.441
 a+3.359
.+3.329
Neg logit contributions
umm-5.544
lor-5.526
rol-5.494
bek-5.469
arians-5.344
Token it
Token ID340
Feature activation-1.00e+00

Pos logit contributions
 and+2.638
,+2.443
 the+2.303
.+2.278
 a+2.105
Neg logit contributions
lor-6.663
rol-6.544
umm-6.513
osity-6.345
arians-6.290
Token spark
Token ID9009
Feature activation-1.00e+00

Pos logit contributions
 and+2.241
 the+2.177
,+2.096
 a+1.970
.+1.868
Neg logit contributions
rol-6.778
umm-6.733
lor-6.728
osity-6.577
arians-6.479
Tokenle
Token ID293
Feature activation-1.00e+00

Pos logit contributions
 and+6.137
 the+6.031
,+5.987
 a+5.886
.+5.781
Neg logit contributions
umm-3.192
rol-3.097
lor-3.074
osity-2.882
ides-2.856
Token and
Token ID290
Feature activation-1.00e+00

Pos logit contributions
 and+2.410
 the+2.313
,+2.280
 a+2.188
.+2.038
Neg logit contributions
rol-6.701
umm-6.652
lor-6.579
osity-6.431
ocol-6.368
Token she
Token ID673
Feature activation-1.00e+00

Pos logit contributions
 and+3.908
,+3.714
 the+3.584
.+3.578
 a+3.452
Neg logit contributions
umm-5.208
rol-5.183
osity-5.116
lor-5.077
bek-5.073
Token loved
Token ID6151
Feature activation-1.00e+00

Pos logit contributions
 and+4.004
 the+3.917
,+3.904
.+3.757
 a+3.712
Neg logit contributions
rol-5.186
lor-5.180
umm-5.130
bek-5.100
arians-4.978
Token it
Token ID340
Feature activation-1.00e+00

Pos logit contributions
 and+2.146
,+1.985
 the+1.889
 a+1.782
.+1.748
Neg logit contributions
umm-7.100
rol-7.026
lor-7.016
arians-6.829
osity-6.816
Token.
Token ID13
Feature activation-1.00e+00

Pos logit contributions
 and+1.077
 the+0.965
,+0.865
 a+0.735
.+0.627
Neg logit contributions
umm-8.097
rol-8.008
lor-7.976
osity-7.742
bek-7.738
Token nest
Token ID16343
Feature activation-1.00e+00

Pos logit contributions
 and+5.034
 the+4.981
,+4.889
 a+4.723
.+4.691
Neg logit contributions
umm-4.359
lor-4.109
rol-4.048
osity-3.970
arians-3.917
Token.
Token ID13
Feature activation-1.00e+00

Pos logit contributions
 and+3.272
 the+3.188
,+3.144
 a+3.040
.+2.880
Neg logit contributions
rol-5.972
umm-5.908
lor-5.907
osity-5.653
bek-5.649
Token
Token ID220
Feature activation-1.00e+00

Pos logit contributions
 and+4.204
,+4.055
 the+4.027
.+3.954
 a+3.889
Neg logit contributions
lor-5.016
umm-4.993
rol-4.979
osity-4.851
arians-4.799
Token
Token ID198
Feature activation-1.00e+00

Pos logit contributions
 and+1.620
,+1.475
 the+1.447
 a+1.286
.+1.251
Neg logit contributions
rol-7.719
lor-7.715
umm-7.577
osity-7.422
ides-7.353
Token
Token ID198
Feature activation-1.00e+00

Pos logit contributions
 and+2.784
,+2.638
 the+2.618
.+2.488
 a+2.465
Neg logit contributions
lor-6.592
rol-6.487
umm-6.429
bek-6.306
rob-6.269
TokenB
Token ID33
Feature activation-1.00e+00

Pos logit contributions
 and+5.727
,+5.572
 the+5.555
.+5.444
 a+5.399
Neg logit contributions
lor-3.673
umm-3.609
rol-3.579
bek-3.487
arians-3.425
Tokenigg
Token ID6950
Feature activation-1.00e+00

Pos logit contributions
 and+7.168
 the+7.012
,+6.997
.+6.857
 a+6.847
Neg logit contributions
umm-2.260
lor-2.188
rol-2.160
bek-1.988
osity-1.985
Tokenie
Token ID494
Feature activation-1.00e+00

Pos logit contributions
 and+5.685
,+5.509
 the+5.508
.+5.371
 a+5.360
Neg logit contributions
umm-3.692
lor-3.627
rol-3.592
osity-3.422
bek-3.421
Token said
Token ID531
Feature activation-1.00e+00

Pos logit contributions
 and+3.563
 the+3.519
,+3.417
 a+3.381
.+3.364
Neg logit contributions
lor-5.548
rol-5.382
umm-5.375
bek-5.307
arians-5.289
Token,
Token ID11
Feature activation-1.00e+00

Pos logit contributions
 and+1.234
 the+1.030
,+0.953
.+0.912
 a+0.866
Neg logit contributions
lor-8.065
umm-8.049
rol-7.989
osity-7.858
bek-7.811
Token "
Token ID366
Feature activation-1.00e+00

Pos logit contributions
 and+2.846
 the+2.702
,+2.654
.+2.566
 a+2.519
Neg logit contributions
lor-6.248
rol-6.193
umm-6.178
osity-6.150
arians-6.013
Token veterinarian
Token ID41404
Feature activation-1.00e+00

Pos logit contributions
 and+6.122
,+5.909
 the+5.865
.+5.746
 a+5.657
Neg logit contributions
umm-3.363
lor-3.348
arians-3.157
rol-3.119
bek-2.980
Token?"
Token ID1701
Feature activation-1.00e+00

Pos logit contributions
 and+3.883
 the+3.857
,+3.703
 a+3.646
.+3.638
Neg logit contributions
lor-5.235
arians-5.102
rol-5.062
umm-5.038
bek-4.953
Token
Token ID220
Feature activation-1.00e+00

Pos logit contributions
 and+4.384
 the+4.196
,+4.191
 a+4.086
.+4.073
Neg logit contributions
umm-4.800
lor-4.783
rol-4.737
arians-4.614
bek-4.554
Token
Token ID198
Feature activation-1.00e+00

Pos logit contributions
 and+1.623
,+1.457
 the+1.451
 a+1.311
.+1.248
Neg logit contributions
lor-7.735
rol-7.679
umm-7.670
osity-7.443
arians-7.432
Token
Token ID198
Feature activation-1.00e+00

Pos logit contributions
 and+1.759
,+1.578
 the+1.569
 a+1.451
.+1.448
Neg logit contributions
lor-7.567
umm-7.461
rol-7.459
arians-7.312
rob-7.288
TokenB
Token ID33
Feature activation-1.00e+00

Pos logit contributions
 and+6.206
,+6.039
 the+6.030
.+5.923
 a+5.882
Neg logit contributions
lor-3.178
umm-3.156
rol-3.088
bek-2.995
arians-2.968
Tokenobby
Token ID11369
Feature activation-1.00e+00

Pos logit contributions
 and+7.849
 the+7.697
,+7.673
.+7.535
 a+7.528
Neg logit contributions
umm-1.601
lor-1.531
rol-1.503
bek-1.329
osity-1.327
Token replied
Token ID8712
Feature activation-1.00e+00

Pos logit contributions
 and+3.978
 the+3.906
,+3.823
.+3.767
 a+3.748
Neg logit contributions
lor-5.153
umm-5.089
rol-5.082
bek-5.006
arians-4.992
Token,
Token ID11
Feature activation-1.00e+00

Pos logit contributions
 and+1.361
 the+1.197
,+1.058
 a+1.025
.+1.018
Neg logit contributions
umm-7.837
lor-7.808
rol-7.752
osity-7.616
bek-7.599
Token "
Token ID366
Feature activation-1.00e+00

Pos logit contributions
 and+2.536
 the+2.383
,+2.346
.+2.239
 a+2.220
Neg logit contributions
umm-6.575
lor-6.556
rol-6.476
osity-6.428
bek-6.385
TokenI
Token ID40
Feature activation-1.00e+00

Pos logit contributions
 and+5.445
,+5.287
 the+5.272
.+5.136
 a+5.107
Neg logit contributions
umm-4.015
lor-3.943
rol-3.901
bek-3.743
osity-3.729
Token.
Token ID13
Feature activation-1.00e+00

Pos logit contributions
 and+0.868
 the+0.760
,+0.700
 a+0.582
.+0.498
Neg logit contributions
umm-8.342
lor-8.264
rol-8.250
bek-8.096
arians-8.068
Token She
Token ID1375
Feature activation-1.00e+00

Pos logit contributions
 and+4.207
,+4.040
 the+4.018
.+3.951
 a+3.906
Neg logit contributions
umm-5.128
rol-4.970
lor-4.895
osity-4.845
bek-4.742
Token told
Token ID1297
Feature activation-1.00e+00

Pos logit contributions
 and+3.384
 the+3.315
,+3.309
.+3.217
 a+3.128
Neg logit contributions
rol-5.699
lor-5.685
umm-5.636
bek-5.632
arians-5.487
Token her
Token ID607
Feature activation-1.00e+00

Pos logit contributions
 and+3.298
,+3.111
.+2.996
 the+2.902
 a+2.820
Neg logit contributions
umm-6.027
rol-5.890
lor-5.831
ocol-5.731
osity-5.598
Token three
Token ID1115
Feature activation-1.00e+00

Pos logit contributions
 and+3.873
,+3.688
 the+3.682
.+3.596
 a+3.492
Neg logit contributions
lor-5.386
umm-5.382
bek-5.203
rol-5.199
osity-5.158
Token-
Token ID12
Feature activation-1.00e+00

Pos logit contributions
 and+4.960
 the+4.919
,+4.766
 a+4.740
.+4.626
Neg logit contributions
lor-4.258
umm-4.236
rol-4.117
bek-4.024
arians-3.972
Tokenyear
Token ID1941
Feature activation-1.00e+00

Pos logit contributions
 and+7.668
 the+7.492
,+7.490
.+7.358
 a+7.336
Neg logit contributions
umm-1.750
lor-1.662
rol-1.620
bek-1.459
osity-1.457
Token-
Token ID12
Feature activation-1.00e+00

Pos logit contributions
 and+3.459
 the+3.328
,+3.276
 a+3.170
.+3.150
Neg logit contributions
umm-5.937
lor-5.861
rol-5.796
bek-5.630
osity-5.619
Tokenold
Token ID727
Feature activation-1.00e+00

Pos logit contributions
 and+6.872
,+6.694
 the+6.692
.+6.554
 a+6.527
Neg logit contributions
umm-2.542
lor-2.478
rol-2.446
bek-2.277
osity-2.262
Token daughter
Token ID4957
Feature activation-1.00e+00

Pos logit contributions
 and+4.934
,+4.749
 the+4.748
.+4.617
 a+4.593
Neg logit contributions
umm-4.336
lor-4.305
rol-4.267
bek-4.133
osity-4.111
Token to
Token ID284
Feature activation-1.00e+00

Pos logit contributions
 and+1.677
 the+1.492
,+1.404
.+1.335
 a+1.311
Neg logit contributions
lor-7.430
rol-7.414
umm-7.359
osity-7.318
bek-7.196
Token told
Token ID1297
Feature activation-1.00e+00

Pos logit contributions
 and+3.384
 the+3.315
,+3.309
.+3.217
 a+3.128
Neg logit contributions
rol-5.699
lor-5.685
umm-5.636
bek-5.632
arians-5.487
Token her
Token ID607
Feature activation-1.00e+00

Pos logit contributions
 and+3.298
,+3.111
.+2.996
 the+2.902
 a+2.820
Neg logit contributions
umm-6.027
rol-5.890
lor-5.831
ocol-5.731
osity-5.598
Token three
Token ID1115
Feature activation-1.00e+00

Pos logit contributions
 and+3.873
,+3.688
 the+3.682
.+3.596
 a+3.492
Neg logit contributions
lor-5.386
umm-5.382
bek-5.203
rol-5.199
osity-5.158
Token-
Token ID12
Feature activation-1.00e+00

Pos logit contributions
 and+4.960
 the+4.919
,+4.766
 a+4.740
.+4.626
Neg logit contributions
lor-4.258
umm-4.236
rol-4.117
bek-4.024
arians-3.972
Tokenyear
Token ID1941
Feature activation-1.00e+00

Pos logit contributions
 and+7.668
 the+7.492
,+7.490
.+7.358
 a+7.336
Neg logit contributions
umm-1.750
lor-1.662
rol-1.620
bek-1.459
osity-1.457
Token-
Token ID12
Feature activation-1.00e+00

Pos logit contributions
 and+3.459
 the+3.328
,+3.276
 a+3.170
.+3.150
Neg logit contributions
umm-5.937
lor-5.861
rol-5.796
bek-5.630
osity-5.619
Tokenold
Token ID727
Feature activation-1.00e+00

Pos logit contributions
 and+6.872
,+6.694
 the+6.692
.+6.554
 a+6.527
Neg logit contributions
umm-2.542
lor-2.478
rol-2.446
bek-2.277
osity-2.262
Token daughter
Token ID4957
Feature activation-1.00e+00

Pos logit contributions
 and+4.934
,+4.749
 the+4.748
.+4.617
 a+4.593
Neg logit contributions
umm-4.336
lor-4.305
rol-4.267
bek-4.133
osity-4.111
Token to
Token ID284
Feature activation-1.00e+00

Pos logit contributions
 and+1.677
 the+1.492
,+1.404
.+1.335
 a+1.311
Neg logit contributions
lor-7.430
rol-7.414
umm-7.359
osity-7.318
bek-7.196
Token attach
Token ID10199
Feature activation-1.00e+00

Pos logit contributions
 and+4.148
,+3.904
.+3.807
 the+3.800
 a+3.685
Neg logit contributions
umm-4.912
lor-4.891
rol-4.852
bek-4.832
arians-4.747
Token her
Token ID607
Feature activation-1.00e+00

Pos logit contributions
 and+2.399
,+2.277
.+2.032
 the+2.020
 a+1.833
Neg logit contributions
rol-6.731
umm-6.606
lor-6.542
ocol-6.420
rob-6.379
Token He
Token ID679
Feature activation-1.00e+00

Pos logit contributions
 and+4.630
,+4.507
 the+4.460
.+4.353
 a+4.239
Neg logit contributions
umm-4.920
rol-4.864
lor-4.692
osity-4.652
arians-4.478
Token found
Token ID1043
Feature activation-1.00e+00

Pos logit contributions
 and+3.746
,+3.628
 the+3.615
.+3.591
 a+3.471
Neg logit contributions
lor-5.458
rol-5.415
umm-5.309
bek-5.294
arians-5.234
Token a
Token ID257
Feature activation-1.00e+00

Pos logit contributions
 and+2.515
,+2.331
.+2.264
 the+2.115
 a+1.896
Neg logit contributions
rol-6.872
lor-6.824
umm-6.768
osity-6.493
rob-6.459
Token tiny
Token ID7009
Feature activation-1.00e+00

Pos logit contributions
 and+4.216
 the+4.093
,+4.012
.+3.970
 a+3.842
Neg logit contributions
umm-5.123
lor-5.057
bek-4.929
rol-4.872
arians-4.780
Token spider
Token ID19230
Feature activation-1.00e+00

Pos logit contributions
 and+5.087
 the+5.035
,+4.880
.+4.868
 a+4.825
Neg logit contributions
umm-4.221
lor-4.036
rol-3.928
bek-3.853
osity-3.827
Token living
Token ID2877
Feature activation-1.00e+00

Pos logit contributions
 the+3.739
 and+3.736
,+3.575
 a+3.553
.+3.469
Neg logit contributions
lor-5.467
umm-5.317
rol-5.316
arians-5.157
rob-5.114
Token inside
Token ID2641
Feature activation-1.00e+00

Pos logit contributions
 and+3.277
,+3.186
 the+3.040
.+3.018
 a+2.937
Neg logit contributions
umm-6.063
lor-5.987
rol-5.974
bek-5.852
osity-5.742
Token and
Token ID290
Feature activation-1.00e+00

Pos logit contributions
 and+1.228
,+1.089
 a+1.015
 the+1.010
.+0.882
Neg logit contributions
rol-7.707
lor-7.637
umm-7.629
ocol-7.411
bek-7.396
Token he
Token ID339
Feature activation-1.00e+00

Pos logit contributions
 and+3.942
,+3.785
.+3.711
 the+3.602
 a+3.460
Neg logit contributions
rol-5.260
umm-5.200
lor-5.150
bek-5.135
osity-5.058
Token laughed
Token ID13818
Feature activation-1.00e+00

Pos logit contributions
 and+3.921
 the+3.807
,+3.791
.+3.698
 a+3.603
Neg logit contributions
rol-5.266
lor-5.259
umm-5.095
bek-5.085
arians-5.043
Token as
Token ID355
Feature activation-1.00e+00

Pos logit contributions
 and+1.601
 the+1.511
,+1.440
 a+1.271
.+1.260
Neg logit contributions
umm-7.546
rol-7.500
lor-7.439
osity-7.314
arians-7.175
Token with
Token ID351
Feature activation-1.00e+00

Pos logit contributions
 and+1.976
,+1.817
 the+1.794
.+1.642
 a+1.586
Neg logit contributions
umm-7.098
lor-7.045
rol-7.031
rob-6.881
arians-6.859
Token their
Token ID511
Feature activation-1.00e+00

Pos logit contributions
 and+2.929
,+2.748
.+2.591
 the+2.467
 a+2.321
Neg logit contributions
umm-6.312
rol-6.274
rob-6.232
ocol-6.209
lor-6.092
Token toys
Token ID14958
Feature activation-1.00e+00

Pos logit contributions
 and+3.904
,+3.787
 the+3.681
.+3.632
 a+3.521
Neg logit contributions
umm-5.401
arians-5.137
rob-5.107
rol-5.102
bek-5.073
Token in
Token ID287
Feature activation-1.00e+00

Pos logit contributions
 and+1.084
 the+1.045
,+0.947
 a+0.912
.+0.731
Neg logit contributions
umm-7.847
lor-7.774
rol-7.772
arians-7.764
rob-7.629
Token the
Token ID262
Feature activation-1.00e+00

Pos logit contributions
 and+1.490
,+1.272
.+1.089
 the+0.909
 a+0.881
Neg logit contributions
umm-7.619
ocol-7.579
rol-7.578
lor-7.576
rob-7.492
Token living
Token ID2877
Feature activation-1.00e+00

Pos logit contributions
 and+4.630
,+4.458
.+4.290
 the+4.274
 a+4.112
Neg logit contributions
umm-4.758
rol-4.551
lor-4.544
rob-4.413
bek-4.366
Token room
Token ID2119
Feature activation-1.00e+00

Pos logit contributions
 and+3.585
,+3.429
 the+3.309
.+3.233
 a+3.215
Neg logit contributions
umm-5.771
lor-5.724
rol-5.645
bek-5.518
rob-5.429
Token.
Token ID13
Feature activation-1.00e+00

Pos logit contributions
 the+0.565
 and+0.557
 a+0.366
,+0.366
.+0.116
Neg logit contributions
umm-8.555
lor-8.463
rol-8.425
arians-8.313
osity-8.214
Token Sara
Token ID24799
Feature activation-1.00e+00

Pos logit contributions
 and+3.748
,+3.585
 the+3.546
.+3.457
 a+3.403
Neg logit contributions
umm-5.604
rol-5.547
lor-5.503
osity-5.304
arians-5.282
Token has
Token ID468
Feature activation-1.00e+00

Pos logit contributions
 and+4.085
 the+3.977
,+3.945
.+3.840
 a+3.834
Neg logit contributions
umm-5.134
lor-5.059
rol-5.057
arians-4.915
rob-4.890
Token a
Token ID257
Feature activation-1.00e+00

Pos logit contributions
 and+0.935
,+0.872
 the+0.600
.+0.586
 a+0.268
Neg logit contributions
rol-8.225
lor-8.164
umm-8.084
rob-7.989
arians-7.941
Token me
Token ID502
Feature activation-1.00e+00

Pos logit contributions
 and+2.954
,+2.829
.+2.669
 the+2.564
 a+2.412
Neg logit contributions
rol-6.266
lor-6.127
ocol-6.116
umm-6.107
rob-5.990
Token here
Token ID994
Feature activation-1.00e+00

Pos logit contributions
 and+4.307
,+4.172
 the+4.133
.+4.009
 a+3.896
Neg logit contributions
rol-4.950
lor-4.885
beh-4.743
bek-4.709
umm-4.655
Token a
Token ID257
Feature activation-1.00e+00

Pos logit contributions
 and+2.377
 the+2.354
,+2.233
 a+2.105
.+2.064
Neg logit contributions
rol-6.526
lor-6.499
umm-6.445
arians-6.320
bek-6.318
Token long
Token ID890
Feature activation-1.00e+00

Pos logit contributions
 and+5.056
 the+4.949
,+4.820
.+4.740
 a+4.714
Neg logit contributions
lor-4.214
umm-4.127
bek-4.117
rol-4.001
arians-3.915
Token time
Token ID640
Feature activation-1.00e+00

Pos logit contributions
 and+4.755
 the+4.706
,+4.548
.+4.539
 a+4.529
Neg logit contributions
lor-4.459
umm-4.192
rol-4.184
ides-4.113
ocol-4.083
Token ago
Token ID2084
Feature activation-1.00e+00

Pos logit contributions
 the+5.795
 and+5.717
 a+5.543
,+5.533
.+5.369
Neg logit contributions
lor-3.203
rol-3.199
umm-3.154
ides-3.143
reetings-3.077
Token.
Token ID13
Feature activation-1.00e+00

Pos logit contributions
 the+2.677
 and+2.643
 a+2.495
,+2.403
.+2.294
Neg logit contributions
lor-6.406
umm-6.306
rol-6.226
ides-6.198
arians-6.161
Token They
Token ID1119
Feature activation-1.00e+00

Pos logit contributions
 and+5.262
,+5.093
 the+5.082
.+4.956
 a+4.887
Neg logit contributions
umm-4.172
lor-4.112
rol-4.016
osity-3.925
arians-3.879
Token forgot
Token ID16453
Feature activation-1.00e+00

Pos logit contributions
 and+3.756
 the+3.629
,+3.605
.+3.480
 a+3.470
Neg logit contributions
umm-5.466
lor-5.449
rol-5.418
arians-5.298
ides-5.267
Token about
Token ID546
Feature activation-1.00e+00

Pos logit contributions
 and+2.236
,+2.095
 the+2.003
 a+1.914
.+1.875
Neg logit contributions
rol-6.691
lor-6.673
osity-6.583
umm-6.566
arians-6.463
Token me
Token ID502
Feature activation-1.00e+00

Pos logit contributions
 and+3.159
,+2.996
.+2.715
 the+2.629
 a+2.584
Neg logit contributions
umm-5.877
arians-5.874
ocol-5.870
lor-5.804
rol-5.696
Token more
Token ID517
Feature activation-1.00e+00

Pos logit contributions
 and+5.339
 the+5.106
,+5.018
 a+4.914
.+4.864
Neg logit contributions
umm-4.094
ocol-3.725
arians-3.703
rol-3.689
bek-3.654
Token freeze
Token ID16611
Feature activation-1.00e+00

Pos logit contributions
 and+4.839
 the+4.706
,+4.658
 a+4.578
.+4.441
Neg logit contributions
umm-4.448
ocol-4.181
osity-4.093
arians-4.073
beh-4.061
Token tag
Token ID7621
Feature activation-1.00e+00

Pos logit contributions
 and+3.735
 the+3.607
 a+3.575
,+3.556
.+3.383
Neg logit contributions
umm-5.357
rol-5.308
lor-5.284
arians-5.161
osity-5.109
Token in
Token ID287
Feature activation-1.00e+00

Pos logit contributions
 and+2.080
 the+2.045
 a+1.936
,+1.919
.+1.728
Neg logit contributions
rol-6.873
lor-6.842
umm-6.832
arians-6.749
osity-6.654
Token the
Token ID262
Feature activation-1.00e+00

Pos logit contributions
 and+3.452
,+3.212
.+3.001
 the+2.969
 a+2.940
Neg logit contributions
umm-5.710
ocol-5.650
lor-5.522
rol-5.501
bek-5.391
Token living
Token ID2877
Feature activation-1.00e+00

Pos logit contributions
 and+4.930
,+4.741
 the+4.708
.+4.533
 a+4.517
Neg logit contributions
umm-4.620
lor-4.336
rol-4.298
bek-4.220
ocol-4.082
Token room
Token ID2119
Feature activation-1.00e+00

Pos logit contributions
 and+4.021
,+3.871
 the+3.825
 a+3.780
.+3.641
Neg logit contributions
umm-5.354
lor-5.272
rol-5.092
bek-5.067
isher-4.999
Token.
Token ID13
Feature activation-1.00e+00

Pos logit contributions
 the+1.473
 and+1.435
 a+1.364
,+1.255
.+1.066
Neg logit contributions
umm-7.583
lor-7.551
rol-7.478
arians-7.383
ides-7.305
Token Do
Token ID2141
Feature activation-1.00e+00

Pos logit contributions
 and+5.396
,+5.236
 the+5.219
.+5.077
 a+5.033
Neg logit contributions
umm-4.074
lor-4.008
rol-3.914
osity-3.799
arians-3.756
Token you
Token ID345
Feature activation-1.00e+00

Pos logit contributions
 and+3.334
,+3.102
 the+3.060
.+3.016
 a+2.921
Neg logit contributions
lor-6.053
umm-5.986
rol-5.870
arians-5.840
osity-5.793
Token understand
Token ID1833
Feature activation-1.00e+00

Pos logit contributions
 and+4.971
 the+4.914
,+4.843
 a+4.682
.+4.682
Neg logit contributions
lor-4.126
umm-4.120
rol-4.105
ides-4.097
reetings-4.014
Token with
Token ID351
Feature activation-1.00e+00

Pos logit contributions
 and+1.799
 the+1.650
,+1.637
.+1.465
 a+1.435
Neg logit contributions
umm-7.289
lor-7.243
rol-7.240
rob-7.102
arians-7.026
Token their
Token ID511
Feature activation-1.00e+00

Pos logit contributions
 and+2.765
,+2.566
.+2.420
 the+2.340
 a+2.204
Neg logit contributions
umm-6.503
rol-6.420
rob-6.374
ocol-6.354
lor-6.272
Token blocks
Token ID7021
Feature activation-1.00e+00

Pos logit contributions
 and+3.716
,+3.586
 the+3.526
.+3.449
 a+3.360
Neg logit contributions
umm-5.538
arians-5.285
rol-5.268
bek-5.241
rob-5.234
Token in
Token ID287
Feature activation-1.00e+00

Pos logit contributions
 and+0.570
 the+0.505
,+0.404
 a+0.377
.+0.165
Neg logit contributions
umm-8.522
rol-8.419
lor-8.406
rob-8.221
arians-8.212
Token the
Token ID262
Feature activation-1.00e+00

Pos logit contributions
 and+1.106
,+0.894
.+0.654
 the+0.574
 a+0.564
Neg logit contributions
umm-8.018
rol-7.949
lor-7.914
ocol-7.854
rob-7.788
Token living
Token ID2877
Feature activation-1.00e+00

Pos logit contributions
 and+4.562
,+4.405
 the+4.263
.+4.220
 a+4.106
Neg logit contributions
umm-4.848
lor-4.671
rol-4.664
rob-4.517
bek-4.458
Token room
Token ID2119
Feature activation-1.00e+00

Pos logit contributions
 and+3.575
,+3.424
 the+3.307
.+3.225
 a+3.202
Neg logit contributions
umm-5.782
lor-5.769
rol-5.680
bek-5.537
rob-5.470
Token.
Token ID13
Feature activation-1.00e+00

Pos logit contributions
 the+0.674
 and+0.659
 a+0.489
,+0.462
.+0.239
Neg logit contributions
umm-8.490
lor-8.366
rol-8.308
arians-8.220
osity-8.096
Token They
Token ID1119
Feature activation-1.00e+00

Pos logit contributions
 and+3.817
,+3.653
 the+3.617
.+3.530
 a+3.469
Neg logit contributions
umm-5.566
rol-5.485
lor-5.441
osity-5.260
arians-5.191
Token liked
Token ID8288
Feature activation-1.00e+00

Pos logit contributions
 and+3.262
,+3.102
 the+3.101
.+2.977
 a+2.948
Neg logit contributions
umm-6.034
rol-5.981
lor-5.916
bek-5.810
osity-5.799
Token to
Token ID284
Feature activation-1.00e+00

Pos logit contributions
 and+1.608
,+1.495
.+1.376
 the+1.333
 a+1.246
Neg logit contributions
umm-7.501
rol-7.428
osity-7.339
lor-7.314
bek-7.243
Token his
Token ID465
Feature activation-1.00e+00

Pos logit contributions
 and+3.111
,+2.934
.+2.801
 the+2.601
 a+2.584
Neg logit contributions
umm-6.079
bek-6.057
rol-5.991
lor-5.920
osity-5.900
Token mom
Token ID1995
Feature activation-1.00e+00

Pos logit contributions
 and+3.258
,+3.056
 the+3.044
.+2.881
 a+2.823
Neg logit contributions
umm-6.105
arians-5.887
lor-5.845
rol-5.784
bek-5.783
Token went
Token ID1816
Feature activation-1.00e+00

Pos logit contributions
 and+3.909
 the+3.825
,+3.782
 a+3.678
.+3.635
Neg logit contributions
rol-5.252
umm-5.240
lor-5.125
bek-5.022
arians-4.977
Token to
Token ID284
Feature activation-1.00e+00

Pos logit contributions
 and+2.412
 the+2.292
,+2.273
 a+2.139
.+2.089
Neg logit contributions
rol-6.739
lor-6.698
umm-6.608
bek-6.495
osity-6.468
Token the
Token ID262
Feature activation-1.00e+00

Pos logit contributions
 and+2.116
,+1.964
.+1.754
 the+1.499
 a+1.470
Neg logit contributions
ocol-6.772
umm-6.747
bek-6.744
lor-6.743
rol-6.727
Token living
Token ID2877
Feature activation-1.00e+00

Pos logit contributions
 and+4.866
,+4.692
 the+4.574
.+4.506
 a+4.368
Neg logit contributions
umm-4.610
lor-4.565
bek-4.351
rol-4.344
arians-4.302
Token room
Token ID2119
Feature activation-1.00e+00

Pos logit contributions
 and+3.632
,+3.519
 the+3.379
.+3.328
 a+3.304
Neg logit contributions
umm-5.683
lor-5.638
rol-5.456
bek-5.428
arians-5.304
Token.
Token ID13
Feature activation-1.00e+00

Pos logit contributions
 the+0.708
 and+0.656
,+0.533
 a+0.515
.+0.280
Neg logit contributions
umm-8.283
lor-8.261
rol-8.238
bek-8.015
osity-8.010
Token They
Token ID1119
Feature activation-1.00e+00

Pos logit contributions
 and+3.589
,+3.450
 the+3.402
.+3.304
 a+3.265
Neg logit contributions
umm-5.790
rol-5.636
lor-5.587
osity-5.525
bek-5.476
Token unp
Token ID8593
Feature activation-1.00e+00

Pos logit contributions
 and+3.524
,+3.391
 the+3.327
.+3.269
 a+3.198
Neg logit contributions
rol-5.668
lor-5.559
umm-5.553
bek-5.528
arians-5.429
Tokenacked
Token ID6021
Feature activation-1.00e+00

Pos logit contributions
 and+7.144
,+6.945
 the+6.944
.+6.814
 a+6.769
Neg logit contributions
umm-2.300
lor-2.205
rol-2.129
osity-2.001
bek-1.995
Token house
Token ID2156
Feature activation-1.00e+00

Pos logit contributions
 and+6.108
,+5.924
 the+5.924
.+5.775
 a+5.757
Neg logit contributions
umm-3.377
lor-3.244
rol-3.203
bek-3.033
osity-2.994
Token.
Token ID13
Feature activation-1.00e+00

Pos logit contributions
 and+1.037
 the+0.935
 a+0.861
,+0.857
.+0.576
Neg logit contributions
umm-8.216
rol-8.017
lor-7.959
arians-7.814
osity-7.784
Token He
Token ID679
Feature activation-1.00e+00

Pos logit contributions
 and+3.852
 the+3.730
,+3.712
.+3.560
 a+3.501
Neg logit contributions
umm-5.693
lor-5.400
osity-5.368
rol-5.339
bek-5.238
Token explored
Token ID18782
Feature activation-1.00e+00

Pos logit contributions
 and+3.796
,+3.742
 the+3.731
.+3.656
 a+3.564
Neg logit contributions
lor-5.336
rol-5.329
umm-5.318
bek-5.260
arians-5.132
Token the
Token ID262
Feature activation-1.00e+00

Pos logit contributions
 and+1.102
,+0.942
.+0.868
 the+0.855
 a+0.786
Neg logit contributions
rol-7.926
umm-7.920
lor-7.853
rob-7.709
bek-7.648
Token living
Token ID2877
Feature activation-1.00e+00

Pos logit contributions
 and+5.564
 the+5.394
,+5.392
.+5.252
 a+5.236
Neg logit contributions
umm-3.891
lor-3.782
rol-3.751
bek-3.581
osity-3.561
Token room
Token ID2119
Feature activation-1.00e+00

Pos logit contributions
 and+3.674
,+3.521
 the+3.507
 a+3.393
.+3.383
Neg logit contributions
umm-5.727
lor-5.612
rol-5.550
bek-5.401
osity-5.329
Token and
Token ID290
Feature activation-1.00e+00

Pos logit contributions
 and+1.543
 the+1.532
,+1.375
 a+1.346
.+1.326
Neg logit contributions
umm-7.533
rol-7.404
lor-7.349
osity-7.215
arians-7.166
Token the
Token ID262
Feature activation-1.00e+00

Pos logit contributions
 and+3.920
,+3.768
.+3.731
 the+3.638
 a+3.540
Neg logit contributions
umm-5.266
rol-5.264
lor-5.160
bek-5.140
osity-5.107
Token bedroom
Token ID14043
Feature activation-1.00e+00

Pos logit contributions
 and+5.168
,+5.019
 the+4.995
.+4.833
 a+4.825
Neg logit contributions
umm-4.359
rol-4.135
lor-4.112
bek-3.951
arians-3.918
Token.
Token ID13
Feature activation-1.00e+00

Pos logit contributions
 and+5.188
 the+5.090
,+4.960
 a+4.933
.+4.825
Neg logit contributions
umm-4.175
rol-3.984
lor-3.984
arians-3.757
osity-3.752
Token decided
Token ID3066
Feature activation-1.00e+00

Pos logit contributions
 and+3.900
,+3.637
.+3.545
 the+3.494
 a+3.420
Neg logit contributions
bek-5.200
umm-5.147
osity-5.136
rol-5.068
arians-5.040
Token to
Token ID284
Feature activation-1.00e+00

Pos logit contributions
 and+1.476
,+1.260
 the+1.204
.+1.121
 a+1.044
Neg logit contributions
lor-7.710
osity-7.572
umm-7.563
rol-7.560
bek-7.526
Token decor
Token ID11705
Feature activation-1.00e+00

Pos logit contributions
 and+4.400
,+4.136
 the+4.105
.+4.007
 a+3.971
Neg logit contributions
umm-4.939
lor-4.811
rol-4.708
arians-4.701
ides-4.678
Tokenate
Token ID378
Feature activation-1.00e+00

Pos logit contributions
 and+6.109
,+5.941
 the+5.841
.+5.755
 a+5.707
Neg logit contributions
lor-3.307
umm-3.275
rol-3.192
rob-3.044
arians-3.010
Token the
Token ID262
Feature activation-1.00e+00

Pos logit contributions
 and+2.047
,+1.885
.+1.598
 the+1.597
 a+1.504
Neg logit contributions
rol-7.151
lor-7.051
umm-7.041
ocol-6.925
rob-6.838
Token living
Token ID2877
Feature activation-1.00e+00

Pos logit contributions
 and+5.245
,+5.086
 the+5.049
.+4.909
 a+4.846
Neg logit contributions
umm-4.258
lor-4.060
rol-4.011
arians-3.875
bek-3.851
Token room
Token ID2119
Feature activation-1.00e+00

Pos logit contributions
 and+4.163
,+3.992
 the+3.934
 a+3.820
.+3.789
Neg logit contributions
umm-5.313
lor-5.233
rol-5.071
bek-4.960
arians-4.860
Token.
Token ID13
Feature activation-1.00e+00

Pos logit contributions
 and+1.708
 the+1.552
,+1.527
 a+1.377
.+1.237
Neg logit contributions
umm-7.515
rol-7.473
lor-7.390
bek-7.282
osity-7.258
Token They
Token ID1119
Feature activation-1.00e+00

Pos logit contributions
 and+3.257
,+3.085
 the+3.049
.+2.935
 a+2.932
Neg logit contributions
umm-6.051
rol-6.020
lor-5.960
osity-5.822
arians-5.788
Token collected
Token ID7723
Feature activation-1.00e+00

Pos logit contributions
 and+4.180
,+4.011
 the+4.006
.+3.871
 a+3.847
Neg logit contributions
umm-5.190
lor-5.101
rol-5.099
bek-4.937
osity-4.917
Token boxes
Token ID10559
Feature activation-1.00e+00

Pos logit contributions
 and+4.043
,+3.894
 the+3.817
.+3.791
 a+3.607
Neg logit contributions
rol-5.174
umm-5.150
lor-5.022
rob-4.867
ocol-4.856
Token were
Token ID547
Feature activation-1.00e+00

Pos logit contributions
 and+4.037
,+3.894
 the+3.879
 a+3.774
.+3.752
Neg logit contributions
rol-5.303
umm-5.107
lor-5.097
bek-5.060
osity-5.053
Token both
Token ID1111
Feature activation-1.00e+00

Pos logit contributions
 and+4.666
,+4.472
 the+4.357
.+4.315
 a+4.235
Neg logit contributions
rol-4.765
umm-4.571
lor-4.536
osity-4.456
beh-4.443
Token rushing
Token ID15795
Feature activation-1.00e+00

Pos logit contributions
 and+4.643
,+4.414
 the+4.390
.+4.268
 a+4.231
Neg logit contributions
rol-4.795
umm-4.679
lor-4.564
osity-4.554
bek-4.475
Token around
Token ID1088
Feature activation-1.00e+00

Pos logit contributions
 and+2.732
,+2.617
 the+2.612
 a+2.498
.+2.426
Neg logit contributions
rol-6.494
umm-6.292
osity-6.204
lor-6.176
beh-6.137
Token the
Token ID262
Feature activation-1.00e+00

Pos logit contributions
 and+2.099
,+1.960
 the+1.912
 a+1.867
.+1.784
Neg logit contributions
umm-6.847
rol-6.821
lor-6.717
arians-6.715
bek-6.620
Token living
Token ID2877
Feature activation-1.00e+00

Pos logit contributions
 and+5.423
,+5.320
 the+5.245
.+5.117
 a+5.103
Neg logit contributions
umm-3.963
lor-3.840
rol-3.769
arians-3.703
rob-3.655
Token room
Token ID2119
Feature activation-1.00e+00

Pos logit contributions
 and+3.793
,+3.672
 the+3.607
 a+3.510
.+3.475
Neg logit contributions
umm-5.554
lor-5.540
rol-5.394
bek-5.318
arians-5.289
Token,
Token ID11
Feature activation-1.00e+00

Pos logit contributions
 the+1.982
 and+1.954
 a+1.799
,+1.790
.+1.578
Neg logit contributions
rol-7.099
umm-7.074
lor-7.006
arians-6.865
osity-6.847
Token laughing
Token ID14376
Feature activation-1.00e+00

Pos logit contributions
 and+4.412
,+4.401
.+4.344
 the+4.312
 a+4.189
Neg logit contributions
rol-4.734
umm-4.567
osity-4.526
lor-4.479
arians-4.453
Token and
Token ID290
Feature activation-1.00e+00

Pos logit contributions
 the+0.971
 and+0.913
,+0.861
 a+0.758
.+0.712
Neg logit contributions
umm-8.026
rol-8.010
lor-7.876
osity-7.798
arians-7.776
Token barking
Token ID48961
Feature activation-1.00e+00

Pos logit contributions
 and+4.613
,+4.402
 the+4.324
 a+4.298
.+4.270
Neg logit contributions
rol-4.712
umm-4.646
osity-4.573
beh-4.456
bek-4.424
Token the
Token ID262
Feature activation-1.00e+00

Pos logit contributions
 and+1.820
,+1.632
 a+1.564
 the+1.553
.+1.400
Neg logit contributions
umm-7.306
rol-7.295
lor-7.139
arians-7.118
ocol-7.019
Token piano
Token ID19132
Feature activation-1.00e+00

Pos logit contributions
 and+5.147
,+5.092
 the+5.024
 a+4.894
.+4.832
Neg logit contributions
umm-4.215
lor-4.031
rol-3.972
arians-3.967
rob-3.887
Token and
Token ID290
Feature activation-1.00e+00

Pos logit contributions
 and+2.113
 the+2.080
 a+1.958
,+1.942
.+1.653
Neg logit contributions
rol-7.029
lor-6.992
umm-6.987
arians-6.934
osity-6.755
Token around
Token ID1088
Feature activation-1.00e+00

Pos logit contributions
 and+4.986
,+4.823
.+4.726
 the+4.707
 a+4.704
Neg logit contributions
rol-4.260
umm-4.193
osity-4.137
arians-4.102
bek-3.981
Token the
Token ID262
Feature activation-1.00e+00

Pos logit contributions
 and+2.077
,+1.871
 the+1.835
 a+1.806
.+1.654
Neg logit contributions
rol-7.067
umm-7.061
lor-6.924
osity-6.849
arians-6.835
Token living
Token ID2877
Feature activation-1.00e+00

Pos logit contributions
 and+5.044
,+4.990
 the+4.880
 a+4.761
.+4.723
Neg logit contributions
umm-4.311
lor-4.106
rol-4.082
arians-4.073
vation-3.976
Token room
Token ID2119
Feature activation-1.00e+00

Pos logit contributions
 and+4.026
,+3.920
 the+3.803
 a+3.702
.+3.671
Neg logit contributions
lor-5.385
umm-5.365
rol-5.243
bek-5.116
arians-5.060
Token.
Token ID13
Feature activation-1.00e+00

Pos logit contributions
 and+1.626
 the+1.521
 a+1.388
,+1.371
.+1.125
Neg logit contributions
rol-7.563
umm-7.516
lor-7.443
arians-7.309
osity-7.304
Token
Token ID198
Feature activation-1.00e+00

Pos logit contributions
 and+4.250
,+4.082
 the+4.044
.+3.920
 a+3.890
Neg logit contributions
umm-5.178
lor-5.145
rol-5.118
osity-4.932
arians-4.893
Token
Token ID198
Feature activation-1.00e+00

Pos logit contributions
 and+2.045
,+1.877
 the+1.844
.+1.723
 a+1.690
Neg logit contributions
lor-7.328
umm-7.258
rol-7.251
bek-7.049
osity-7.044
TokenSoon
Token ID28093
Feature activation-1.00e+00

Pos logit contributions
 and+6.102
,+5.934
 the+5.914
.+5.791
 a+5.756
Neg logit contributions
umm-3.298
lor-3.274
rol-3.216
bek-3.058
osity-3.044
Token comes
Token ID2058
Feature activation-1.00e+00

Pos logit contributions
 and+3.705
 the+3.694
 a+3.580
,+3.578
.+3.549
Neg logit contributions
rol-5.299
lor-5.285
umm-5.215
arians-5.141
osity-5.134
Token and
Token ID290
Feature activation-1.00e+00

Pos logit contributions
 and+2.518
,+2.464
 the+2.423
 a+2.260
.+2.251
Neg logit contributions
rol-6.478
umm-6.304
osity-6.202
bek-6.194
lor-6.184
Token sees
Token ID7224
Feature activation-1.00e+00

Pos logit contributions
 and+3.643
,+3.494
 the+3.406
.+3.380
 a+3.307
Neg logit contributions
rol-5.546
lor-5.542
umm-5.524
bek-5.426
osity-5.403
Token the
Token ID262
Feature activation-1.00e+00

Pos logit contributions
 and+2.495
,+2.377
.+2.162
 the+2.110
 a+2.016
Neg logit contributions
rol-6.505
umm-6.440
ocol-6.432
lor-6.413
osity-6.307
Token tidy
Token ID43044
Feature activation-1.00e+00

Pos logit contributions
 and+4.514
,+4.452
 the+4.399
.+4.258
 a+4.170
Neg logit contributions
umm-4.843
rol-4.587
lor-4.556
arians-4.492
rob-4.433
Token living
Token ID2877
Feature activation-1.00e+00

Pos logit contributions
 and+4.664
,+4.506
 the+4.495
 a+4.464
.+4.342
Neg logit contributions
umm-4.658
rol-4.387
lor-4.335
osity-4.247
bek-4.202
Token room
Token ID2119
Feature activation-1.00e+00

Pos logit contributions
 and+3.871
,+3.751
 the+3.628
 a+3.588
.+3.508
Neg logit contributions
umm-5.556
lor-5.298
rol-5.242
bek-5.131
ocol-5.090
Token.
Token ID13
Feature activation-1.00e+00

Pos logit contributions
 and+1.711
 the+1.635
 a+1.533
,+1.514
.+1.298
Neg logit contributions
umm-7.407
rol-7.304
lor-7.273
arians-7.107
osity-7.100
Token She
Token ID1375
Feature activation-1.00e+00

Pos logit contributions
 and+3.596
,+3.442
 the+3.416
.+3.312
 a+3.274
Neg logit contributions
umm-5.768
lor-5.659
rol-5.615
osity-5.530
bek-5.439
Token smiles
Token ID21845
Feature activation-1.00e+00

Pos logit contributions
 and+4.177
 the+4.062
,+4.053
.+3.987
 a+3.937
Neg logit contributions
umm-4.986
lor-4.969
rol-4.957
arians-4.825
osity-4.814
Token and
Token ID290
Feature activation-1.00e+00

Pos logit contributions
 and+0.365
 the+0.313
,+0.259
 a+0.118
.+0.089
Neg logit contributions
umm-8.725
lor-8.661
rol-8.610
osity-8.558
arians-8.443
Tokenigo
Token ID14031
Feature activation-1.00e+00

Pos logit contributions
 and+6.451
 the+6.304
,+6.252
 a+6.158
.+6.108
Neg logit contributions
umm-3.006
lor-2.958
rol-2.879
bek-2.730
arians-2.727
Token,
Token ID11
Feature activation-1.00e+00

Pos logit contributions
 and+4.617
 the+4.578
 a+4.452
,+4.448
.+4.297
Neg logit contributions
lor-4.563
umm-4.463
rol-4.442
arians-4.375
rob-4.285
Token rush
Token ID10484
Feature activation-1.00e+00

Pos logit contributions
 and+4.086
,+3.983
 the+3.948
.+3.940
 a+3.857
Neg logit contributions
rol-5.081
lor-4.987
osity-4.952
umm-4.882
bek-4.842
Token into
Token ID656
Feature activation-1.00e+00

Pos logit contributions
 and+3.441
,+3.315
 the+3.288
 a+3.159
.+3.154
Neg logit contributions
rol-5.883
umm-5.779
lor-5.695
osity-5.587
bek-5.517
Token the
Token ID262
Feature activation-1.00e+00

Pos logit contributions
 and+2.410
,+2.314
.+2.056
 the+1.903
 a+1.792
Neg logit contributions
rol-6.727
ocol-6.636
lor-6.619
umm-6.598
arians-6.436
Token living
Token ID2877
Feature activation-1.00e+00

Pos logit contributions
 and+4.952
,+4.802
 the+4.688
.+4.623
 a+4.518
Neg logit contributions
umm-4.620
rol-4.336
lor-4.307
bek-4.191
rob-4.134
Token room
Token ID2119
Feature activation-1.00e+00

Pos logit contributions
 and+3.801
,+3.702
 the+3.549
 a+3.501
.+3.486
Neg logit contributions
umm-5.587
lor-5.570
rol-5.497
bek-5.297
isher-5.219
Token.
Token ID13
Feature activation-1.00e+00

Pos logit contributions
 the+1.471
 and+1.416
 a+1.284
,+1.256
.+1.066
Neg logit contributions
umm-7.634
rol-7.592
lor-7.555
arians-7.357
osity-7.348
Token Am
Token ID1703
Feature activation-1.00e+00

Pos logit contributions
 and+4.500
,+4.326
 the+4.308
.+4.181
 a+4.147
Neg logit contributions
umm-4.902
lor-4.882
rol-4.841
osity-4.664
arians-4.616
Tokenigo
Token ID14031
Feature activation-1.00e+00

Pos logit contributions
 and+6.912
 the+6.744
,+6.732
.+6.601
 a+6.589
Neg logit contributions
umm-2.511
lor-2.436
rol-2.392
bek-2.234
osity-2.226
Token started
Token ID2067
Feature activation-1.00e+00

Pos logit contributions
 and+3.266
 the+3.235
,+3.138
 a+3.131
.+3.094
Neg logit contributions
lor-5.810
umm-5.715
rol-5.712
arians-5.650
rob-5.594
Token with
Token ID351
Feature activation-1.00e+00

Pos logit contributions
 and+3.112
 the+2.983
,+2.950
.+2.779
 a+2.758
Neg logit contributions
lor-5.937
umm-5.917
rob-5.864
rol-5.838
arians-5.799
Token their
Token ID511
Feature activation-1.00e+00

Pos logit contributions
 and+4.150
,+3.977
.+3.807
 the+3.769
 a+3.642
Neg logit contributions
umm-4.919
rob-4.859
ocol-4.856
rol-4.823
arians-4.769
Token toys
Token ID14958
Feature activation-1.00e+00

Pos logit contributions
 and+4.220
,+4.053
 the+3.991
.+3.901
 a+3.819
Neg logit contributions
umm-5.077
arians-4.861
lor-4.856
rob-4.849
rol-4.832
Token in
Token ID287
Feature activation-1.00e+00

Pos logit contributions
 the+0.496
 and+0.434
 a+0.323
,+0.309
.+0.072
Neg logit contributions
arians-8.360
lor-8.337
umm-8.324
rol-8.272
rob-8.181
Token the
Token ID262
Feature activation-1.00e+00

Pos logit contributions
 and+1.219
,+0.988
.+0.781
 the+0.693
 a+0.634
Neg logit contributions
umm-7.850
lor-7.817
ocol-7.747
rol-7.743
rob-7.691
Token living
Token ID2877
Feature activation-1.00e+00

Pos logit contributions
 and+4.672
,+4.468
 the+4.377
.+4.326
 a+4.197
Neg logit contributions
umm-4.745
lor-4.585
rol-4.517
rob-4.426
bek-4.325
Token room
Token ID2119
Feature activation-1.00e+00

Pos logit contributions
 and+3.557
,+3.399
 the+3.309
.+3.210
 a+3.194
Neg logit contributions
lor-5.767
umm-5.766
rol-5.650
bek-5.541
rob-5.457
Token.
Token ID13
Feature activation-1.00e+00

Pos logit contributions
 the+0.559
 and+0.506
 a+0.354
,+0.342
.+0.036
Neg logit contributions
umm-8.529
lor-8.492
rol-8.396
arians-8.356
ides-8.282
Token But
Token ID887
Feature activation-1.00e+00

Pos logit contributions
 and+3.810
,+3.649
 the+3.607
.+3.497
 a+3.462
Neg logit contributions
umm-5.461
rol-5.388
lor-5.296
osity-5.196
arians-5.166
Token sometimes
Token ID3360
Feature activation-1.00e+00

Pos logit contributions
 and+2.684
.+2.423
 the+2.345
,+2.335
 a+2.251
Neg logit contributions
umm-6.334
rol-6.331
bek-6.274
lor-6.270
osity-6.257
Token they
Token ID484
Feature activation-1.00e+00

Pos logit contributions
 and+2.394
 the+2.081
.+2.050
 a+2.026
,+1.983
Neg logit contributions
umm-6.605
lor-6.553
arians-6.517
osity-6.479
rol-6.423
Token,
Token ID11
Feature activation-1.00e+00

Pos logit contributions
 and+4.423
 the+4.271
,+4.203
.+4.184
 a+4.158
Neg logit contributions
umm-4.916
lor-4.704
rol-4.668
bek-4.577
osity-4.551
Token thick
Token ID6546
Feature activation-1.00e+00

Pos logit contributions
 and+3.604
,+3.462
 the+3.424
.+3.343
 a+3.250
Neg logit contributions
umm-5.721
rol-5.549
lor-5.538
bek-5.472
osity-5.408
Token curtain
Token ID29461
Feature activation-1.00e+00

Pos logit contributions
 and+4.843
 the+4.732
,+4.644
.+4.600
 a+4.592
Neg logit contributions
umm-4.489
lor-4.333
rol-4.250
bek-4.139
osity-4.121
Token in
Token ID287
Feature activation-1.00e+00

Pos logit contributions
 and+2.545
 the+2.416
,+2.381
 a+2.253
.+2.216
Neg logit contributions
umm-6.797
lor-6.717
rol-6.692
osity-6.519
bek-6.498
Token the
Token ID262
Feature activation-1.00e+00

Pos logit contributions
 and+1.233
,+1.133
.+0.879
 the+0.804
 a+0.713
Neg logit contributions
umm-7.968
lor-7.899
rol-7.875
bek-7.696
rob-7.650
Token living
Token ID2877
Feature activation-1.00e+00

Pos logit contributions
 and+4.883
,+4.703
 the+4.674
.+4.529
 a+4.489
Neg logit contributions
umm-4.618
lor-4.481
rol-4.445
bek-4.235
osity-4.205
Token room
Token ID2119
Feature activation-1.00e+00

Pos logit contributions
 and+3.688
,+3.539
 the+3.460
.+3.362
 a+3.338
Neg logit contributions
umm-5.719
lor-5.692
rol-5.617
bek-5.455
osity-5.401
Token.
Token ID13
Feature activation-1.00e+00

Pos logit contributions
 and+0.568
 the+0.495
,+0.404
 a+0.318
.+0.145
Neg logit contributions
umm-8.687
lor-8.636
rol-8.574
arians-8.388
osity-8.378
Token She
Token ID1375
Feature activation-1.00e+00

Pos logit contributions
 and+3.701
,+3.511
 the+3.508
.+3.391
 a+3.350
Neg logit contributions
umm-5.692
lor-5.596
rol-5.565
osity-5.378
arians-5.361
Token loved
Token ID6151
Feature activation-1.00e+00

Pos logit contributions
 and+3.538
,+3.418
 the+3.409
.+3.322
 a+3.224
Neg logit contributions
lor-5.643
umm-5.616
rol-5.600
bek-5.539
arians-5.475
Token it
Token ID340
Feature activation-1.00e+00

Pos logit contributions
 and+2.872
,+2.758
.+2.619
 the+2.594
 a+2.515
Neg logit contributions
umm-6.309
rol-6.251
lor-6.202
osity-6.136
arians-6.095
Token Mom
Token ID11254
Feature activation-1.00e+00

Pos logit contributions
 and+2.466
,+2.235
 the+1.977
.+1.947
 a+1.896
Neg logit contributions
umm-6.704
rol-6.598
ocol-6.512
lor-6.512
arians-6.474
Tokenmy
Token ID1820
Feature activation-1.00e+00

Pos logit contributions
 and+3.642
 the+3.545
,+3.516
 a+3.453
.+3.307
Neg logit contributions
rol-5.513
umm-5.506
lor-5.451
arians-5.347
osity-5.296
Token coming
Token ID2406
Feature activation-1.00e+00

Pos logit contributions
 and+2.989
 the+2.914
,+2.873
 a+2.815
.+2.657
Neg logit contributions
umm-6.114
rol-6.080
lor-6.052
arians-5.954
osity-5.900
Token into
Token ID656
Feature activation-1.00e+00

Pos logit contributions
 and+2.037
,+1.885
 the+1.870
 a+1.771
.+1.648
Neg logit contributions
umm-7.100
rol-7.099
lor-6.910
arians-6.863
osity-6.834
Token the
Token ID262
Feature activation-1.00e+00

Pos logit contributions
 and+2.480
,+2.363
.+2.051
 the+1.976
 a+1.919
Neg logit contributions
umm-6.584
ocol-6.547
lor-6.486
rol-6.384
arians-6.356
Token living
Token ID2877
Feature activation-1.00e+00

Pos logit contributions
 and+4.540
,+4.377
 the+4.275
.+4.195
 a+4.148
Neg logit contributions
umm-4.894
lor-4.772
rol-4.685
rob-4.555
bek-4.504
Token room
Token ID2119
Feature activation-1.00e+00

Pos logit contributions
 and+3.703
,+3.561
 the+3.453
 a+3.435
.+3.347
Neg logit contributions
umm-5.658
lor-5.548
rol-5.445
bek-5.312
rob-5.304
Token.
Token ID13
Feature activation-1.00e+00

Pos logit contributions
 and+0.517
 the+0.505
 a+0.411
,+0.403
.+0.095
Neg logit contributions
umm-8.523
lor-8.431
rol-8.401
arians-8.250
bek-8.199
Token
Token ID198
Feature activation-1.00e+00

Pos logit contributions
 and+3.443
,+3.265
 the+3.249
.+3.113
 a+3.083
Neg logit contributions
umm-5.937
lor-5.814
rol-5.791
osity-5.627
arians-5.583
Token
Token ID198
Feature activation-1.00e+00

Pos logit contributions
 and+1.551
,+1.381
 the+1.341
.+1.205
 a+1.199
Neg logit contributions
lor-7.708
umm-7.658
rol-7.634
bek-7.483
osity-7.451
Token"
Token ID1
Feature activation-1.00e+00

Pos logit contributions
 and+4.307
,+4.124
 the+4.110
.+3.991
 a+3.960
Neg logit contributions
umm-5.052
lor-5.027
rol-4.970
bek-4.866
osity-4.812
Token with
Token ID351
Feature activation-1.00e+00

Pos logit contributions
 and+1.733
,+1.581
 the+1.537
.+1.459
 a+1.352
Neg logit contributions
umm-7.308
lor-7.285
rol-7.279
arians-7.093
rob-7.087
Token the
Token ID262
Feature activation-1.00e+00

Pos logit contributions
 and+2.161
,+1.965
.+1.809
 the+1.551
 a+1.415
Neg logit contributions
rol-7.103
umm-7.053
ocol-6.941
lor-6.902
arians-6.887
Token bell
Token ID8966
Feature activation-1.00e+00

Pos logit contributions
 and+4.893
,+4.755
 the+4.714
.+4.581
 a+4.520
Neg logit contributions
umm-4.542
rol-4.412
lor-4.318
rob-4.242
arians-4.229
Token in
Token ID287
Feature activation-1.00e+00

Pos logit contributions
 and+1.934
 the+1.891
,+1.793
 a+1.717
.+1.594
Neg logit contributions
umm-7.219
rol-7.103
lor-7.049
osity-6.967
arians-6.963
Token the
Token ID262
Feature activation-1.00e+00

Pos logit contributions
 and+1.877
,+1.669
.+1.439
 the+1.260
 a+1.186
Neg logit contributions
umm-7.301
rol-7.224
lor-7.216
ocol-7.117
bek-7.042
Token living
Token ID2877
Feature activation-1.00e+00

Pos logit contributions
 and+4.756
,+4.621
 the+4.509
.+4.414
 a+4.339
Neg logit contributions
umm-4.747
lor-4.578
rol-4.561
rob-4.408
bek-4.386
Token room
Token ID2119
Feature activation-1.00e+00

Pos logit contributions
 and+3.586
,+3.440
 the+3.338
.+3.253
 a+3.241
Neg logit contributions
lor-5.791
umm-5.791
rol-5.668
bek-5.566
rob-5.459
Token.
Token ID13
Feature activation-1.00e+00

Pos logit contributions
 and+0.977
 the+0.949
,+0.785
 a+0.760
.+0.552
Neg logit contributions
umm-8.184
rol-8.048
lor-8.018
arians-7.870
osity-7.828
Token She
Token ID1375
Feature activation-1.00e+00

Pos logit contributions
 and+3.987
,+3.815
 the+3.800
.+3.681
 a+3.639
Neg logit contributions
umm-5.415
lor-5.306
rol-5.295
osity-5.146
arians-5.109
Token saw
Token ID2497
Feature activation-1.00e+00

Pos logit contributions
 and+3.265
,+3.191
 the+3.172
.+3.119
 a+3.026
Neg logit contributions
rol-5.795
lor-5.710
umm-5.701
bek-5.641
arians-5.633
Token a
Token ID257
Feature activation-1.00e+00

Pos logit contributions
 and+1.905
,+1.732
.+1.654
 the+1.518
 a+1.310
Neg logit contributions
rol-7.338
umm-7.258
lor-7.236
osity-7.110
rob-7.087
Token and
Token ID290
Feature activation-1.00e+00

Pos logit contributions
 and+1.905
 the+1.870
 a+1.732
,+1.727
.+1.641
Neg logit contributions
lor-7.170
rol-7.120
umm-7.112
arians-7.048
osity-6.977
Token I
Token ID314
Feature activation-1.00e+00

Pos logit contributions
 and+4.045
,+3.882
 the+3.807
.+3.704
 a+3.588
Neg logit contributions
lor-4.995
umm-4.985
beh-4.940
vation-4.919
rol-4.911
Tokenâ
Token ID22940
Feature activation-1.00e+00

Pos logit contributions
 and+4.710
 the+4.685
,+4.575
 a+4.467
.+4.431
Neg logit contributions
rol-4.515
lor-4.497
umm-4.325
bek-4.284
ides-4.273
Token€
Token ID26391
Feature activation-1.00e+00

Pos logit contributions
 and+6.863
,+6.688
 the+6.685
.+6.553
 a+6.525
Neg logit contributions
umm-2.548
lor-2.464
rol-2.427
bek-2.261
osity-2.260
Tokenâ„¢
Token ID8151
Feature activation-1.00e+00

Pos logit contributions
 and+7.347
 the+7.197
,+7.151
 a+7.031
.+7.024
Neg logit contributions
lor-2.019
umm-2.016
rol-1.932
arians-1.782
bek-1.772
Tokenll
Token ID297
Feature activation-1.00e+00

Pos logit contributions
 and+5.534
 the+5.367
,+5.340
 a+5.212
.+5.212
Neg logit contributions
umm-3.827
lor-3.770
rol-3.762
bek-3.569
osity-3.556
Token walk
Token ID2513
Feature activation-1.00e+00

Pos logit contributions
 and+4.129
 the+4.128
,+4.047
 a+3.956
.+3.877
Neg logit contributions
rol-4.987
lor-4.922
ides-4.866
arians-4.826
umm-4.798
Token with
Token ID351
Feature activation-1.00e+00

Pos logit contributions
 and+2.583
 the+2.457
,+2.439
 a+2.306
.+2.271
Neg logit contributions
rol-6.576
lor-6.477
umm-6.455
bek-6.335
arians-6.315
Token you
Token ID345
Feature activation-1.00e+00

Pos logit contributions
 and+2.969
,+2.710
 the+2.532
.+2.521
 a+2.385
Neg logit contributions
lor-6.404
rol-6.259
umm-6.218
ocol-6.090
arians-6.028
Token around
Token ID1088
Feature activation-1.00e+00

Pos logit contributions
 and+2.945
 the+2.923
,+2.742
 a+2.707
.+2.555
Neg logit contributions
lor-6.198
arians-6.077
rol-6.071
ides-5.935
umm-5.908
Token the
Token ID262
Feature activation-1.00e+00

Pos logit contributions
 and+2.395
,+2.233
 a+2.144
 the+2.138
.+2.000
Neg logit contributions
lor-6.673
rol-6.595
umm-6.582
arians-6.528
ocol-6.440
Token noise
Token ID7838
Feature activation-1.00e+00

Pos logit contributions
 and+4.085
,+3.956
 the+3.855
.+3.768
 a+3.665
Neg logit contributions
umm-5.164
rol-5.036
rob-4.935
ocol-4.918
lor-4.903
Token and
Token ID290
Feature activation-1.00e+00

Pos logit contributions
 the+0.678
 and+0.632
,+0.531
 a+0.524
.+0.316
Neg logit contributions
umm-8.282
rol-8.218
lor-8.120
osity-8.045
arians-8.033
Token came
Token ID1625
Feature activation-1.00e+00

Pos logit contributions
 and+4.298
,+4.144
 the+3.967
.+3.926
 a+3.854
Neg logit contributions
rol-4.799
osity-4.689
umm-4.684
beh-4.595
lor-4.561
Token to
Token ID284
Feature activation-1.00e+00

Pos logit contributions
 and+2.398
,+2.333
 the+2.309
 a+2.111
.+2.090
Neg logit contributions
rol-6.722
umm-6.558
lor-6.463
osity-6.451
beh-6.433
Token the
Token ID262
Feature activation-1.00e+00

Pos logit contributions
 and+3.268
,+3.130
.+2.909
 the+2.820
 a+2.724
Neg logit contributions
umm-5.836
bek-5.743
rol-5.674
lor-5.663
ocol-5.633
Token living
Token ID2877
Feature activation-1.00e+00

Pos logit contributions
 and+4.788
,+4.668
 the+4.584
.+4.440
 a+4.394
Neg logit contributions
umm-4.637
lor-4.517
rol-4.392
arians-4.298
bek-4.256
Token room
Token ID2119
Feature activation-1.00e+00

Pos logit contributions
 and+3.557
,+3.439
 the+3.366
 a+3.264
.+3.221
Neg logit contributions
umm-5.790
lor-5.705
rol-5.574
bek-5.475
arians-5.423
Token.
Token ID13
Feature activation-1.00e+00

Pos logit contributions
 and+1.092
 the+1.046
 a+0.930
,+0.921
.+0.637
Neg logit contributions
umm-7.976
rol-7.926
lor-7.866
arians-7.737
osity-7.702
Token She
Token ID1375
Feature activation-1.00e+00

Pos logit contributions
 and+3.450
 the+3.295
,+3.279
.+3.146
 a+3.143
Neg logit contributions
umm-5.923
lor-5.825
rol-5.810
osity-5.640
bek-5.626
Token saw
Token ID2497
Feature activation-1.00e+00

Pos logit contributions
 and+3.174
,+3.108
 the+3.047
.+3.018
 a+2.907
Neg logit contributions
rol-5.994
lor-5.915
bek-5.903
umm-5.864
arians-5.789
Token the
Token ID262
Feature activation-1.00e+00

Pos logit contributions
 and+2.505
,+2.347
.+2.246
 the+2.151
 a+2.084
Neg logit contributions
rol-6.759
umm-6.670
lor-6.601
osity-6.484
rob-6.478
Token best
Token ID1266
Feature activation-1.00e+00

Pos logit contributions
 and+4.229
 the+4.166
,+4.042
 a+3.889
.+3.869
Neg logit contributions
umm-4.969
lor-4.807
arians-4.784
rol-4.773
rob-4.620
Token friend
Token ID1545
Feature activation-1.00e+00

Pos logit contributions
 the+3.526
 and+3.396
 a+3.258
,+3.237
.+3.042
Neg logit contributions
lor-5.750
umm-5.661
rol-5.635
arians-5.477
osity-5.448
Token a
Token ID257
Feature activation-1.00e+00

Pos logit contributions
 and+1.392
 the+1.377
,+1.199
 a+1.181
.+0.981
Neg logit contributions
lor-7.690
rol-7.612
umm-7.569
arians-7.499
bek-7.391
Token long
Token ID890
Feature activation-1.00e+00

Pos logit contributions
 and+4.772
 the+4.731
,+4.557
.+4.428
 a+4.420
Neg logit contributions
umm-4.536
lor-4.484
bek-4.345
rol-4.285
osity-4.234
Token time
Token ID640
Feature activation-1.00e+00

Pos logit contributions
 the+4.217
 and+4.126
 a+3.977
,+3.976
.+3.860
Neg logit contributions
lor-5.053
umm-4.893
rol-4.770
ides-4.739
bek-4.697
Token ago
Token ID2084
Feature activation-1.00e+00

Pos logit contributions
 and+5.051
 the+5.050
,+4.843
 a+4.823
.+4.574
Neg logit contributions
umm-3.997
rol-3.969
lor-3.937
osity-3.825
arians-3.811
Token.
Token ID13
Feature activation-1.00e+00

Pos logit contributions
 the+1.539
 and+1.490
 a+1.366
,+1.281
.+1.072
Neg logit contributions
lor-7.543
umm-7.503
rol-7.407
arians-7.354
ides-7.316
Token He
Token ID679
Feature activation-1.00e+00

Pos logit contributions
 and+5.224
 the+5.054
,+5.047
.+4.899
 a+4.853
Neg logit contributions
umm-4.217
lor-4.165
rol-4.116
osity-3.993
arians-3.938
Token was
Token ID373
Feature activation-1.00e+00

Pos logit contributions
 and+4.180
 the+4.110
,+4.043
 a+3.902
.+3.896
Neg logit contributions
lor-5.051
rol-4.982
umm-4.963
arians-4.863
bek-4.771
Token my
Token ID616
Feature activation-1.00e+00

Pos logit contributions
 and+3.743
,+3.599
 the+3.386
.+3.383
 a+3.151
Neg logit contributions
rol-5.524
lor-5.443
umm-5.363
ocol-5.272
arians-5.253
Token dog
Token ID3290
Feature activation-1.00e+00

Pos logit contributions
 and+4.295
 the+4.205
,+4.101
.+3.947
 a+3.942
Neg logit contributions
umm-5.080
lor-4.887
rol-4.876
arians-4.850
osity-4.747
Token with
Token ID351
Feature activation-1.00e+00

Pos logit contributions
 and+2.140
 the+1.994
,+1.973
.+1.816
 a+1.792
Neg logit contributions
umm-6.956
rol-6.911
lor-6.909
arians-6.717
rob-6.702
Token their
Token ID511
Feature activation-1.00e+00

Pos logit contributions
 and+2.299
,+2.049
.+1.924
 the+1.771
 a+1.648
Neg logit contributions
umm-7.054
rol-7.032
rob-6.888
ocol-6.875
lor-6.856
Token toys
Token ID14958
Feature activation-1.00e+00

Pos logit contributions
 and+4.121
,+3.956
 the+3.839
.+3.818
 a+3.681
Neg logit contributions
umm-5.221
arians-4.998
rob-4.964
rol-4.946
lor-4.929
Token in
Token ID287
Feature activation-1.00e+00

Pos logit contributions
 and+0.951
 the+0.904
,+0.799
 a+0.773
.+0.602
Neg logit contributions
umm-8.016
lor-7.962
rol-7.923
arians-7.897
rob-7.753
Token the
Token ID262
Feature activation-1.00e+00

Pos logit contributions
 and+1.154
,+0.936
.+0.770
 the+0.636
 a+0.576
Neg logit contributions
umm-8.052
lor-8.009
rol-7.994
ocol-7.876
rob-7.851
Token living
Token ID2877
Feature activation-1.00e+00

Pos logit contributions
 and+4.683
,+4.514
.+4.352
 the+4.332
 a+4.162
Neg logit contributions
umm-4.733
lor-4.572
rol-4.523
rob-4.408
bek-4.333
Token room
Token ID2119
Feature activation-1.00e+00

Pos logit contributions
 and+3.676
,+3.521
 the+3.361
.+3.320
 a+3.285
Neg logit contributions
umm-5.700
lor-5.651
rol-5.588
bek-5.432
rob-5.363
Token.
Token ID13
Feature activation-1.00e+00

Pos logit contributions
 the+0.599
 and+0.570
 a+0.401
,+0.382
.+0.137
Neg logit contributions
umm-8.522
lor-8.442
rol-8.409
arians-8.268
osity-8.187
Token They
Token ID1119
Feature activation-1.00e+00

Pos logit contributions
 and+3.719
,+3.545
 the+3.534
.+3.424
 a+3.395
Neg logit contributions
umm-5.636
rol-5.580
lor-5.493
osity-5.366
bek-5.315
Token liked
Token ID8288
Feature activation-1.00e+00

Pos logit contributions
 and+3.194
,+3.021
 the+3.020
.+2.891
 a+2.862
Neg logit contributions
umm-6.127
rol-6.068
lor-6.032
bek-5.899
osity-5.884
Token to
Token ID284
Feature activation-1.00e+00

Pos logit contributions
 and+1.534
,+1.435
.+1.315
 the+1.233
 a+1.130
Neg logit contributions
umm-7.571
rol-7.523
osity-7.431
lor-7.417
arians-7.352
Token got
Token ID1392
Feature activation-1.00e+00

Pos logit contributions
 and+4.358
 the+4.260
,+4.250
.+4.181
 a+4.128
Neg logit contributions
rol-4.811
lor-4.787
umm-4.746
bek-4.698
rob-4.646
Token closer
Token ID5699
Feature activation-1.00e+00

Pos logit contributions
 and+4.023
,+3.816
 the+3.723
.+3.718
 a+3.543
Neg logit contributions
rol-5.310
umm-5.183
lor-5.097
beh-5.019
osity-5.011
Token,
Token ID11
Feature activation-1.00e+00

Pos logit contributions
 and+1.177
 the+1.111
 a+1.000
.+0.952
,+0.939
Neg logit contributions
umm-7.718
lor-7.704
rol-7.665
osity-7.560
arians-7.415
Token he
Token ID339
Feature activation-1.00e+00

Pos logit contributions
 and+2.635
,+2.399
.+2.341
 the+2.203
 a+2.064
Neg logit contributions
umm-6.516
lor-6.455
rol-6.444
bek-6.330
osity-6.295
Token noticed
Token ID6810
Feature activation-1.00e+00

Pos logit contributions
 and+4.381
 the+4.296
,+4.274
.+4.222
 a+4.115
Neg logit contributions
lor-4.775
rol-4.688
umm-4.599
bek-4.583
arians-4.546
Token that
Token ID326
Feature activation-1.00e+00

Pos logit contributions
 and+2.211
,+2.029
.+1.884
 the+1.837
 a+1.625
Neg logit contributions
lor-6.930
rol-6.849
umm-6.767
rob-6.647
osity-6.642
Token the
Token ID262
Feature activation-1.00e+00

Pos logit contributions
 and+2.408
,+2.239
.+2.135
 the+1.977
 a+1.907
Neg logit contributions
umm-6.718
rol-6.683
lor-6.645
bek-6.512
osity-6.504
Token traffic
Token ID4979
Feature activation-1.00e+00

Pos logit contributions
 and+5.249
,+5.162
 the+5.118
.+4.954
 a+4.915
Neg logit contributions
umm-4.212
rol-3.991
arians-3.914
lor-3.781
plets-3.752
Token was
Token ID373
Feature activation-1.00e+00

Pos logit contributions
 and+3.458
,+3.341
 the+3.341
 a+3.262
.+3.157
Neg logit contributions
umm-5.704
rol-5.628
lor-5.516
arians-5.442
plets-5.422
Token increasing
Token ID3649
Feature activation-1.00e+00

Pos logit contributions
 and+4.473
,+4.356
 the+4.201
.+4.176
 a+4.004
Neg logit contributions
rol-4.735
umm-4.637
osity-4.458
lor-4.403
bek-4.366
Token.
Token ID13
Feature activation-1.00e+00

Pos logit contributions
 and+2.618
,+2.488
 the+2.483
 a+2.336
.+2.240
Neg logit contributions
umm-6.436
rol-6.402
lor-6.257
bek-6.149
osity-6.149
Token a
Token ID257
Feature activation-1.00e+00

Pos logit contributions
 and+0.980
,+0.796
 the+0.671
.+0.655
 a+0.428
Neg logit contributions
lor-8.395
umm-8.393
rol-8.332
rob-8.111
osity-8.108
Token switch
Token ID5078
Feature activation-1.00e+00

Pos logit contributions
 and+4.603
 the+4.439
,+4.381
.+4.315
 a+4.209
Neg logit contributions
umm-4.767
lor-4.655
bek-4.568
rol-4.516
osity-4.455
Token in
Token ID287
Feature activation-1.00e+00

Pos logit contributions
 and+2.040
 the+1.940
,+1.892
 a+1.766
.+1.699
Neg logit contributions
umm-7.207
lor-7.121
rol-7.114
osity-6.952
bek-6.927
Token her
Token ID607
Feature activation-1.00e+00

Pos logit contributions
 and+1.304
,+1.198
.+0.919
 the+0.884
 a+0.786
Neg logit contributions
umm-7.691
lor-7.653
rol-7.567
bek-7.520
ocol-7.436
Token mummy
Token ID38567
Feature activation-1.00e+00

Pos logit contributions
 and+4.584
 the+4.391
,+4.365
.+4.233
 a+4.203
Neg logit contributions
umm-4.779
lor-4.766
rol-4.504
arians-4.486
bek-4.480
Token's
Token ID338
Feature activation-1.00e+00

Pos logit contributions
 and+2.815
 the+2.741
,+2.636
 a+2.612
.+2.477
Neg logit contributions
umm-6.334
lor-6.204
rol-6.125
arians-6.058
osity-6.029
Token house
Token ID2156
Feature activation-1.00e+00

Pos logit contributions
 and+4.781
 the+4.674
,+4.616
 a+4.492
.+4.417
Neg logit contributions
umm-4.531
lor-4.452
rol-4.341
arians-4.281
bek-4.250
Token and
Token ID290
Feature activation-1.00e+00

Pos logit contributions
 and+0.643
 the+0.601
,+0.477
 a+0.442
.+0.195
Neg logit contributions
umm-8.499
lor-8.405
rol-8.396
osity-8.227
arians-8.206
Token tried
Token ID3088
Feature activation-1.00e+00

Pos logit contributions
 and+3.177
,+2.972
 the+2.917
.+2.906
 a+2.704
Neg logit contributions
lor-5.920
umm-5.870
rol-5.856
osity-5.810
bek-5.752
Token to
Token ID284
Feature activation-1.00e+00

Pos logit contributions
 and+1.380
,+1.262
 the+1.167
.+1.119
 a+1.013
Neg logit contributions
lor-7.565
osity-7.550
rol-7.546
umm-7.482
bek-7.435
Token press
Token ID1803
Feature activation-1.00e+00

Pos logit contributions
 and+4.384
,+4.291
 the+4.147
.+4.109
 a+3.960
Neg logit contributions
umm-4.666
rol-4.598
bek-4.584
arians-4.582
beh-4.565
Token back
Token ID736
Feature activation-1.00e+00

Pos logit contributions
 and+3.944
,+3.737
 the+3.661
.+3.474
 a+3.433
Neg logit contributions
umm-5.176
rol-5.105
lor-5.032
beh-4.973
ides-4.917
Token to
Token ID284
Feature activation-1.00e+00

Pos logit contributions
 and+3.492
,+3.364
 the+3.168
.+3.087
 a+2.981
Neg logit contributions
rol-5.585
lor-5.501
umm-5.436
beh-5.405
bek-5.361
Token their
Token ID511
Feature activation-1.00e+00

Pos logit contributions
 and+2.724
,+2.585
.+2.205
 the+2.095
 a+1.977
Neg logit contributions
umm-6.287
arians-6.229
ocol-6.221
bek-6.191
agna-6.104
Token home
Token ID1363
Feature activation-1.00e+00

Pos logit contributions
 and+4.550
 the+4.340
,+4.333
 a+4.209
.+4.179
Neg logit contributions
umm-4.883
arians-4.653
rol-4.604
lor-4.588
bek-4.561
Token.
Token ID13
Feature activation-1.00e+00

Pos logit contributions
 and+0.584
 the+0.562
,+0.438
 a+0.325
.+0.085
Neg logit contributions
rol-8.261
umm-8.250
lor-8.218
bek-8.211
arians-8.187
Token The
Token ID383
Feature activation-1.00e+00

Pos logit contributions
 and+3.151
,+2.980
 the+2.955
.+2.850
 a+2.784
Neg logit contributions
umm-6.182
rol-6.105
lor-6.091
osity-5.920
arians-5.913
Token ants
Token ID27842
Feature activation-1.00e+00

Pos logit contributions
 and+5.065
,+4.927
 the+4.891
.+4.785
 a+4.730
Neg logit contributions
umm-4.460
rol-4.138
lor-4.068
arians-4.042
bek-4.021
Token were
Token ID547
Feature activation-1.00e+00

Pos logit contributions
 and+3.258
 the+3.194
,+3.101
 a+3.069
.+3.054
Neg logit contributions
rol-5.880
lor-5.795
umm-5.764
arians-5.632
bek-5.620
Token very
Token ID845
Feature activation-1.00e+00

Pos logit contributions
 and+3.586
,+3.410
 the+3.351
.+3.322
 a+3.176
Neg logit contributions
umm-5.638
rol-5.638
lor-5.497
arians-5.428
osity-5.395
Token tough
Token ID5802
Feature activation-1.00e+00

Pos logit contributions
 and+3.693
 the+3.475
,+3.461
.+3.364
 a+3.351
Neg logit contributions
umm-5.676
rol-5.576
lor-5.467
bek-5.453
osity-5.420
Token and
Token ID290
Feature activation-1.00e+00

Pos logit contributions
 the+1.030
 and+0.961
 a+0.830
,+0.810
.+0.666
Neg logit contributions
umm-8.075
rol-7.908
lor-7.879
arians-7.873
unn-7.796
Token man
Token ID582
Feature activation-1.00e+00

Pos logit contributions
 and+3.694
 the+3.536
,+3.530
.+3.389
 a+3.365
Neg logit contributions
umm-5.705
lor-5.640
rol-5.470
bek-5.435
osity-5.424
Token told
Token ID1297
Feature activation-1.00e+00

Pos logit contributions
 and+3.697
 the+3.612
,+3.553
.+3.513
 a+3.464
Neg logit contributions
lor-5.433
rol-5.405
umm-5.343
bek-5.275
arians-5.256
Token Jon
Token ID5966
Feature activation-1.00e+00

Pos logit contributions
 and+5.798
,+5.616
 the+5.528
.+5.519
 a+5.445
Neg logit contributions
umm-3.470
lor-3.382
rol-3.381
rob-3.182
osity-3.129
Token that
Token ID326
Feature activation-1.00e+00

Pos logit contributions
 and+4.477
,+4.304
 the+4.301
.+4.168
 a+4.151
Neg logit contributions
umm-4.846
lor-4.824
rol-4.764
bek-4.621
osity-4.615
Token he
Token ID339
Feature activation-1.00e+00

Pos logit contributions
 and+2.763
,+2.600
.+2.534
 the+2.449
 a+2.405
Neg logit contributions
umm-6.410
lor-6.358
rol-6.308
osity-6.206
rob-6.199
Token wanted
Token ID2227
Feature activation-1.00e+00

Pos logit contributions
 and+3.605
 the+3.504
,+3.472
.+3.339
 a+3.312
Neg logit contributions
lor-5.583
umm-5.568
rol-5.549
arians-5.416
osity-5.399
Token him
Token ID683
Feature activation-1.00e+00

Pos logit contributions
 and+1.495
,+1.344
.+1.219
 the+1.183
 a+0.993
Neg logit contributions
umm-7.541
lor-7.523
rol-7.447
arians-7.319
osity-7.317
Token to
Token ID284
Feature activation-1.00e+00

Pos logit contributions
 and+1.433
 the+1.243
,+1.240
.+1.073
 a+0.977
Neg logit contributions
lor-7.726
rol-7.614
osity-7.448
umm-7.446
beh-7.399
Token fly
Token ID6129
Feature activation-1.00e+00

Pos logit contributions
 and+4.057
,+3.787
 the+3.771
.+3.642
 a+3.579
Neg logit contributions
lor-5.158
umm-4.897
rol-4.879
arians-4.855
ides-4.837
Token to
Token ID284
Feature activation-1.00e+00

Pos logit contributions
 and+2.525
,+2.388
 the+2.368
.+2.203
 a+2.178
Neg logit contributions
rol-6.424
lor-6.306
umm-6.216
bek-6.153
rob-6.079
Token the
Token ID262
Feature activation-1.00e+00

Pos logit contributions
 and+2.718
,+2.579
.+2.306
 the+2.001
 a+1.901
Neg logit contributions
lor-6.388
umm-6.385
rob-6.292
ocol-6.286
rol-6.240
Token
Token ID198
Feature activation-1.00e+00

Pos logit contributions
 and+0.565
 the+0.355
.+0.198
,+0.186
 a+0.164
Neg logit contributions
lor-8.601
rol-8.587
umm-8.574
osity-8.397
bek-8.342
Token
Token ID198
Feature activation-1.00e+00

Pos logit contributions
 and+4.704
,+4.556
 the+4.501
.+4.376
 a+4.314
Neg logit contributions
lor-4.762
rol-4.648
umm-4.590
bek-4.545
osity-4.407
TokenThe
Token ID464
Feature activation-1.00e+00

Pos logit contributions
 and+5.405
,+5.253
 the+5.214
.+5.095
 a+5.035
Neg logit contributions
lor-4.064
rol-3.961
umm-3.957
bek-3.891
arians-3.752
Token lake
Token ID13546
Feature activation-1.00e+00

Pos logit contributions
 and+4.895
,+4.722
 the+4.684
.+4.582
 a+4.476
Neg logit contributions
umm-4.563
lor-4.341
bek-4.302
arians-4.264
rol-4.263
Token was
Token ID373
Feature activation-1.00e+00

Pos logit contributions
 and+3.195
 the+3.078
,+3.029
 a+3.006
.+2.977
Neg logit contributions
umm-6.002
rol-5.972
lor-5.912
bek-5.749
osity-5.717
Token so
Token ID523
Feature activation-1.00e+00

Pos logit contributions
 and+3.657
,+3.497
.+3.430
 the+3.430
 a+3.197
Neg logit contributions
rol-5.630
umm-5.393
lor-5.272
osity-5.220
rim-5.113
Token big
Token ID1263
Feature activation-1.00e+00

Pos logit contributions
 and+3.564
,+3.367
 the+3.280
.+3.270
 a+3.152
Neg logit contributions
umm-5.748
rol-5.736
osity-5.508
bek-5.452
lor-5.415
Token and
Token ID290
Feature activation-1.00e+00

Pos logit contributions
 and+1.534
 the+1.493
 a+1.415
,+1.386
.+1.297
Neg logit contributions
umm-7.649
lor-7.409
rol-7.309
rob-7.265
arians-7.253
Token there
Token ID612
Feature activation-1.00e+00

Pos logit contributions
 and+4.123
,+3.933
.+3.845
 the+3.624
 a+3.551
Neg logit contributions
umm-5.274
bek-5.187
rol-5.184
osity-5.012
lor-4.941
Token were
Token ID547
Feature activation-1.00e+00

Pos logit contributions
 and+2.363
 the+2.206
,+2.193
.+2.103
 a+2.081
Neg logit contributions
rol-6.776
lor-6.706
umm-6.645
bek-6.481
osity-6.445
Token lots
Token ID6041
Feature activation-1.00e+00

Pos logit contributions
 and+3.719
,+3.569
.+3.492
 the+3.364
 a+3.179
Neg logit contributions
rol-5.785
umm-5.642
lor-5.438
osity-5.347
arians-5.231