TOP ACTIVATIONS
MAX = 1.000

Tom went to his bed and hid under the covers .
Tom went to his bed and hid under the covers
. Tom went to his bed and hid under
Tom went to his bed and hid under the
talking , table ?" The table did not say anything either
you talking , table ?" The table did not say anything
, table ?" The table did not say anything either .
table ?" The table did not say anything either . Tom
either . Tom was scared . Tom went to
anything either . Tom was scared . Tom went
not say anything either . Tom was scared .
say anything either . Tom was scared . Tom
Tom was scared . Tom went to his bed
. Tom was scared . Tom went to his
was scared . Tom went to his bed and
scared . Tom went to his bed and hid
did not say anything either . Tom was scared .
table did not say anything either . Tom was scared .
?" The table did not say anything either . Tom was
The table did not say anything either . Tom was scared

MIN ACTIVATIONS
MIN = 1.000

Tom went to his bed and hid under the covers .
Tom went to his bed and hid under the covers
. Tom went to his bed and hid under
Tom went to his bed and hid under the
talking , table ?" The table did not say anything either
you talking , table ?" The table did not say anything
, table ?" The table did not say anything either .
table ?" The table did not say anything either . Tom
either . Tom was scared . Tom went to
anything either . Tom was scared . Tom went
not say anything either . Tom was scared .
say anything either . Tom was scared . Tom
Tom was scared . Tom went to his bed
. Tom was scared . Tom went to his
was scared . Tom went to his bed and
scared . Tom went to his bed and hid
did not say anything either . Tom was scared .
table did not say anything either . Tom was scared .
?" The table did not say anything either . Tom was
The table did not say anything either . Tom was scared

INTERVAL 1.000 - 1.000
CONTAINS 0.000%

TokenTom
Token ID13787
Feature activation+1.00e+00

Pos logit contributions
 and+3.613
,+3.572
.+3.374
 the+3.334
 a+3.174
Neg logit contributions
lor-4.887
umm-4.823
 former-4.745
ocol-4.730
arians-4.703
Token went
Token ID1816
Feature activation+1.00e+00

Pos logit contributions
,+2.868
 and+2.864
.+2.715
 the+2.684
 a+2.522
Neg logit contributions
lor-5.440
umm-5.354
arians-5.308
 former-5.307
ocol-5.243
Token to
Token ID284
Feature activation+1.00e+00

Pos logit contributions
 and+1.787
,+1.760
.+1.562
 the+1.528
 a+1.375
Neg logit contributions
lor-6.580
umm-6.527
 former-6.482
ocol-6.451
arians-6.428
Token his
Token ID465
Feature activation+1.00e+00

Pos logit contributions
 and+2.282
,+2.249
.+2.041
 the+1.910
 a+1.790
Neg logit contributions
lor-6.056
umm-6.045
 former-6.040
ocol-6.032
arians-5.941
Token bed
Token ID3996
Feature activation+1.00e+00

Pos logit contributions
 and+3.694
,+3.665
.+3.474
 the+3.445
 a+3.274
Neg logit contributions
umm-4.693
 former-4.681
lor-4.647
arians-4.565
ocol-4.536
Token and
Token ID290
Feature activation+1.00e+00

Pos logit contributions
 and+1.096
,+1.073
 the+0.906
.+0.874
 a+0.735
Neg logit contributions
lor-7.260
umm-7.214
 former-7.169
arians-7.074
ocol-7.051
Token hid
Token ID24519
Feature activation+1.00e+00

Pos logit contributions
 and+3.485
,+3.462
.+3.277
 the+3.180
 a+3.024
Neg logit contributions
lor-4.871
 former-4.818
umm-4.776
ocol-4.726
arians-4.710
Token under
Token ID739
Feature activation+1.00e+00

Pos logit contributions
 and+2.879
,+2.857
.+2.609
 the+2.575
 a+2.443
Neg logit contributions
umm-5.473
lor-5.457
ocol-5.448
 former-5.383
arians-5.336
Token the
Token ID262
Feature activation+1.00e+00

Pos logit contributions
 and+1.456
,+1.420
.+1.206
 the+1.091
 a+0.939
Neg logit contributions
umm-6.918
lor-6.914
ocol-6.888
 former-6.856
arians-6.745
Token covers
Token ID8698
Feature activation+1.00e+00

Pos logit contributions
 and+4.343
,+4.330
.+4.109
 the+4.103
 a+3.960
Neg logit contributions
umm-4.028
lor-4.008
ocol-3.948
 former-3.912
arians-3.823
Token.
Token ID13
Feature activation+1.00e+00

Pos logit contributions
 and+0.721
,+0.694
 the+0.467
.+0.454
 a+0.332
Neg logit contributions
lor-7.660
umm-7.640
ocol-7.531
 former-7.513
arians-7.458
Token
Token ID198
Feature activation+1.00e+00

Pos logit contributions
 and+1.358
,+1.322
.+1.102
 the+1.061
 a+0.913
Neg logit contributions
lor-7.070
umm-6.971
 former-6.928
ocol-6.912
arians-6.845
TokenTom
Token ID13787
Feature activation+1.00e+00

Pos logit contributions
 and+3.613
,+3.572
.+3.374
 the+3.334
 a+3.174
Neg logit contributions
lor-4.887
umm-4.823
 former-4.745
ocol-4.730
arians-4.703
Token went
Token ID1816
Feature activation+1.00e+00

Pos logit contributions
,+2.868
 and+2.864
.+2.715
 the+2.684
 a+2.522
Neg logit contributions
lor-5.440
umm-5.354
arians-5.308
 former-5.307
ocol-5.243
Token to
Token ID284
Feature activation+1.00e+00

Pos logit contributions
 and+1.787
,+1.760
.+1.562
 the+1.528
 a+1.375
Neg logit contributions
lor-6.580
umm-6.527
 former-6.482
ocol-6.451
arians-6.428
Token his
Token ID465
Feature activation+1.00e+00

Pos logit contributions
 and+2.282
,+2.249
.+2.041
 the+1.910
 a+1.790
Neg logit contributions
lor-6.056
umm-6.045
 former-6.040
ocol-6.032
arians-5.941
Token bed
Token ID3996
Feature activation+1.00e+00

Pos logit contributions
 and+3.694
,+3.665
.+3.474
 the+3.445
 a+3.274
Neg logit contributions
umm-4.693
 former-4.681
lor-4.647
arians-4.565
ocol-4.536
Token and
Token ID290
Feature activation+1.00e+00

Pos logit contributions
 and+1.096
,+1.073
 the+0.906
.+0.874
 a+0.735
Neg logit contributions
lor-7.260
umm-7.214
 former-7.169
arians-7.074
ocol-7.051
Token hid
Token ID24519
Feature activation+1.00e+00

Pos logit contributions
 and+3.485
,+3.462
.+3.277
 the+3.180
 a+3.024
Neg logit contributions
lor-4.871
 former-4.818
umm-4.776
ocol-4.726
arians-4.710
Token under
Token ID739
Feature activation+1.00e+00

Pos logit contributions
 and+2.879
,+2.857
.+2.609
 the+2.575
 a+2.443
Neg logit contributions
umm-5.473
lor-5.457
ocol-5.448
 former-5.383
arians-5.336
Token the
Token ID262
Feature activation+1.00e+00

Pos logit contributions
 and+1.456
,+1.420
.+1.206
 the+1.091
 a+0.939
Neg logit contributions
umm-6.918
lor-6.914
ocol-6.888
 former-6.856
arians-6.745
Token covers
Token ID8698
Feature activation+1.00e+00

Pos logit contributions
 and+4.343
,+4.330
.+4.109
 the+4.103
 a+3.960
Neg logit contributions
umm-4.028
lor-4.008
ocol-3.948
 former-3.912
arians-3.823
Token.
Token ID13
Feature activation+1.00e+00

Pos logit contributions
 and+0.478
,+0.447
 the+0.272
.+0.243
 a+0.138
Neg logit contributions
umm-7.848
lor-7.766
ocol-7.746
 former-7.691
arians-7.646
Token
Token ID198
Feature activation+1.00e+00

Pos logit contributions
 and+2.992
,+2.955
.+2.740
 the+2.699
 a+2.534
Neg logit contributions
umm-5.440
lor-5.430
 former-5.347
ocol-5.304
arians-5.268
Token
Token ID198
Feature activation+1.00e+00

Pos logit contributions
 and+1.358
,+1.322
.+1.102
 the+1.061
 a+0.913
Neg logit contributions
lor-7.070
umm-6.971
 former-6.928
ocol-6.912
arians-6.845
TokenTom
Token ID13787
Feature activation+1.00e+00

Pos logit contributions
 and+3.613
,+3.572
.+3.374
 the+3.334
 a+3.174
Neg logit contributions
lor-4.887
umm-4.823
 former-4.745
ocol-4.730
arians-4.703
Token went
Token ID1816
Feature activation+1.00e+00

Pos logit contributions
,+2.868
 and+2.864
.+2.715
 the+2.684
 a+2.522
Neg logit contributions
lor-5.440
umm-5.354
arians-5.308
 former-5.307
ocol-5.243
Token to
Token ID284
Feature activation+1.00e+00

Pos logit contributions
 and+1.787
,+1.760
.+1.562
 the+1.528
 a+1.375
Neg logit contributions
lor-6.580
umm-6.527
 former-6.482
ocol-6.451
arians-6.428
Token his
Token ID465
Feature activation+1.00e+00

Pos logit contributions
 and+2.282
,+2.249
.+2.041
 the+1.910
 a+1.790
Neg logit contributions
lor-6.056
umm-6.045
 former-6.040
ocol-6.032
arians-5.941
Token bed
Token ID3996
Feature activation+1.00e+00

Pos logit contributions
 and+3.694
,+3.665
.+3.474
 the+3.445
 a+3.274
Neg logit contributions
umm-4.693
 former-4.681
lor-4.647
arians-4.565
ocol-4.536
Token and
Token ID290
Feature activation+1.00e+00

Pos logit contributions
 and+1.096
,+1.073
 the+0.906
.+0.874
 a+0.735
Neg logit contributions
lor-7.260
umm-7.214
 former-7.169
arians-7.074
ocol-7.051
Token hid
Token ID24519
Feature activation+1.00e+00

Pos logit contributions
 and+3.485
,+3.462
.+3.277
 the+3.180
 a+3.024
Neg logit contributions
lor-4.871
 former-4.818
umm-4.776
ocol-4.726
arians-4.710
Token under
Token ID739
Feature activation+1.00e+00

Pos logit contributions
 and+2.879
,+2.857
.+2.609
 the+2.575
 a+2.443
Neg logit contributions
umm-5.473
lor-5.457
ocol-5.448
 former-5.383
arians-5.336
Token
Token ID198
Feature activation+1.00e+00

Pos logit contributions
 and+2.992
,+2.955
.+2.740
 the+2.699
 a+2.534
Neg logit contributions
umm-5.440
lor-5.430
 former-5.347
ocol-5.304
arians-5.268
Token
Token ID198
Feature activation+1.00e+00

Pos logit contributions
 and+1.358
,+1.322
.+1.102
 the+1.061
 a+0.913
Neg logit contributions
lor-7.070
umm-6.971
 former-6.928
ocol-6.912
arians-6.845
TokenTom
Token ID13787
Feature activation+1.00e+00

Pos logit contributions
 and+3.613
,+3.572
.+3.374
 the+3.334
 a+3.174
Neg logit contributions
lor-4.887
umm-4.823
 former-4.745
ocol-4.730
arians-4.703
Token went
Token ID1816
Feature activation+1.00e+00

Pos logit contributions
,+2.868
 and+2.864
.+2.715
 the+2.684
 a+2.522
Neg logit contributions
lor-5.440
umm-5.354
arians-5.308
 former-5.307
ocol-5.243
Token to
Token ID284
Feature activation+1.00e+00

Pos logit contributions
 and+1.787
,+1.760
.+1.562
 the+1.528
 a+1.375
Neg logit contributions
lor-6.580
umm-6.527
 former-6.482
ocol-6.451
arians-6.428
Token his
Token ID465
Feature activation+1.00e+00

Pos logit contributions
 and+2.282
,+2.249
.+2.041
 the+1.910
 a+1.790
Neg logit contributions
lor-6.056
umm-6.045
 former-6.040
ocol-6.032
arians-5.941
Token bed
Token ID3996
Feature activation+1.00e+00

Pos logit contributions
 and+3.694
,+3.665
.+3.474
 the+3.445
 a+3.274
Neg logit contributions
umm-4.693
 former-4.681
lor-4.647
arians-4.565
ocol-4.536
Token and
Token ID290
Feature activation+1.00e+00

Pos logit contributions
 and+1.096
,+1.073
 the+0.906
.+0.874
 a+0.735
Neg logit contributions
lor-7.260
umm-7.214
 former-7.169
arians-7.074
ocol-7.051
Token hid
Token ID24519
Feature activation+1.00e+00

Pos logit contributions
 and+3.485
,+3.462
.+3.277
 the+3.180
 a+3.024
Neg logit contributions
lor-4.871
 former-4.818
umm-4.776
ocol-4.726
arians-4.710
Token under
Token ID739
Feature activation+1.00e+00

Pos logit contributions
 and+2.879
,+2.857
.+2.609
 the+2.575
 a+2.443
Neg logit contributions
umm-5.473
lor-5.457
ocol-5.448
 former-5.383
arians-5.336
Token the
Token ID262
Feature activation+1.00e+00

Pos logit contributions
 and+1.456
,+1.420
.+1.206
 the+1.091
 a+0.939
Neg logit contributions
umm-6.918
lor-6.914
ocol-6.888
 former-6.856
arians-6.745
Token talking
Token ID3375
Feature activation+1.00e+00

Pos logit contributions
 and+3.632
,+3.569
.+3.383
 the+3.378
 a+3.203
Neg logit contributions
lor-4.728
umm-4.678
 former-4.668
ocol-4.616
arians-4.584
Token,
Token ID11
Feature activation+1.00e+00

Pos logit contributions
 and+2.690
,+2.673
 the+2.471
.+2.462
 a+2.293
Neg logit contributions
lor-5.655
umm-5.621
 former-5.582
ocol-5.509
arians-5.499
Token table
Token ID3084
Feature activation+1.00e+00

Pos logit contributions
 and+3.617
,+3.607
.+3.444
 the+3.371
 a+3.223
Neg logit contributions
umm-4.713
lor-4.680
 former-4.671
ocol-4.623
arians-4.581
Token?"
Token ID1701
Feature activation+1.00e+00

Pos logit contributions
 and+2.056
,+2.020
 the+1.860
.+1.835
 a+1.686
Neg logit contributions
umm-6.270
lor-6.266
 former-6.171
ocol-6.146
arians-6.102
Token The
Token ID383
Feature activation+1.00e+00

Pos logit contributions
 and+3.038
,+2.992
.+2.806
 the+2.737
 a+2.598
Neg logit contributions
lor-5.312
umm-5.295
 former-5.254
ocol-5.246
arians-5.169
Token table
Token ID3084
Feature activation+1.00e+00

Pos logit contributions
 and+4.103
,+4.077
.+3.873
 the+3.829
 a+3.657
Neg logit contributions
umm-4.311
lor-4.243
 former-4.229
ocol-4.228
arians-4.144
Token did
Token ID750
Feature activation+1.00e+00

Pos logit contributions
 and+3.665
,+3.632
.+3.453
 the+3.443
 a+3.274
Neg logit contributions
lor-4.683
umm-4.664
 former-4.567
ocol-4.556
arians-4.502
Token not
Token ID407
Feature activation+1.00e+00

Pos logit contributions
 and+2.268
,+2.221
.+2.029
 the+2.002
 a+1.829
Neg logit contributions
lor-6.089
umm-6.036
 former-6.032
ocol-5.985
arians-5.927
Token say
Token ID910
Feature activation+1.00e+00

Pos logit contributions
 and+3.793
,+3.758
.+3.557
 the+3.542
 a+3.366
Neg logit contributions
umm-4.589
lor-4.535
ocol-4.522
 former-4.512
arians-4.432
Token anything
Token ID1997
Feature activation+1.00e+00

Pos logit contributions
 and+4.064
,+3.968
.+3.795
 the+3.748
 a+3.585
Neg logit contributions
lor-4.377
umm-4.318
ocol-4.264
 former-4.230
arians-4.176
Token either
Token ID2035
Feature activation+1.00e+00

Pos logit contributions
 and+1.779
,+1.737
 the+1.538
.+1.530
 a+1.393
Neg logit contributions
lor-6.547
umm-6.529
 former-6.460
ocol-6.444
arians-6.411
Token you
Token ID345
Feature activation+1.00e+00

Pos logit contributions
 and+3.147
,+3.126
.+2.959
 the+2.914
 a+2.783
Neg logit contributions
lor-5.146
umm-5.134
arians-5.074
 former-5.059
ocol-5.030
Token talking
Token ID3375
Feature activation+1.00e+00

Pos logit contributions
 and+3.632
,+3.569
.+3.383
 the+3.378
 a+3.203
Neg logit contributions
lor-4.728
umm-4.678
 former-4.668
ocol-4.616
arians-4.584
Token,
Token ID11
Feature activation+1.00e+00

Pos logit contributions
 and+2.690
,+2.673
 the+2.471
.+2.462
 a+2.293
Neg logit contributions
lor-5.655
umm-5.621
 former-5.582
ocol-5.509
arians-5.499
Token table
Token ID3084
Feature activation+1.00e+00

Pos logit contributions
 and+3.617
,+3.607
.+3.444
 the+3.371
 a+3.223
Neg logit contributions
umm-4.713
lor-4.680
 former-4.671
ocol-4.623
arians-4.581
Token?"
Token ID1701
Feature activation+1.00e+00

Pos logit contributions
 and+2.056
,+2.020
 the+1.860
.+1.835
 a+1.686
Neg logit contributions
umm-6.270
lor-6.266
 former-6.171
ocol-6.146
arians-6.102
Token The
Token ID383
Feature activation+1.00e+00

Pos logit contributions
 and+3.038
,+2.992
.+2.806
 the+2.737
 a+2.598
Neg logit contributions
lor-5.312
umm-5.295
 former-5.254
ocol-5.246
arians-5.169
Token table
Token ID3084
Feature activation+1.00e+00

Pos logit contributions
 and+4.103
,+4.077
.+3.873
 the+3.829
 a+3.657
Neg logit contributions
umm-4.311
lor-4.243
 former-4.229
ocol-4.228
arians-4.144
Token did
Token ID750
Feature activation+1.00e+00

Pos logit contributions
 and+3.665
,+3.632
.+3.453
 the+3.443
 a+3.274
Neg logit contributions
lor-4.683
umm-4.664
 former-4.567
ocol-4.556
arians-4.502
Token not
Token ID407
Feature activation+1.00e+00

Pos logit contributions
 and+2.268
,+2.221
.+2.029
 the+2.002
 a+1.829
Neg logit contributions
lor-6.089
umm-6.036
 former-6.032
ocol-5.985
arians-5.927
Token say
Token ID910
Feature activation+1.00e+00

Pos logit contributions
 and+3.793
,+3.758
.+3.557
 the+3.542
 a+3.366
Neg logit contributions
umm-4.589
lor-4.535
ocol-4.522
 former-4.512
arians-4.432
Token anything
Token ID1997
Feature activation+1.00e+00

Pos logit contributions
 and+4.064
,+3.968
.+3.795
 the+3.748
 a+3.585
Neg logit contributions
lor-4.377
umm-4.318
ocol-4.264
 former-4.230
arians-4.176
Token,
Token ID11
Feature activation+1.00e+00

Pos logit contributions
 and+2.690
,+2.673
 the+2.471
.+2.462
 a+2.293
Neg logit contributions
lor-5.655
umm-5.621
 former-5.582
ocol-5.509
arians-5.499
Token table
Token ID3084
Feature activation+1.00e+00

Pos logit contributions
 and+3.617
,+3.607
.+3.444
 the+3.371
 a+3.223
Neg logit contributions
umm-4.713
lor-4.680
 former-4.671
ocol-4.623
arians-4.581
Token?"
Token ID1701
Feature activation+1.00e+00

Pos logit contributions
 and+2.056
,+2.020
 the+1.860
.+1.835
 a+1.686
Neg logit contributions
umm-6.270
lor-6.266
 former-6.171
ocol-6.146
arians-6.102
Token The
Token ID383
Feature activation+1.00e+00

Pos logit contributions
 and+3.038
,+2.992
.+2.806
 the+2.737
 a+2.598
Neg logit contributions
lor-5.312
umm-5.295
 former-5.254
ocol-5.246
arians-5.169
Token table
Token ID3084
Feature activation+1.00e+00

Pos logit contributions
 and+4.103
,+4.077
.+3.873
 the+3.829
 a+3.657
Neg logit contributions
umm-4.311
lor-4.243
 former-4.229
ocol-4.228
arians-4.144
Token did
Token ID750
Feature activation+1.00e+00

Pos logit contributions
 and+3.665
,+3.632
.+3.453
 the+3.443
 a+3.274
Neg logit contributions
lor-4.683
umm-4.664
 former-4.567
ocol-4.556
arians-4.502
Token not
Token ID407
Feature activation+1.00e+00

Pos logit contributions
 and+2.268
,+2.221
.+2.029
 the+2.002
 a+1.829
Neg logit contributions
lor-6.089
umm-6.036
 former-6.032
ocol-5.985
arians-5.927
Token say
Token ID910
Feature activation+1.00e+00

Pos logit contributions
 and+3.793
,+3.758
.+3.557
 the+3.542
 a+3.366
Neg logit contributions
umm-4.589
lor-4.535
ocol-4.522
 former-4.512
arians-4.432
Token anything
Token ID1997
Feature activation+1.00e+00

Pos logit contributions
 and+4.064
,+3.968
.+3.795
 the+3.748
 a+3.585
Neg logit contributions
lor-4.377
umm-4.318
ocol-4.264
 former-4.230
arians-4.176
Token either
Token ID2035
Feature activation+1.00e+00

Pos logit contributions
 and+1.779
,+1.737
 the+1.538
.+1.530
 a+1.393
Neg logit contributions
lor-6.547
umm-6.529
 former-6.460
ocol-6.444
arians-6.411
Token.
Token ID13
Feature activation+1.00e+00

Pos logit contributions
 and+0.833
,+0.783
 the+0.612
.+0.580
 a+0.464
Neg logit contributions
lor-7.493
umm-7.479
ocol-7.387
 former-7.354
arians-7.321
Token table
Token ID3084
Feature activation+1.00e+00

Pos logit contributions
 and+3.617
,+3.607
.+3.444
 the+3.371
 a+3.223
Neg logit contributions
umm-4.713
lor-4.680
 former-4.671
ocol-4.623
arians-4.581
Token?"
Token ID1701
Feature activation+1.00e+00

Pos logit contributions
 and+2.056
,+2.020
 the+1.860
.+1.835
 a+1.686
Neg logit contributions
umm-6.270
lor-6.266
 former-6.171
ocol-6.146
arians-6.102
Token The
Token ID383
Feature activation+1.00e+00

Pos logit contributions
 and+3.038
,+2.992
.+2.806
 the+2.737
 a+2.598
Neg logit contributions
lor-5.312
umm-5.295
 former-5.254
ocol-5.246
arians-5.169
Token table
Token ID3084
Feature activation+1.00e+00

Pos logit contributions
 and+4.103
,+4.077
.+3.873
 the+3.829
 a+3.657
Neg logit contributions
umm-4.311
lor-4.243
 former-4.229
ocol-4.228
arians-4.144
Token did
Token ID750
Feature activation+1.00e+00

Pos logit contributions
 and+3.665
,+3.632
.+3.453
 the+3.443
 a+3.274
Neg logit contributions
lor-4.683
umm-4.664
 former-4.567
ocol-4.556
arians-4.502
Token not
Token ID407
Feature activation+1.00e+00

Pos logit contributions
 and+2.268
,+2.221
.+2.029
 the+2.002
 a+1.829
Neg logit contributions
lor-6.089
umm-6.036
 former-6.032
ocol-5.985
arians-5.927
Token say
Token ID910
Feature activation+1.00e+00

Pos logit contributions
 and+3.793
,+3.758
.+3.557
 the+3.542
 a+3.366
Neg logit contributions
umm-4.589
lor-4.535
ocol-4.522
 former-4.512
arians-4.432
Token anything
Token ID1997
Feature activation+1.00e+00

Pos logit contributions
 and+4.064
,+3.968
.+3.795
 the+3.748
 a+3.585
Neg logit contributions
lor-4.377
umm-4.318
ocol-4.264
 former-4.230
arians-4.176
Token either
Token ID2035
Feature activation+1.00e+00

Pos logit contributions
 and+1.779
,+1.737
 the+1.538
.+1.530
 a+1.393
Neg logit contributions
lor-6.547
umm-6.529
 former-6.460
ocol-6.444
arians-6.411
Token.
Token ID13
Feature activation+1.00e+00

Pos logit contributions
 and+0.833
,+0.783
 the+0.612
.+0.580
 a+0.464
Neg logit contributions
lor-7.493
umm-7.479
ocol-7.387
 former-7.354
arians-7.321
Token Tom
Token ID4186
Feature activation+1.00e+00

Pos logit contributions
 and+2.864
,+2.834
.+2.627
 the+2.577
 a+2.416
Neg logit contributions
lor-5.563
umm-5.531
 former-5.472
ocol-5.440
arians-5.389
Token either
Token ID2035
Feature activation+1.00e+00

Pos logit contributions
 and+1.779
,+1.737
 the+1.538
.+1.530
 a+1.393
Neg logit contributions
lor-6.547
umm-6.529
 former-6.460
ocol-6.444
arians-6.411
Token.
Token ID13
Feature activation+1.00e+00

Pos logit contributions
 and+0.833
,+0.783
 the+0.612
.+0.580
 a+0.464
Neg logit contributions
lor-7.493
umm-7.479
ocol-7.387
 former-7.354
arians-7.321
Token Tom
Token ID4186
Feature activation+1.00e+00

Pos logit contributions
 and+2.864
,+2.834
.+2.627
 the+2.577
 a+2.416
Neg logit contributions
lor-5.563
umm-5.531
 former-5.472
ocol-5.440
arians-5.389
Token was
Token ID373
Feature activation+1.00e+00

Pos logit contributions
,+2.908
 and+2.902
.+2.743
 the+2.710
 a+2.547
Neg logit contributions
lor-5.398
umm-5.341
 former-5.304
arians-5.269
ocol-5.231
Token scared
Token ID12008
Feature activation+1.00e+00

Pos logit contributions
 and+3.567
,+3.548
.+3.345
 the+3.276
 a+3.081
Neg logit contributions
lor-4.770
 former-4.758
umm-4.752
ocol-4.709
arians-4.641
Token.
Token ID13
Feature activation+1.00e+00

Pos logit contributions
 and+0.478
,+0.447
 the+0.272
.+0.243
 a+0.138
Neg logit contributions
umm-7.848
lor-7.766
ocol-7.746
 former-7.691
arians-7.646
Token
Token ID198
Feature activation+1.00e+00

Pos logit contributions
 and+2.992
,+2.955
.+2.740
 the+2.699
 a+2.534
Neg logit contributions
umm-5.440
lor-5.430
 former-5.347
ocol-5.304
arians-5.268
Token
Token ID198
Feature activation+1.00e+00

Pos logit contributions
 and+1.358
,+1.322
.+1.102
 the+1.061
 a+0.913
Neg logit contributions
lor-7.070
umm-6.971
 former-6.928
ocol-6.912
arians-6.845
TokenTom
Token ID13787
Feature activation+1.00e+00

Pos logit contributions
 and+3.613
,+3.572
.+3.374
 the+3.334
 a+3.174
Neg logit contributions
lor-4.887
umm-4.823
 former-4.745
ocol-4.730
arians-4.703
Token went
Token ID1816
Feature activation+1.00e+00

Pos logit contributions
,+2.868
 and+2.864
.+2.715
 the+2.684
 a+2.522
Neg logit contributions
lor-5.440
umm-5.354
arians-5.308
 former-5.307
ocol-5.243
Token to
Token ID284
Feature activation+1.00e+00

Pos logit contributions
 and+1.787
,+1.760
.+1.562
 the+1.528
 a+1.375
Neg logit contributions
lor-6.580
umm-6.527
 former-6.482
ocol-6.451
arians-6.428
Token anything
Token ID1997
Feature activation+1.00e+00

Pos logit contributions
 and+4.064
,+3.968
.+3.795
 the+3.748
 a+3.585
Neg logit contributions
lor-4.377
umm-4.318
ocol-4.264
 former-4.230
arians-4.176
Token either
Token ID2035
Feature activation+1.00e+00

Pos logit contributions
 and+1.779
,+1.737
 the+1.538
.+1.530
 a+1.393
Neg logit contributions
lor-6.547
umm-6.529
 former-6.460
ocol-6.444
arians-6.411
Token.
Token ID13
Feature activation+1.00e+00

Pos logit contributions
 and+0.833
,+0.783
 the+0.612
.+0.580
 a+0.464
Neg logit contributions
lor-7.493
umm-7.479
ocol-7.387
 former-7.354
arians-7.321
Token Tom
Token ID4186
Feature activation+1.00e+00

Pos logit contributions
 and+2.864
,+2.834
.+2.627
 the+2.577
 a+2.416
Neg logit contributions
lor-5.563
umm-5.531
 former-5.472
ocol-5.440
arians-5.389
Token was
Token ID373
Feature activation+1.00e+00

Pos logit contributions
,+2.908
 and+2.902
.+2.743
 the+2.710
 a+2.547
Neg logit contributions
lor-5.398
umm-5.341
 former-5.304
arians-5.269
ocol-5.231
Token scared
Token ID12008
Feature activation+1.00e+00

Pos logit contributions
 and+3.567
,+3.548
.+3.345
 the+3.276
 a+3.081
Neg logit contributions
lor-4.770
 former-4.758
umm-4.752
ocol-4.709
arians-4.641
Token.
Token ID13
Feature activation+1.00e+00

Pos logit contributions
 and+0.478
,+0.447
 the+0.272
.+0.243
 a+0.138
Neg logit contributions
umm-7.848
lor-7.766
ocol-7.746
 former-7.691
arians-7.646
Token
Token ID198
Feature activation+1.00e+00

Pos logit contributions
 and+2.992
,+2.955
.+2.740
 the+2.699
 a+2.534
Neg logit contributions
umm-5.440
lor-5.430
 former-5.347
ocol-5.304
arians-5.268
Token
Token ID198
Feature activation+1.00e+00

Pos logit contributions
 and+1.358
,+1.322
.+1.102
 the+1.061
 a+0.913
Neg logit contributions
lor-7.070
umm-6.971
 former-6.928
ocol-6.912
arians-6.845
TokenTom
Token ID13787
Feature activation+1.00e+00

Pos logit contributions
 and+3.613
,+3.572
.+3.374
 the+3.334
 a+3.174
Neg logit contributions
lor-4.887
umm-4.823
 former-4.745
ocol-4.730
arians-4.703
Token went
Token ID1816
Feature activation+1.00e+00

Pos logit contributions
,+2.868
 and+2.864
.+2.715
 the+2.684
 a+2.522
Neg logit contributions
lor-5.440
umm-5.354
arians-5.308
 former-5.307
ocol-5.243
Token not
Token ID407
Feature activation+1.00e+00

Pos logit contributions
 and+2.268
,+2.221
.+2.029
 the+2.002
 a+1.829
Neg logit contributions
lor-6.089
umm-6.036
 former-6.032
ocol-5.985
arians-5.927
Token say
Token ID910
Feature activation+1.00e+00

Pos logit contributions
 and+3.793
,+3.758
.+3.557
 the+3.542
 a+3.366
Neg logit contributions
umm-4.589
lor-4.535
ocol-4.522
 former-4.512
arians-4.432
Token anything
Token ID1997
Feature activation+1.00e+00

Pos logit contributions
 and+4.064
,+3.968
.+3.795
 the+3.748
 a+3.585
Neg logit contributions
lor-4.377
umm-4.318
ocol-4.264
 former-4.230
arians-4.176
Token either
Token ID2035
Feature activation+1.00e+00

Pos logit contributions
 and+1.779
,+1.737
 the+1.538
.+1.530
 a+1.393
Neg logit contributions
lor-6.547
umm-6.529
 former-6.460
ocol-6.444
arians-6.411
Token.
Token ID13
Feature activation+1.00e+00

Pos logit contributions
 and+0.833
,+0.783
 the+0.612
.+0.580
 a+0.464
Neg logit contributions
lor-7.493
umm-7.479
ocol-7.387
 former-7.354
arians-7.321
Token Tom
Token ID4186
Feature activation+1.00e+00

Pos logit contributions
 and+2.864
,+2.834
.+2.627
 the+2.577
 a+2.416
Neg logit contributions
lor-5.563
umm-5.531
 former-5.472
ocol-5.440
arians-5.389
Token was
Token ID373
Feature activation+1.00e+00

Pos logit contributions
,+2.908
 and+2.902
.+2.743
 the+2.710
 a+2.547
Neg logit contributions
lor-5.398
umm-5.341
 former-5.304
arians-5.269
ocol-5.231
Token scared
Token ID12008
Feature activation+1.00e+00

Pos logit contributions
 and+3.567
,+3.548
.+3.345
 the+3.276
 a+3.081
Neg logit contributions
lor-4.770
 former-4.758
umm-4.752
ocol-4.709
arians-4.641
Token.
Token ID13
Feature activation+1.00e+00

Pos logit contributions
 and+0.478
,+0.447
 the+0.272
.+0.243
 a+0.138
Neg logit contributions
umm-7.848
lor-7.766
ocol-7.746
 former-7.691
arians-7.646
Token
Token ID198
Feature activation+1.00e+00

Pos logit contributions
 and+2.992
,+2.955
.+2.740
 the+2.699
 a+2.534
Neg logit contributions
umm-5.440
lor-5.430
 former-5.347
ocol-5.304
arians-5.268
Token
Token ID198
Feature activation+1.00e+00

Pos logit contributions
 and+1.358
,+1.322
.+1.102
 the+1.061
 a+0.913
Neg logit contributions
lor-7.070
umm-6.971
 former-6.928
ocol-6.912
arians-6.845
Token say
Token ID910
Feature activation+1.00e+00

Pos logit contributions
 and+3.793
,+3.758
.+3.557
 the+3.542
 a+3.366
Neg logit contributions
umm-4.589
lor-4.535
ocol-4.522
 former-4.512
arians-4.432
Token anything
Token ID1997
Feature activation+1.00e+00

Pos logit contributions
 and+4.064
,+3.968
.+3.795
 the+3.748
 a+3.585
Neg logit contributions
lor-4.377
umm-4.318
ocol-4.264
 former-4.230
arians-4.176
Token either
Token ID2035
Feature activation+1.00e+00

Pos logit contributions
 and+1.779
,+1.737
 the+1.538
.+1.530
 a+1.393
Neg logit contributions
lor-6.547
umm-6.529
 former-6.460
ocol-6.444
arians-6.411
Token.
Token ID13
Feature activation+1.00e+00

Pos logit contributions
 and+0.833
,+0.783
 the+0.612
.+0.580
 a+0.464
Neg logit contributions
lor-7.493
umm-7.479
ocol-7.387
 former-7.354
arians-7.321
Token Tom
Token ID4186
Feature activation+1.00e+00

Pos logit contributions
 and+2.864
,+2.834
.+2.627
 the+2.577
 a+2.416
Neg logit contributions
lor-5.563
umm-5.531
 former-5.472
ocol-5.440
arians-5.389
Token was
Token ID373
Feature activation+1.00e+00

Pos logit contributions
,+2.908
 and+2.902
.+2.743
 the+2.710
 a+2.547
Neg logit contributions
lor-5.398
umm-5.341
 former-5.304
arians-5.269
ocol-5.231
Token scared
Token ID12008
Feature activation+1.00e+00

Pos logit contributions
 and+3.567
,+3.548
.+3.345
 the+3.276
 a+3.081
Neg logit contributions
lor-4.770
 former-4.758
umm-4.752
ocol-4.709
arians-4.641
Token.
Token ID13
Feature activation+1.00e+00

Pos logit contributions
 and+0.478
,+0.447
 the+0.272
.+0.243
 a+0.138
Neg logit contributions
umm-7.848
lor-7.766
ocol-7.746
 former-7.691
arians-7.646
Token
Token ID198
Feature activation+1.00e+00

Pos logit contributions
 and+2.992
,+2.955
.+2.740
 the+2.699
 a+2.534
Neg logit contributions
umm-5.440
lor-5.430
 former-5.347
ocol-5.304
arians-5.268
Token
Token ID198
Feature activation+1.00e+00

Pos logit contributions
 and+1.358
,+1.322
.+1.102
 the+1.061
 a+0.913
Neg logit contributions
lor-7.070
umm-6.971
 former-6.928
ocol-6.912
arians-6.845
TokenTom
Token ID13787
Feature activation+1.00e+00

Pos logit contributions
 and+3.613
,+3.572
.+3.374
 the+3.334
 a+3.174
Neg logit contributions
lor-4.887
umm-4.823
 former-4.745
ocol-4.730
arians-4.703
Token Tom
Token ID4186
Feature activation+1.00e+00

Pos logit contributions
 and+2.864
,+2.834
.+2.627
 the+2.577
 a+2.416
Neg logit contributions
lor-5.563
umm-5.531
 former-5.472
ocol-5.440
arians-5.389
Token was
Token ID373
Feature activation+1.00e+00

Pos logit contributions
,+2.908
 and+2.902
.+2.743
 the+2.710
 a+2.547
Neg logit contributions
lor-5.398
umm-5.341
 former-5.304
arians-5.269
ocol-5.231
Token scared
Token ID12008
Feature activation+1.00e+00

Pos logit contributions
 and+3.567
,+3.548
.+3.345
 the+3.276
 a+3.081
Neg logit contributions
lor-4.770
 former-4.758
umm-4.752
ocol-4.709
arians-4.641
Token.
Token ID13
Feature activation+1.00e+00

Pos logit contributions
 and+0.478
,+0.447
 the+0.272
.+0.243
 a+0.138
Neg logit contributions
umm-7.848
lor-7.766
ocol-7.746
 former-7.691
arians-7.646
Token
Token ID198
Feature activation+1.00e+00

Pos logit contributions
 and+2.992
,+2.955
.+2.740
 the+2.699
 a+2.534
Neg logit contributions
umm-5.440
lor-5.430
 former-5.347
ocol-5.304
arians-5.268
Token
Token ID198
Feature activation+1.00e+00

Pos logit contributions
 and+1.358
,+1.322
.+1.102
 the+1.061
 a+0.913
Neg logit contributions
lor-7.070
umm-6.971
 former-6.928
ocol-6.912
arians-6.845
TokenTom
Token ID13787
Feature activation+1.00e+00

Pos logit contributions
 and+3.613
,+3.572
.+3.374
 the+3.334
 a+3.174
Neg logit contributions
lor-4.887
umm-4.823
 former-4.745
ocol-4.730
arians-4.703
Token went
Token ID1816
Feature activation+1.00e+00

Pos logit contributions
,+2.868
 and+2.864
.+2.715
 the+2.684
 a+2.522
Neg logit contributions
lor-5.440
umm-5.354
arians-5.308
 former-5.307
ocol-5.243
Token to
Token ID284
Feature activation+1.00e+00

Pos logit contributions
 and+1.787
,+1.760
.+1.562
 the+1.528
 a+1.375
Neg logit contributions
lor-6.580
umm-6.527
 former-6.482
ocol-6.451
arians-6.428
Token his
Token ID465
Feature activation+1.00e+00

Pos logit contributions
 and+2.282
,+2.249
.+2.041
 the+1.910
 a+1.790
Neg logit contributions
lor-6.056
umm-6.045
 former-6.040
ocol-6.032
arians-5.941
Token bed
Token ID3996
Feature activation+1.00e+00

Pos logit contributions
 and+3.694
,+3.665
.+3.474
 the+3.445
 a+3.274
Neg logit contributions
umm-4.693
 former-4.681
lor-4.647
arians-4.565
ocol-4.536
Token.
Token ID13
Feature activation+1.00e+00

Pos logit contributions
 and+0.833
,+0.783
 the+0.612
.+0.580
 a+0.464
Neg logit contributions
lor-7.493
umm-7.479
ocol-7.387
 former-7.354
arians-7.321
Token Tom
Token ID4186
Feature activation+1.00e+00

Pos logit contributions
 and+2.864
,+2.834
.+2.627
 the+2.577
 a+2.416
Neg logit contributions
lor-5.563
umm-5.531
 former-5.472
ocol-5.440
arians-5.389
Token was
Token ID373
Feature activation+1.00e+00

Pos logit contributions
,+2.908
 and+2.902
.+2.743
 the+2.710
 a+2.547
Neg logit contributions
lor-5.398
umm-5.341
 former-5.304
arians-5.269
ocol-5.231
Token scared
Token ID12008
Feature activation+1.00e+00

Pos logit contributions
 and+3.567
,+3.548
.+3.345
 the+3.276
 a+3.081
Neg logit contributions
lor-4.770
 former-4.758
umm-4.752
ocol-4.709
arians-4.641
Token.
Token ID13
Feature activation+1.00e+00

Pos logit contributions
 and+0.478
,+0.447
 the+0.272
.+0.243
 a+0.138
Neg logit contributions
umm-7.848
lor-7.766
ocol-7.746
 former-7.691
arians-7.646
Token
Token ID198
Feature activation+1.00e+00

Pos logit contributions
 and+2.992
,+2.955
.+2.740
 the+2.699
 a+2.534
Neg logit contributions
umm-5.440
lor-5.430
 former-5.347
ocol-5.304
arians-5.268
Token
Token ID198
Feature activation+1.00e+00

Pos logit contributions
 and+1.358
,+1.322
.+1.102
 the+1.061
 a+0.913
Neg logit contributions
lor-7.070
umm-6.971
 former-6.928
ocol-6.912
arians-6.845
TokenTom
Token ID13787
Feature activation+1.00e+00

Pos logit contributions
 and+3.613
,+3.572
.+3.374
 the+3.334
 a+3.174
Neg logit contributions
lor-4.887
umm-4.823
 former-4.745
ocol-4.730
arians-4.703
Token went
Token ID1816
Feature activation+1.00e+00

Pos logit contributions
,+2.868
 and+2.864
.+2.715
 the+2.684
 a+2.522
Neg logit contributions
lor-5.440
umm-5.354
arians-5.308
 former-5.307
ocol-5.243
Token to
Token ID284
Feature activation+1.00e+00

Pos logit contributions
 and+1.787
,+1.760
.+1.562
 the+1.528
 a+1.375
Neg logit contributions
lor-6.580
umm-6.527
 former-6.482
ocol-6.451
arians-6.428
Token his
Token ID465
Feature activation+1.00e+00

Pos logit contributions
 and+2.282
,+2.249
.+2.041
 the+1.910
 a+1.790
Neg logit contributions
lor-6.056
umm-6.045
 former-6.040
ocol-6.032
arians-5.941
Token was
Token ID373
Feature activation+1.00e+00

Pos logit contributions
,+2.908
 and+2.902
.+2.743
 the+2.710
 a+2.547
Neg logit contributions
lor-5.398
umm-5.341
 former-5.304
arians-5.269
ocol-5.231
Token scared
Token ID12008
Feature activation+1.00e+00

Pos logit contributions
 and+3.567
,+3.548
.+3.345
 the+3.276
 a+3.081
Neg logit contributions
lor-4.770
 former-4.758
umm-4.752
ocol-4.709
arians-4.641
Token.
Token ID13
Feature activation+1.00e+00

Pos logit contributions
 and+0.478
,+0.447
 the+0.272
.+0.243
 a+0.138
Neg logit contributions
umm-7.848
lor-7.766
ocol-7.746
 former-7.691
arians-7.646
Token
Token ID198
Feature activation+1.00e+00

Pos logit contributions
 and+2.992
,+2.955
.+2.740
 the+2.699
 a+2.534
Neg logit contributions
umm-5.440
lor-5.430
 former-5.347
ocol-5.304
arians-5.268
Token
Token ID198
Feature activation+1.00e+00

Pos logit contributions
 and+1.358
,+1.322
.+1.102
 the+1.061
 a+0.913
Neg logit contributions
lor-7.070
umm-6.971
 former-6.928
ocol-6.912
arians-6.845
TokenTom
Token ID13787
Feature activation+1.00e+00

Pos logit contributions
 and+3.613
,+3.572
.+3.374
 the+3.334
 a+3.174
Neg logit contributions
lor-4.887
umm-4.823
 former-4.745
ocol-4.730
arians-4.703
Token went
Token ID1816
Feature activation+1.00e+00

Pos logit contributions
,+2.868
 and+2.864
.+2.715
 the+2.684
 a+2.522
Neg logit contributions
lor-5.440
umm-5.354
arians-5.308
 former-5.307
ocol-5.243
Token to
Token ID284
Feature activation+1.00e+00

Pos logit contributions
 and+1.787
,+1.760
.+1.562
 the+1.528
 a+1.375
Neg logit contributions
lor-6.580
umm-6.527
 former-6.482
ocol-6.451
arians-6.428
Token his
Token ID465
Feature activation+1.00e+00

Pos logit contributions
 and+2.282
,+2.249
.+2.041
 the+1.910
 a+1.790
Neg logit contributions
lor-6.056
umm-6.045
 former-6.040
ocol-6.032
arians-5.941
Token bed
Token ID3996
Feature activation+1.00e+00

Pos logit contributions
 and+3.694
,+3.665
.+3.474
 the+3.445
 a+3.274
Neg logit contributions
umm-4.693
 former-4.681
lor-4.647
arians-4.565
ocol-4.536
Token and
Token ID290
Feature activation+1.00e+00

Pos logit contributions
 and+1.096
,+1.073
 the+0.906
.+0.874
 a+0.735
Neg logit contributions
lor-7.260
umm-7.214
 former-7.169
arians-7.074
ocol-7.051
Token scared
Token ID12008
Feature activation+1.00e+00

Pos logit contributions
 and+3.567
,+3.548
.+3.345
 the+3.276
 a+3.081
Neg logit contributions
lor-4.770
 former-4.758
umm-4.752
ocol-4.709
arians-4.641
Token.
Token ID13
Feature activation+1.00e+00

Pos logit contributions
 and+0.478
,+0.447
 the+0.272
.+0.243
 a+0.138
Neg logit contributions
umm-7.848
lor-7.766
ocol-7.746
 former-7.691
arians-7.646
Token
Token ID198
Feature activation+1.00e+00

Pos logit contributions
 and+2.992
,+2.955
.+2.740
 the+2.699
 a+2.534
Neg logit contributions
umm-5.440
lor-5.430
 former-5.347
ocol-5.304
arians-5.268
Token
Token ID198
Feature activation+1.00e+00

Pos logit contributions
 and+1.358
,+1.322
.+1.102
 the+1.061
 a+0.913
Neg logit contributions
lor-7.070
umm-6.971
 former-6.928
ocol-6.912
arians-6.845
TokenTom
Token ID13787
Feature activation+1.00e+00

Pos logit contributions
 and+3.613
,+3.572
.+3.374
 the+3.334
 a+3.174
Neg logit contributions
lor-4.887
umm-4.823
 former-4.745
ocol-4.730
arians-4.703
Token went
Token ID1816
Feature activation+1.00e+00

Pos logit contributions
,+2.868
 and+2.864
.+2.715
 the+2.684
 a+2.522
Neg logit contributions
lor-5.440
umm-5.354
arians-5.308
 former-5.307
ocol-5.243
Token to
Token ID284
Feature activation+1.00e+00

Pos logit contributions
 and+1.787
,+1.760
.+1.562
 the+1.528
 a+1.375
Neg logit contributions
lor-6.580
umm-6.527
 former-6.482
ocol-6.451
arians-6.428
Token his
Token ID465
Feature activation+1.00e+00

Pos logit contributions
 and+2.282
,+2.249
.+2.041
 the+1.910
 a+1.790
Neg logit contributions
lor-6.056
umm-6.045
 former-6.040
ocol-6.032
arians-5.941
Token bed
Token ID3996
Feature activation+1.00e+00

Pos logit contributions
 and+3.694
,+3.665
.+3.474
 the+3.445
 a+3.274
Neg logit contributions
umm-4.693
 former-4.681
lor-4.647
arians-4.565
ocol-4.536
Token and
Token ID290
Feature activation+1.00e+00

Pos logit contributions
 and+1.096
,+1.073
 the+0.906
.+0.874
 a+0.735
Neg logit contributions
lor-7.260
umm-7.214
 former-7.169
arians-7.074
ocol-7.051
Token hid
Token ID24519
Feature activation+1.00e+00

Pos logit contributions
 and+3.485
,+3.462
.+3.277
 the+3.180
 a+3.024
Neg logit contributions
lor-4.871
 former-4.818
umm-4.776
ocol-4.726
arians-4.710
Token did
Token ID750
Feature activation+1.00e+00

Pos logit contributions
 and+3.665
,+3.632
.+3.453
 the+3.443
 a+3.274
Neg logit contributions
lor-4.683
umm-4.664
 former-4.567
ocol-4.556
arians-4.502
Token not
Token ID407
Feature activation+1.00e+00

Pos logit contributions
 and+2.268
,+2.221
.+2.029
 the+2.002
 a+1.829
Neg logit contributions
lor-6.089
umm-6.036
 former-6.032
ocol-5.985
arians-5.927
Token say
Token ID910
Feature activation+1.00e+00

Pos logit contributions
 and+3.793
,+3.758
.+3.557
 the+3.542
 a+3.366
Neg logit contributions
umm-4.589
lor-4.535
ocol-4.522
 former-4.512
arians-4.432
Token anything
Token ID1997
Feature activation+1.00e+00

Pos logit contributions
 and+4.064
,+3.968
.+3.795
 the+3.748
 a+3.585
Neg logit contributions
lor-4.377
umm-4.318
ocol-4.264
 former-4.230
arians-4.176
Token either
Token ID2035
Feature activation+1.00e+00

Pos logit contributions
 and+1.779
,+1.737
 the+1.538
.+1.530
 a+1.393
Neg logit contributions
lor-6.547
umm-6.529
 former-6.460
ocol-6.444
arians-6.411
Token.
Token ID13
Feature activation+1.00e+00

Pos logit contributions
 and+0.833
,+0.783
 the+0.612
.+0.580
 a+0.464
Neg logit contributions
lor-7.493
umm-7.479
ocol-7.387
 former-7.354
arians-7.321
Token Tom
Token ID4186
Feature activation+1.00e+00

Pos logit contributions
 and+2.864
,+2.834
.+2.627
 the+2.577
 a+2.416
Neg logit contributions
lor-5.563
umm-5.531
 former-5.472
ocol-5.440
arians-5.389
Token was
Token ID373
Feature activation+1.00e+00

Pos logit contributions
,+2.908
 and+2.902
.+2.743
 the+2.710
 a+2.547
Neg logit contributions
lor-5.398
umm-5.341
 former-5.304
arians-5.269
ocol-5.231
Token scared
Token ID12008
Feature activation+1.00e+00

Pos logit contributions
 and+3.567
,+3.548
.+3.345
 the+3.276
 a+3.081
Neg logit contributions
lor-4.770
 former-4.758
umm-4.752
ocol-4.709
arians-4.641
Token.
Token ID13
Feature activation+1.00e+00

Pos logit contributions
 and+0.478
,+0.447
 the+0.272
.+0.243
 a+0.138
Neg logit contributions
umm-7.848
lor-7.766
ocol-7.746
 former-7.691
arians-7.646
Token
Token ID198
Feature activation+1.00e+00

Pos logit contributions
 and+2.992
,+2.955
.+2.740
 the+2.699
 a+2.534
Neg logit contributions
umm-5.440
lor-5.430
 former-5.347
ocol-5.304
arians-5.268
Token table
Token ID3084
Feature activation+1.00e+00

Pos logit contributions
 and+4.103
,+4.077
.+3.873
 the+3.829
 a+3.657
Neg logit contributions
umm-4.311
lor-4.243
 former-4.229
ocol-4.228
arians-4.144
Token did
Token ID750
Feature activation+1.00e+00

Pos logit contributions
 and+3.665
,+3.632
.+3.453
 the+3.443
 a+3.274
Neg logit contributions
lor-4.683
umm-4.664
 former-4.567
ocol-4.556
arians-4.502
Token not
Token ID407
Feature activation+1.00e+00

Pos logit contributions
 and+2.268
,+2.221
.+2.029
 the+2.002
 a+1.829
Neg logit contributions
lor-6.089
umm-6.036
 former-6.032
ocol-5.985
arians-5.927
Token say
Token ID910
Feature activation+1.00e+00

Pos logit contributions
 and+3.793
,+3.758
.+3.557
 the+3.542
 a+3.366
Neg logit contributions
umm-4.589
lor-4.535
ocol-4.522
 former-4.512
arians-4.432
Token anything
Token ID1997
Feature activation+1.00e+00

Pos logit contributions
 and+4.064
,+3.968
.+3.795
 the+3.748
 a+3.585
Neg logit contributions
lor-4.377
umm-4.318
ocol-4.264
 former-4.230
arians-4.176
Token either
Token ID2035
Feature activation+1.00e+00

Pos logit contributions
 and+1.779
,+1.737
 the+1.538
.+1.530
 a+1.393
Neg logit contributions
lor-6.547
umm-6.529
 former-6.460
ocol-6.444
arians-6.411
Token.
Token ID13
Feature activation+1.00e+00

Pos logit contributions
 and+0.833
,+0.783
 the+0.612
.+0.580
 a+0.464
Neg logit contributions
lor-7.493
umm-7.479
ocol-7.387
 former-7.354
arians-7.321
Token Tom
Token ID4186
Feature activation+1.00e+00

Pos logit contributions
 and+2.864
,+2.834
.+2.627
 the+2.577
 a+2.416
Neg logit contributions
lor-5.563
umm-5.531
 former-5.472
ocol-5.440
arians-5.389
Token was
Token ID373
Feature activation+1.00e+00

Pos logit contributions
,+2.908
 and+2.902
.+2.743
 the+2.710
 a+2.547
Neg logit contributions
lor-5.398
umm-5.341
 former-5.304
arians-5.269
ocol-5.231
Token scared
Token ID12008
Feature activation+1.00e+00

Pos logit contributions
 and+3.567
,+3.548
.+3.345
 the+3.276
 a+3.081
Neg logit contributions
lor-4.770
 former-4.758
umm-4.752
ocol-4.709
arians-4.641
Token.
Token ID13
Feature activation+1.00e+00

Pos logit contributions
 and+0.478
,+0.447
 the+0.272
.+0.243
 a+0.138
Neg logit contributions
umm-7.848
lor-7.766
ocol-7.746
 former-7.691
arians-7.646
Token?"
Token ID1701
Feature activation+1.00e+00

Pos logit contributions
 and+2.056
,+2.020
 the+1.860
.+1.835
 a+1.686
Neg logit contributions
umm-6.270
lor-6.266
 former-6.171
ocol-6.146
arians-6.102
Token The
Token ID383
Feature activation+1.00e+00

Pos logit contributions
 and+3.038
,+2.992
.+2.806
 the+2.737
 a+2.598
Neg logit contributions
lor-5.312
umm-5.295
 former-5.254
ocol-5.246
arians-5.169
Token table
Token ID3084
Feature activation+1.00e+00

Pos logit contributions
 and+4.103
,+4.077
.+3.873
 the+3.829
 a+3.657
Neg logit contributions
umm-4.311
lor-4.243
 former-4.229
ocol-4.228
arians-4.144
Token did
Token ID750
Feature activation+1.00e+00

Pos logit contributions
 and+3.665
,+3.632
.+3.453
 the+3.443
 a+3.274
Neg logit contributions
lor-4.683
umm-4.664
 former-4.567
ocol-4.556
arians-4.502
Token not
Token ID407
Feature activation+1.00e+00

Pos logit contributions
 and+2.268
,+2.221
.+2.029
 the+2.002
 a+1.829
Neg logit contributions
lor-6.089
umm-6.036
 former-6.032
ocol-5.985
arians-5.927
Token say
Token ID910
Feature activation+1.00e+00

Pos logit contributions
 and+3.793
,+3.758
.+3.557
 the+3.542
 a+3.366
Neg logit contributions
umm-4.589
lor-4.535
ocol-4.522
 former-4.512
arians-4.432
Token anything
Token ID1997
Feature activation+1.00e+00

Pos logit contributions
 and+4.064
,+3.968
.+3.795
 the+3.748
 a+3.585
Neg logit contributions
lor-4.377
umm-4.318
ocol-4.264
 former-4.230
arians-4.176
Token either
Token ID2035
Feature activation+1.00e+00

Pos logit contributions
 and+1.779
,+1.737
 the+1.538
.+1.530
 a+1.393
Neg logit contributions
lor-6.547
umm-6.529
 former-6.460
ocol-6.444
arians-6.411
Token.
Token ID13
Feature activation+1.00e+00

Pos logit contributions
 and+0.833
,+0.783
 the+0.612
.+0.580
 a+0.464
Neg logit contributions
lor-7.493
umm-7.479
ocol-7.387
 former-7.354
arians-7.321
Token Tom
Token ID4186
Feature activation+1.00e+00

Pos logit contributions
 and+2.864
,+2.834
.+2.627
 the+2.577
 a+2.416
Neg logit contributions
lor-5.563
umm-5.531
 former-5.472
ocol-5.440
arians-5.389
Token was
Token ID373
Feature activation+1.00e+00

Pos logit contributions
,+2.908
 and+2.902
.+2.743
 the+2.710
 a+2.547
Neg logit contributions
lor-5.398
umm-5.341
 former-5.304
arians-5.269
ocol-5.231
Token The
Token ID383
Feature activation+1.00e+00

Pos logit contributions
 and+3.038
,+2.992
.+2.806
 the+2.737
 a+2.598
Neg logit contributions
lor-5.312
umm-5.295
 former-5.254
ocol-5.246
arians-5.169
Token table
Token ID3084
Feature activation+1.00e+00

Pos logit contributions
 and+4.103
,+4.077
.+3.873
 the+3.829
 a+3.657
Neg logit contributions
umm-4.311
lor-4.243
 former-4.229
ocol-4.228
arians-4.144
Token did
Token ID750
Feature activation+1.00e+00

Pos logit contributions
 and+3.665
,+3.632
.+3.453
 the+3.443
 a+3.274
Neg logit contributions
lor-4.683
umm-4.664
 former-4.567
ocol-4.556
arians-4.502
Token not
Token ID407
Feature activation+1.00e+00

Pos logit contributions
 and+2.268
,+2.221
.+2.029
 the+2.002
 a+1.829
Neg logit contributions
lor-6.089
umm-6.036
 former-6.032
ocol-5.985
arians-5.927
Token say
Token ID910
Feature activation+1.00e+00

Pos logit contributions
 and+3.793
,+3.758
.+3.557
 the+3.542
 a+3.366
Neg logit contributions
umm-4.589
lor-4.535
ocol-4.522
 former-4.512
arians-4.432
Token anything
Token ID1997
Feature activation+1.00e+00

Pos logit contributions
 and+4.064
,+3.968
.+3.795
 the+3.748
 a+3.585
Neg logit contributions
lor-4.377
umm-4.318
ocol-4.264
 former-4.230
arians-4.176
Token either
Token ID2035
Feature activation+1.00e+00

Pos logit contributions
 and+1.779
,+1.737
 the+1.538
.+1.530
 a+1.393
Neg logit contributions
lor-6.547
umm-6.529
 former-6.460
ocol-6.444
arians-6.411
Token.
Token ID13
Feature activation+1.00e+00

Pos logit contributions
 and+0.833
,+0.783
 the+0.612
.+0.580
 a+0.464
Neg logit contributions
lor-7.493
umm-7.479
ocol-7.387
 former-7.354
arians-7.321
Token Tom
Token ID4186
Feature activation+1.00e+00

Pos logit contributions
 and+2.864
,+2.834
.+2.627
 the+2.577
 a+2.416
Neg logit contributions
lor-5.563
umm-5.531
 former-5.472
ocol-5.440
arians-5.389
Token was
Token ID373
Feature activation+1.00e+00

Pos logit contributions
,+2.908
 and+2.902
.+2.743
 the+2.710
 a+2.547
Neg logit contributions
lor-5.398
umm-5.341
 former-5.304
arians-5.269
ocol-5.231
Token scared
Token ID12008
Feature activation+1.00e+00

Pos logit contributions
 and+3.567
,+3.548
.+3.345
 the+3.276
 a+3.081
Neg logit contributions
lor-4.770
 former-4.758
umm-4.752
ocol-4.709
arians-4.641
TokenTom
Token ID13787
Feature activation+1.00e+00

Pos logit contributions
 and+3.613
,+3.572
.+3.374
 the+3.334
 a+3.174
Neg logit contributions
lor-4.887
umm-4.823
 former-4.745
ocol-4.730
arians-4.703
Token went
Token ID1816
Feature activation+1.00e+00

Pos logit contributions
,+2.868
 and+2.864
.+2.715
 the+2.684
 a+2.522
Neg logit contributions
lor-5.440
umm-5.354
arians-5.308
 former-5.307
ocol-5.243
Token to
Token ID284
Feature activation+1.00e+00

Pos logit contributions
 and+1.787
,+1.760
.+1.562
 the+1.528
 a+1.375
Neg logit contributions
lor-6.580
umm-6.527
 former-6.482
ocol-6.451
arians-6.428
Token his
Token ID465
Feature activation+1.00e+00

Pos logit contributions
 and+2.282
,+2.249
.+2.041
 the+1.910
 a+1.790
Neg logit contributions
lor-6.056
umm-6.045
 former-6.040
ocol-6.032
arians-5.941
Token bed
Token ID3996
Feature activation+1.00e+00

Pos logit contributions
 and+3.694
,+3.665
.+3.474
 the+3.445
 a+3.274
Neg logit contributions
umm-4.693
 former-4.681
lor-4.647
arians-4.565
ocol-4.536
Token and
Token ID290
Feature activation+1.00e+00

Pos logit contributions
 and+1.096
,+1.073
 the+0.906
.+0.874
 a+0.735
Neg logit contributions
lor-7.260
umm-7.214
 former-7.169
arians-7.074
ocol-7.051
Token hid
Token ID24519
Feature activation+1.00e+00

Pos logit contributions
 and+3.485
,+3.462
.+3.277
 the+3.180
 a+3.024
Neg logit contributions
lor-4.871
 former-4.818
umm-4.776
ocol-4.726
arians-4.710
Token under
Token ID739
Feature activation+1.00e+00

Pos logit contributions
 and+2.879
,+2.857
.+2.609
 the+2.575
 a+2.443
Neg logit contributions
umm-5.473
lor-5.457
ocol-5.448
 former-5.383
arians-5.336
Token the
Token ID262
Feature activation+1.00e+00

Pos logit contributions
 and+1.456
,+1.420
.+1.206
 the+1.091
 a+0.939
Neg logit contributions
umm-6.918
lor-6.914
ocol-6.888
 former-6.856
arians-6.745
Token covers
Token ID8698
Feature activation+1.00e+00

Pos logit contributions
 and+4.343
,+4.330
.+4.109
 the+4.103
 a+3.960
Neg logit contributions
umm-4.028
lor-4.008
ocol-3.948
 former-3.912
arians-3.823
Token.
Token ID13
Feature activation+1.00e+00

Pos logit contributions
 and+0.721
,+0.694
 the+0.467
.+0.454
 a+0.332
Neg logit contributions
lor-7.660
umm-7.640
ocol-7.531
 former-7.513
arians-7.458
Token
Token ID198
Feature activation+1.00e+00

Pos logit contributions
 and+1.358
,+1.322
.+1.102
 the+1.061
 a+0.913
Neg logit contributions
lor-7.070
umm-6.971
 former-6.928
ocol-6.912
arians-6.845
TokenTom
Token ID13787
Feature activation+1.00e+00

Pos logit contributions
 and+3.613
,+3.572
.+3.374
 the+3.334
 a+3.174
Neg logit contributions
lor-4.887
umm-4.823
 former-4.745
ocol-4.730
arians-4.703
Token went
Token ID1816
Feature activation+1.00e+00

Pos logit contributions
,+2.868
 and+2.864
.+2.715
 the+2.684
 a+2.522
Neg logit contributions
lor-5.440
umm-5.354
arians-5.308
 former-5.307
ocol-5.243
Token to
Token ID284
Feature activation+1.00e+00

Pos logit contributions
 and+1.787
,+1.760
.+1.562
 the+1.528
 a+1.375
Neg logit contributions
lor-6.580
umm-6.527
 former-6.482
ocol-6.451
arians-6.428
Token his
Token ID465
Feature activation+1.00e+00

Pos logit contributions
 and+2.282
,+2.249
.+2.041
 the+1.910
 a+1.790
Neg logit contributions
lor-6.056
umm-6.045
 former-6.040
ocol-6.032
arians-5.941
Token bed
Token ID3996
Feature activation+1.00e+00

Pos logit contributions
 and+3.694
,+3.665
.+3.474
 the+3.445
 a+3.274
Neg logit contributions
umm-4.693
 former-4.681
lor-4.647
arians-4.565
ocol-4.536
Token and
Token ID290
Feature activation+1.00e+00

Pos logit contributions
 and+1.096
,+1.073
 the+0.906
.+0.874
 a+0.735
Neg logit contributions
lor-7.260
umm-7.214
 former-7.169
arians-7.074
ocol-7.051
Token hid
Token ID24519
Feature activation+1.00e+00

Pos logit contributions
 and+3.485
,+3.462
.+3.277
 the+3.180
 a+3.024
Neg logit contributions
lor-4.871
 former-4.818
umm-4.776
ocol-4.726
arians-4.710
Token under
Token ID739
Feature activation+1.00e+00

Pos logit contributions
 and+2.879
,+2.857
.+2.609
 the+2.575
 a+2.443
Neg logit contributions
umm-5.473
lor-5.457
ocol-5.448
 former-5.383
arians-5.336
Token the
Token ID262
Feature activation+1.00e+00

Pos logit contributions
 and+1.456
,+1.420
.+1.206
 the+1.091
 a+0.939
Neg logit contributions
umm-6.918
lor-6.914
ocol-6.888
 former-6.856
arians-6.745
Token covers
Token ID8698
Feature activation+1.00e+00

Pos logit contributions
 and+4.343
,+4.330
.+4.109
 the+4.103
 a+3.960
Neg logit contributions
umm-4.028
lor-4.008
ocol-3.948
 former-3.912
arians-3.823
Token.
Token ID13
Feature activation+1.00e+00

Pos logit contributions
 and+0.478
,+0.447
 the+0.272
.+0.243
 a+0.138
Neg logit contributions
umm-7.848
lor-7.766
ocol-7.746
 former-7.691
arians-7.646
Token
Token ID198
Feature activation+1.00e+00

Pos logit contributions
 and+2.992
,+2.955
.+2.740
 the+2.699
 a+2.534
Neg logit contributions
umm-5.440
lor-5.430
 former-5.347
ocol-5.304
arians-5.268
Token
Token ID198
Feature activation+1.00e+00

Pos logit contributions
 and+1.358
,+1.322
.+1.102
 the+1.061
 a+0.913
Neg logit contributions
lor-7.070
umm-6.971
 former-6.928
ocol-6.912
arians-6.845
TokenTom
Token ID13787
Feature activation+1.00e+00

Pos logit contributions
 and+3.613
,+3.572
.+3.374
 the+3.334
 a+3.174
Neg logit contributions
lor-4.887
umm-4.823
 former-4.745
ocol-4.730
arians-4.703
Token went
Token ID1816
Feature activation+1.00e+00

Pos logit contributions
,+2.868
 and+2.864
.+2.715
 the+2.684
 a+2.522
Neg logit contributions
lor-5.440
umm-5.354
arians-5.308
 former-5.307
ocol-5.243
Token to
Token ID284
Feature activation+1.00e+00

Pos logit contributions
 and+1.787
,+1.760
.+1.562
 the+1.528
 a+1.375
Neg logit contributions
lor-6.580
umm-6.527
 former-6.482
ocol-6.451
arians-6.428
Token his
Token ID465
Feature activation+1.00e+00

Pos logit contributions
 and+2.282
,+2.249
.+2.041
 the+1.910
 a+1.790
Neg logit contributions
lor-6.056
umm-6.045
 former-6.040
ocol-6.032
arians-5.941
Token bed
Token ID3996
Feature activation+1.00e+00

Pos logit contributions
 and+3.694
,+3.665
.+3.474
 the+3.445
 a+3.274
Neg logit contributions
umm-4.693
 former-4.681
lor-4.647
arians-4.565
ocol-4.536
Token and
Token ID290
Feature activation+1.00e+00

Pos logit contributions
 and+1.096
,+1.073
 the+0.906
.+0.874
 a+0.735
Neg logit contributions
lor-7.260
umm-7.214
 former-7.169
arians-7.074
ocol-7.051
Token hid
Token ID24519
Feature activation+1.00e+00

Pos logit contributions
 and+3.485
,+3.462
.+3.277
 the+3.180
 a+3.024
Neg logit contributions
lor-4.871
 former-4.818
umm-4.776
ocol-4.726
arians-4.710
Token under
Token ID739
Feature activation+1.00e+00

Pos logit contributions
 and+2.879
,+2.857
.+2.609
 the+2.575
 a+2.443
Neg logit contributions
umm-5.473
lor-5.457
ocol-5.448
 former-5.383
arians-5.336
Token
Token ID198
Feature activation+1.00e+00

Pos logit contributions
 and+2.992
,+2.955
.+2.740
 the+2.699
 a+2.534
Neg logit contributions
umm-5.440
lor-5.430
 former-5.347
ocol-5.304
arians-5.268
Token
Token ID198
Feature activation+1.00e+00

Pos logit contributions
 and+1.358
,+1.322
.+1.102
 the+1.061
 a+0.913
Neg logit contributions
lor-7.070
umm-6.971
 former-6.928
ocol-6.912
arians-6.845
TokenTom
Token ID13787
Feature activation+1.00e+00

Pos logit contributions
 and+3.613
,+3.572
.+3.374
 the+3.334
 a+3.174
Neg logit contributions
lor-4.887
umm-4.823
 former-4.745
ocol-4.730
arians-4.703
Token went
Token ID1816
Feature activation+1.00e+00

Pos logit contributions
,+2.868
 and+2.864
.+2.715
 the+2.684
 a+2.522
Neg logit contributions
lor-5.440
umm-5.354
arians-5.308
 former-5.307
ocol-5.243
Token to
Token ID284
Feature activation+1.00e+00

Pos logit contributions
 and+1.787
,+1.760
.+1.562
 the+1.528
 a+1.375
Neg logit contributions
lor-6.580
umm-6.527
 former-6.482
ocol-6.451
arians-6.428
Token his
Token ID465
Feature activation+1.00e+00

Pos logit contributions
 and+2.282
,+2.249
.+2.041
 the+1.910
 a+1.790
Neg logit contributions
lor-6.056
umm-6.045
 former-6.040
ocol-6.032
arians-5.941
Token bed
Token ID3996
Feature activation+1.00e+00

Pos logit contributions
 and+3.694
,+3.665
.+3.474
 the+3.445
 a+3.274
Neg logit contributions
umm-4.693
 former-4.681
lor-4.647
arians-4.565
ocol-4.536
Token and
Token ID290
Feature activation+1.00e+00

Pos logit contributions
 and+1.096
,+1.073
 the+0.906
.+0.874
 a+0.735
Neg logit contributions
lor-7.260
umm-7.214
 former-7.169
arians-7.074
ocol-7.051
Token hid
Token ID24519
Feature activation+1.00e+00

Pos logit contributions
 and+3.485
,+3.462
.+3.277
 the+3.180
 a+3.024
Neg logit contributions
lor-4.871
 former-4.818
umm-4.776
ocol-4.726
arians-4.710
Token under
Token ID739
Feature activation+1.00e+00

Pos logit contributions
 and+2.879
,+2.857
.+2.609
 the+2.575
 a+2.443
Neg logit contributions
umm-5.473
lor-5.457
ocol-5.448
 former-5.383
arians-5.336
Token the
Token ID262
Feature activation+1.00e+00

Pos logit contributions
 and+1.456
,+1.420
.+1.206
 the+1.091
 a+0.939
Neg logit contributions
umm-6.918
lor-6.914
ocol-6.888
 former-6.856
arians-6.745
Token talking
Token ID3375
Feature activation+1.00e+00

Pos logit contributions
 and+3.632
,+3.569
.+3.383
 the+3.378
 a+3.203
Neg logit contributions
lor-4.728
umm-4.678
 former-4.668
ocol-4.616
arians-4.584
Token,
Token ID11
Feature activation+1.00e+00

Pos logit contributions
 and+2.690
,+2.673
 the+2.471
.+2.462
 a+2.293
Neg logit contributions
lor-5.655
umm-5.621
 former-5.582
ocol-5.509
arians-5.499
Token table
Token ID3084
Feature activation+1.00e+00

Pos logit contributions
 and+3.617
,+3.607
.+3.444
 the+3.371
 a+3.223
Neg logit contributions
umm-4.713
lor-4.680
 former-4.671
ocol-4.623
arians-4.581
Token?"
Token ID1701
Feature activation+1.00e+00

Pos logit contributions
 and+2.056
,+2.020
 the+1.860
.+1.835
 a+1.686
Neg logit contributions
umm-6.270
lor-6.266
 former-6.171
ocol-6.146
arians-6.102
Token The
Token ID383
Feature activation+1.00e+00

Pos logit contributions
 and+3.038
,+2.992
.+2.806
 the+2.737
 a+2.598
Neg logit contributions
lor-5.312
umm-5.295
 former-5.254
ocol-5.246
arians-5.169
Token table
Token ID3084
Feature activation+1.00e+00

Pos logit contributions
 and+4.103
,+4.077
.+3.873
 the+3.829
 a+3.657
Neg logit contributions
umm-4.311
lor-4.243
 former-4.229
ocol-4.228
arians-4.144
Token did
Token ID750
Feature activation+1.00e+00

Pos logit contributions
 and+3.665
,+3.632
.+3.453
 the+3.443
 a+3.274
Neg logit contributions
lor-4.683
umm-4.664
 former-4.567
ocol-4.556
arians-4.502
Token not
Token ID407
Feature activation+1.00e+00

Pos logit contributions
 and+2.268
,+2.221
.+2.029
 the+2.002
 a+1.829
Neg logit contributions
lor-6.089
umm-6.036
 former-6.032
ocol-5.985
arians-5.927
Token say
Token ID910
Feature activation+1.00e+00

Pos logit contributions
 and+3.793
,+3.758
.+3.557
 the+3.542
 a+3.366
Neg logit contributions
umm-4.589
lor-4.535
ocol-4.522
 former-4.512
arians-4.432
Token anything
Token ID1997
Feature activation+1.00e+00

Pos logit contributions
 and+4.064
,+3.968
.+3.795
 the+3.748
 a+3.585
Neg logit contributions
lor-4.377
umm-4.318
ocol-4.264
 former-4.230
arians-4.176
Token either
Token ID2035
Feature activation+1.00e+00

Pos logit contributions
 and+1.779
,+1.737
 the+1.538
.+1.530
 a+1.393
Neg logit contributions
lor-6.547
umm-6.529
 former-6.460
ocol-6.444
arians-6.411
Token you
Token ID345
Feature activation+1.00e+00

Pos logit contributions
 and+3.147
,+3.126
.+2.959
 the+2.914
 a+2.783
Neg logit contributions
lor-5.146
umm-5.134
arians-5.074
 former-5.059
ocol-5.030
Token talking
Token ID3375
Feature activation+1.00e+00

Pos logit contributions
 and+3.632
,+3.569
.+3.383
 the+3.378
 a+3.203
Neg logit contributions
lor-4.728
umm-4.678
 former-4.668
ocol-4.616
arians-4.584
Token,
Token ID11
Feature activation+1.00e+00

Pos logit contributions
 and+2.690
,+2.673
 the+2.471
.+2.462
 a+2.293
Neg logit contributions
lor-5.655
umm-5.621
 former-5.582
ocol-5.509
arians-5.499
Token table
Token ID3084
Feature activation+1.00e+00

Pos logit contributions
 and+3.617
,+3.607
.+3.444
 the+3.371
 a+3.223
Neg logit contributions
umm-4.713
lor-4.680
 former-4.671
ocol-4.623
arians-4.581
Token?"
Token ID1701
Feature activation+1.00e+00

Pos logit contributions
 and+2.056
,+2.020
 the+1.860
.+1.835
 a+1.686
Neg logit contributions
umm-6.270
lor-6.266
 former-6.171
ocol-6.146
arians-6.102
Token The
Token ID383
Feature activation+1.00e+00

Pos logit contributions
 and+3.038
,+2.992
.+2.806
 the+2.737
 a+2.598
Neg logit contributions
lor-5.312
umm-5.295
 former-5.254
ocol-5.246
arians-5.169
Token table
Token ID3084
Feature activation+1.00e+00

Pos logit contributions
 and+4.103
,+4.077
.+3.873
 the+3.829
 a+3.657
Neg logit contributions
umm-4.311
lor-4.243
 former-4.229
ocol-4.228
arians-4.144
Token did
Token ID750
Feature activation+1.00e+00

Pos logit contributions
 and+3.665
,+3.632
.+3.453
 the+3.443
 a+3.274
Neg logit contributions
lor-4.683
umm-4.664
 former-4.567
ocol-4.556
arians-4.502
Token not
Token ID407
Feature activation+1.00e+00

Pos logit contributions
 and+2.268
,+2.221
.+2.029
 the+2.002
 a+1.829
Neg logit contributions
lor-6.089
umm-6.036
 former-6.032
ocol-5.985
arians-5.927
Token say
Token ID910
Feature activation+1.00e+00

Pos logit contributions
 and+3.793
,+3.758
.+3.557
 the+3.542
 a+3.366
Neg logit contributions
umm-4.589
lor-4.535
ocol-4.522
 former-4.512
arians-4.432
Token anything
Token ID1997
Feature activation+1.00e+00

Pos logit contributions
 and+4.064
,+3.968
.+3.795
 the+3.748
 a+3.585
Neg logit contributions
lor-4.377
umm-4.318
ocol-4.264
 former-4.230
arians-4.176
Token,
Token ID11
Feature activation+1.00e+00

Pos logit contributions
 and+2.690
,+2.673
 the+2.471
.+2.462
 a+2.293
Neg logit contributions
lor-5.655
umm-5.621
 former-5.582
ocol-5.509
arians-5.499
Token table
Token ID3084
Feature activation+1.00e+00

Pos logit contributions
 and+3.617
,+3.607
.+3.444
 the+3.371
 a+3.223
Neg logit contributions
umm-4.713
lor-4.680
 former-4.671
ocol-4.623
arians-4.581
Token?"
Token ID1701
Feature activation+1.00e+00

Pos logit contributions
 and+2.056
,+2.020
 the+1.860
.+1.835
 a+1.686
Neg logit contributions
umm-6.270
lor-6.266
 former-6.171
ocol-6.146
arians-6.102
Token The
Token ID383
Feature activation+1.00e+00

Pos logit contributions
 and+3.038
,+2.992
.+2.806
 the+2.737
 a+2.598
Neg logit contributions
lor-5.312
umm-5.295
 former-5.254
ocol-5.246
arians-5.169
Token table
Token ID3084
Feature activation+1.00e+00

Pos logit contributions
 and+4.103
,+4.077
.+3.873
 the+3.829
 a+3.657
Neg logit contributions
umm-4.311
lor-4.243
 former-4.229
ocol-4.228
arians-4.144
Token did
Token ID750
Feature activation+1.00e+00

Pos logit contributions
 and+3.665
,+3.632
.+3.453
 the+3.443
 a+3.274
Neg logit contributions
lor-4.683
umm-4.664
 former-4.567
ocol-4.556
arians-4.502
Token not
Token ID407
Feature activation+1.00e+00

Pos logit contributions
 and+2.268
,+2.221
.+2.029
 the+2.002
 a+1.829
Neg logit contributions
lor-6.089
umm-6.036
 former-6.032
ocol-5.985
arians-5.927
Token say
Token ID910
Feature activation+1.00e+00

Pos logit contributions
 and+3.793
,+3.758
.+3.557
 the+3.542
 a+3.366
Neg logit contributions
umm-4.589
lor-4.535
ocol-4.522
 former-4.512
arians-4.432
Token anything
Token ID1997
Feature activation+1.00e+00

Pos logit contributions
 and+4.064
,+3.968
.+3.795
 the+3.748
 a+3.585
Neg logit contributions
lor-4.377
umm-4.318
ocol-4.264
 former-4.230
arians-4.176
Token either
Token ID2035
Feature activation+1.00e+00

Pos logit contributions
 and+1.779
,+1.737
 the+1.538
.+1.530
 a+1.393
Neg logit contributions
lor-6.547
umm-6.529
 former-6.460
ocol-6.444
arians-6.411
Token.
Token ID13
Feature activation+1.00e+00

Pos logit contributions
 and+0.833
,+0.783
 the+0.612
.+0.580
 a+0.464
Neg logit contributions
lor-7.493
umm-7.479
ocol-7.387
 former-7.354
arians-7.321
Token table
Token ID3084
Feature activation+1.00e+00

Pos logit contributions
 and+3.617
,+3.607
.+3.444
 the+3.371
 a+3.223
Neg logit contributions
umm-4.713
lor-4.680
 former-4.671
ocol-4.623
arians-4.581
Token?"
Token ID1701
Feature activation+1.00e+00

Pos logit contributions
 and+2.056
,+2.020
 the+1.860
.+1.835
 a+1.686
Neg logit contributions
umm-6.270
lor-6.266
 former-6.171
ocol-6.146
arians-6.102
Token The
Token ID383
Feature activation+1.00e+00

Pos logit contributions
 and+3.038
,+2.992
.+2.806
 the+2.737
 a+2.598
Neg logit contributions
lor-5.312
umm-5.295
 former-5.254
ocol-5.246
arians-5.169
Token table
Token ID3084
Feature activation+1.00e+00

Pos logit contributions
 and+4.103
,+4.077
.+3.873
 the+3.829
 a+3.657
Neg logit contributions
umm-4.311
lor-4.243
 former-4.229
ocol-4.228
arians-4.144
Token did
Token ID750
Feature activation+1.00e+00

Pos logit contributions
 and+3.665
,+3.632
.+3.453
 the+3.443
 a+3.274
Neg logit contributions
lor-4.683
umm-4.664
 former-4.567
ocol-4.556
arians-4.502
Token not
Token ID407
Feature activation+1.00e+00

Pos logit contributions
 and+2.268
,+2.221
.+2.029
 the+2.002
 a+1.829
Neg logit contributions
lor-6.089
umm-6.036
 former-6.032
ocol-5.985
arians-5.927
Token say
Token ID910
Feature activation+1.00e+00

Pos logit contributions
 and+3.793
,+3.758
.+3.557
 the+3.542
 a+3.366
Neg logit contributions
umm-4.589
lor-4.535
ocol-4.522
 former-4.512
arians-4.432
Token anything
Token ID1997
Feature activation+1.00e+00

Pos logit contributions
 and+4.064
,+3.968
.+3.795
 the+3.748
 a+3.585
Neg logit contributions
lor-4.377
umm-4.318
ocol-4.264
 former-4.230
arians-4.176
Token either
Token ID2035
Feature activation+1.00e+00

Pos logit contributions
 and+1.779
,+1.737
 the+1.538
.+1.530
 a+1.393
Neg logit contributions
lor-6.547
umm-6.529
 former-6.460
ocol-6.444
arians-6.411
Token.
Token ID13
Feature activation+1.00e+00

Pos logit contributions
 and+0.833
,+0.783
 the+0.612
.+0.580
 a+0.464
Neg logit contributions
lor-7.493
umm-7.479
ocol-7.387
 former-7.354
arians-7.321
Token Tom
Token ID4186
Feature activation+1.00e+00

Pos logit contributions
 and+2.864
,+2.834
.+2.627
 the+2.577
 a+2.416
Neg logit contributions
lor-5.563
umm-5.531
 former-5.472
ocol-5.440
arians-5.389
Token either
Token ID2035
Feature activation+1.00e+00

Pos logit contributions
 and+1.779
,+1.737
 the+1.538
.+1.530
 a+1.393
Neg logit contributions
lor-6.547
umm-6.529
 former-6.460
ocol-6.444
arians-6.411
Token.
Token ID13
Feature activation+1.00e+00

Pos logit contributions
 and+0.833
,+0.783
 the+0.612
.+0.580
 a+0.464
Neg logit contributions
lor-7.493
umm-7.479
ocol-7.387
 former-7.354
arians-7.321
Token Tom
Token ID4186
Feature activation+1.00e+00

Pos logit contributions
 and+2.864
,+2.834
.+2.627
 the+2.577
 a+2.416
Neg logit contributions
lor-5.563
umm-5.531
 former-5.472
ocol-5.440
arians-5.389
Token was
Token ID373
Feature activation+1.00e+00

Pos logit contributions
,+2.908
 and+2.902
.+2.743
 the+2.710
 a+2.547
Neg logit contributions
lor-5.398
umm-5.341
 former-5.304
arians-5.269
ocol-5.231
Token scared
Token ID12008
Feature activation+1.00e+00

Pos logit contributions
 and+3.567
,+3.548
.+3.345
 the+3.276
 a+3.081
Neg logit contributions
lor-4.770
 former-4.758
umm-4.752
ocol-4.709
arians-4.641
Token.
Token ID13
Feature activation+1.00e+00

Pos logit contributions
 and+0.478
,+0.447
 the+0.272
.+0.243
 a+0.138
Neg logit contributions
umm-7.848
lor-7.766
ocol-7.746
 former-7.691
arians-7.646
Token
Token ID198
Feature activation+1.00e+00

Pos logit contributions
 and+2.992
,+2.955
.+2.740
 the+2.699
 a+2.534
Neg logit contributions
umm-5.440
lor-5.430
 former-5.347
ocol-5.304
arians-5.268
Token
Token ID198
Feature activation+1.00e+00

Pos logit contributions
 and+1.358
,+1.322
.+1.102
 the+1.061
 a+0.913
Neg logit contributions
lor-7.070
umm-6.971
 former-6.928
ocol-6.912
arians-6.845
TokenTom
Token ID13787
Feature activation+1.00e+00

Pos logit contributions
 and+3.613
,+3.572
.+3.374
 the+3.334
 a+3.174
Neg logit contributions
lor-4.887
umm-4.823
 former-4.745
ocol-4.730
arians-4.703
Token went
Token ID1816
Feature activation+1.00e+00

Pos logit contributions
,+2.868
 and+2.864
.+2.715
 the+2.684
 a+2.522
Neg logit contributions
lor-5.440
umm-5.354
arians-5.308
 former-5.307
ocol-5.243
Token to
Token ID284
Feature activation+1.00e+00

Pos logit contributions
 and+1.787
,+1.760
.+1.562
 the+1.528
 a+1.375
Neg logit contributions
lor-6.580
umm-6.527
 former-6.482
ocol-6.451
arians-6.428
Token anything
Token ID1997
Feature activation+1.00e+00

Pos logit contributions
 and+4.064
,+3.968
.+3.795
 the+3.748
 a+3.585
Neg logit contributions
lor-4.377
umm-4.318
ocol-4.264
 former-4.230
arians-4.176
Token either
Token ID2035
Feature activation+1.00e+00

Pos logit contributions
 and+1.779
,+1.737
 the+1.538
.+1.530
 a+1.393
Neg logit contributions
lor-6.547
umm-6.529
 former-6.460
ocol-6.444
arians-6.411
Token.
Token ID13
Feature activation+1.00e+00

Pos logit contributions
 and+0.833
,+0.783
 the+0.612
.+0.580
 a+0.464
Neg logit contributions
lor-7.493
umm-7.479
ocol-7.387
 former-7.354
arians-7.321
Token Tom
Token ID4186
Feature activation+1.00e+00

Pos logit contributions
 and+2.864
,+2.834
.+2.627
 the+2.577
 a+2.416
Neg logit contributions
lor-5.563
umm-5.531
 former-5.472
ocol-5.440
arians-5.389
Token was
Token ID373
Feature activation+1.00e+00

Pos logit contributions
,+2.908
 and+2.902
.+2.743
 the+2.710
 a+2.547
Neg logit contributions
lor-5.398
umm-5.341
 former-5.304
arians-5.269
ocol-5.231
Token scared
Token ID12008
Feature activation+1.00e+00

Pos logit contributions
 and+3.567
,+3.548
.+3.345
 the+3.276
 a+3.081
Neg logit contributions
lor-4.770
 former-4.758
umm-4.752
ocol-4.709
arians-4.641
Token.
Token ID13
Feature activation+1.00e+00

Pos logit contributions
 and+0.478
,+0.447
 the+0.272
.+0.243
 a+0.138
Neg logit contributions
umm-7.848
lor-7.766
ocol-7.746
 former-7.691
arians-7.646
Token
Token ID198
Feature activation+1.00e+00

Pos logit contributions
 and+2.992
,+2.955
.+2.740
 the+2.699
 a+2.534
Neg logit contributions
umm-5.440
lor-5.430
 former-5.347
ocol-5.304
arians-5.268
Token
Token ID198
Feature activation+1.00e+00

Pos logit contributions
 and+1.358
,+1.322
.+1.102
 the+1.061
 a+0.913
Neg logit contributions
lor-7.070
umm-6.971
 former-6.928
ocol-6.912
arians-6.845
TokenTom
Token ID13787
Feature activation+1.00e+00

Pos logit contributions
 and+3.613
,+3.572
.+3.374
 the+3.334
 a+3.174
Neg logit contributions
lor-4.887
umm-4.823
 former-4.745
ocol-4.730
arians-4.703
Token went
Token ID1816
Feature activation+1.00e+00

Pos logit contributions
,+2.868
 and+2.864
.+2.715
 the+2.684
 a+2.522
Neg logit contributions
lor-5.440
umm-5.354
arians-5.308
 former-5.307
ocol-5.243
Token not
Token ID407
Feature activation+1.00e+00

Pos logit contributions
 and+2.268
,+2.221
.+2.029
 the+2.002
 a+1.829
Neg logit contributions
lor-6.089
umm-6.036
 former-6.032
ocol-5.985
arians-5.927
Token say
Token ID910
Feature activation+1.00e+00

Pos logit contributions
 and+3.793
,+3.758
.+3.557
 the+3.542
 a+3.366
Neg logit contributions
umm-4.589
lor-4.535
ocol-4.522
 former-4.512
arians-4.432
Token anything
Token ID1997
Feature activation+1.00e+00

Pos logit contributions
 and+4.064
,+3.968
.+3.795
 the+3.748
 a+3.585
Neg logit contributions
lor-4.377
umm-4.318
ocol-4.264
 former-4.230
arians-4.176
Token either
Token ID2035
Feature activation+1.00e+00

Pos logit contributions
 and+1.779
,+1.737
 the+1.538
.+1.530
 a+1.393
Neg logit contributions
lor-6.547
umm-6.529
 former-6.460
ocol-6.444
arians-6.411
Token.
Token ID13
Feature activation+1.00e+00

Pos logit contributions
 and+0.833
,+0.783
 the+0.612
.+0.580
 a+0.464
Neg logit contributions
lor-7.493
umm-7.479
ocol-7.387
 former-7.354
arians-7.321
Token Tom
Token ID4186
Feature activation+1.00e+00

Pos logit contributions
 and+2.864
,+2.834
.+2.627
 the+2.577
 a+2.416
Neg logit contributions
lor-5.563
umm-5.531
 former-5.472
ocol-5.440
arians-5.389
Token was
Token ID373
Feature activation+1.00e+00

Pos logit contributions
,+2.908
 and+2.902
.+2.743
 the+2.710
 a+2.547
Neg logit contributions
lor-5.398
umm-5.341
 former-5.304
arians-5.269
ocol-5.231
Token scared
Token ID12008
Feature activation+1.00e+00

Pos logit contributions
 and+3.567
,+3.548
.+3.345
 the+3.276
 a+3.081
Neg logit contributions
lor-4.770
 former-4.758
umm-4.752
ocol-4.709
arians-4.641
Token.
Token ID13
Feature activation+1.00e+00

Pos logit contributions
 and+0.478
,+0.447
 the+0.272
.+0.243
 a+0.138
Neg logit contributions
umm-7.848
lor-7.766
ocol-7.746
 former-7.691
arians-7.646
Token
Token ID198
Feature activation+1.00e+00

Pos logit contributions
 and+2.992
,+2.955
.+2.740
 the+2.699
 a+2.534
Neg logit contributions
umm-5.440
lor-5.430
 former-5.347
ocol-5.304
arians-5.268
Token
Token ID198
Feature activation+1.00e+00

Pos logit contributions
 and+1.358
,+1.322
.+1.102
 the+1.061
 a+0.913
Neg logit contributions
lor-7.070
umm-6.971
 former-6.928
ocol-6.912
arians-6.845
Token say
Token ID910
Feature activation+1.00e+00

Pos logit contributions
 and+3.793
,+3.758
.+3.557
 the+3.542
 a+3.366
Neg logit contributions
umm-4.589
lor-4.535
ocol-4.522
 former-4.512
arians-4.432
Token anything
Token ID1997
Feature activation+1.00e+00

Pos logit contributions
 and+4.064
,+3.968
.+3.795
 the+3.748
 a+3.585
Neg logit contributions
lor-4.377
umm-4.318
ocol-4.264
 former-4.230
arians-4.176
Token either
Token ID2035
Feature activation+1.00e+00

Pos logit contributions
 and+1.779
,+1.737
 the+1.538
.+1.530
 a+1.393
Neg logit contributions
lor-6.547
umm-6.529
 former-6.460
ocol-6.444
arians-6.411
Token.
Token ID13
Feature activation+1.00e+00

Pos logit contributions
 and+0.833
,+0.783
 the+0.612
.+0.580
 a+0.464
Neg logit contributions
lor-7.493
umm-7.479
ocol-7.387
 former-7.354
arians-7.321
Token Tom
Token ID4186
Feature activation+1.00e+00

Pos logit contributions
 and+2.864
,+2.834
.+2.627
 the+2.577
 a+2.416
Neg logit contributions
lor-5.563
umm-5.531
 former-5.472
ocol-5.440
arians-5.389
Token was
Token ID373
Feature activation+1.00e+00

Pos logit contributions
,+2.908
 and+2.902
.+2.743
 the+2.710
 a+2.547
Neg logit contributions
lor-5.398
umm-5.341
 former-5.304
arians-5.269
ocol-5.231
Token scared
Token ID12008
Feature activation+1.00e+00

Pos logit contributions
 and+3.567
,+3.548
.+3.345
 the+3.276
 a+3.081
Neg logit contributions
lor-4.770
 former-4.758
umm-4.752
ocol-4.709
arians-4.641
Token.
Token ID13
Feature activation+1.00e+00

Pos logit contributions
 and+0.478
,+0.447
 the+0.272
.+0.243
 a+0.138
Neg logit contributions
umm-7.848
lor-7.766
ocol-7.746
 former-7.691
arians-7.646
Token
Token ID198
Feature activation+1.00e+00

Pos logit contributions
 and+2.992
,+2.955
.+2.740
 the+2.699
 a+2.534
Neg logit contributions
umm-5.440
lor-5.430
 former-5.347
ocol-5.304
arians-5.268
Token
Token ID198
Feature activation+1.00e+00

Pos logit contributions
 and+1.358
,+1.322
.+1.102
 the+1.061
 a+0.913
Neg logit contributions
lor-7.070
umm-6.971
 former-6.928
ocol-6.912
arians-6.845
TokenTom
Token ID13787
Feature activation+1.00e+00

Pos logit contributions
 and+3.613
,+3.572
.+3.374
 the+3.334
 a+3.174
Neg logit contributions
lor-4.887
umm-4.823
 former-4.745
ocol-4.730
arians-4.703
Token Tom
Token ID4186
Feature activation+1.00e+00

Pos logit contributions
 and+2.864
,+2.834
.+2.627
 the+2.577
 a+2.416
Neg logit contributions
lor-5.563
umm-5.531
 former-5.472
ocol-5.440
arians-5.389
Token was
Token ID373
Feature activation+1.00e+00

Pos logit contributions
,+2.908
 and+2.902
.+2.743
 the+2.710
 a+2.547
Neg logit contributions
lor-5.398
umm-5.341
 former-5.304
arians-5.269
ocol-5.231
Token scared
Token ID12008
Feature activation+1.00e+00

Pos logit contributions
 and+3.567
,+3.548
.+3.345
 the+3.276
 a+3.081
Neg logit contributions
lor-4.770
 former-4.758
umm-4.752
ocol-4.709
arians-4.641
Token.
Token ID13
Feature activation+1.00e+00

Pos logit contributions
 and+0.478
,+0.447
 the+0.272
.+0.243
 a+0.138
Neg logit contributions
umm-7.848
lor-7.766
ocol-7.746
 former-7.691
arians-7.646
Token
Token ID198
Feature activation+1.00e+00

Pos logit contributions
 and+2.992
,+2.955
.+2.740
 the+2.699
 a+2.534
Neg logit contributions
umm-5.440
lor-5.430
 former-5.347
ocol-5.304
arians-5.268
Token
Token ID198
Feature activation+1.00e+00

Pos logit contributions
 and+1.358
,+1.322
.+1.102
 the+1.061
 a+0.913
Neg logit contributions
lor-7.070
umm-6.971
 former-6.928
ocol-6.912
arians-6.845
TokenTom
Token ID13787
Feature activation+1.00e+00

Pos logit contributions
 and+3.613
,+3.572
.+3.374
 the+3.334
 a+3.174
Neg logit contributions
lor-4.887
umm-4.823
 former-4.745
ocol-4.730
arians-4.703
Token went
Token ID1816
Feature activation+1.00e+00

Pos logit contributions
,+2.868
 and+2.864
.+2.715
 the+2.684
 a+2.522
Neg logit contributions
lor-5.440
umm-5.354
arians-5.308
 former-5.307
ocol-5.243
Token to
Token ID284
Feature activation+1.00e+00

Pos logit contributions
 and+1.787
,+1.760
.+1.562
 the+1.528
 a+1.375
Neg logit contributions
lor-6.580
umm-6.527
 former-6.482
ocol-6.451
arians-6.428
Token his
Token ID465
Feature activation+1.00e+00

Pos logit contributions
 and+2.282
,+2.249
.+2.041
 the+1.910
 a+1.790
Neg logit contributions
lor-6.056
umm-6.045
 former-6.040
ocol-6.032
arians-5.941
Token bed
Token ID3996
Feature activation+1.00e+00

Pos logit contributions
 and+3.694
,+3.665
.+3.474
 the+3.445
 a+3.274
Neg logit contributions
umm-4.693
 former-4.681
lor-4.647
arians-4.565
ocol-4.536
Token.
Token ID13
Feature activation+1.00e+00

Pos logit contributions
 and+0.833
,+0.783
 the+0.612
.+0.580
 a+0.464
Neg logit contributions
lor-7.493
umm-7.479
ocol-7.387
 former-7.354
arians-7.321
Token Tom
Token ID4186
Feature activation+1.00e+00

Pos logit contributions
 and+2.864
,+2.834
.+2.627
 the+2.577
 a+2.416
Neg logit contributions
lor-5.563
umm-5.531
 former-5.472
ocol-5.440
arians-5.389
Token was
Token ID373
Feature activation+1.00e+00

Pos logit contributions
,+2.908
 and+2.902
.+2.743
 the+2.710
 a+2.547
Neg logit contributions
lor-5.398
umm-5.341
 former-5.304
arians-5.269
ocol-5.231
Token scared
Token ID12008
Feature activation+1.00e+00

Pos logit contributions
 and+3.567
,+3.548
.+3.345
 the+3.276
 a+3.081
Neg logit contributions
lor-4.770
 former-4.758
umm-4.752
ocol-4.709
arians-4.641
Token.
Token ID13
Feature activation+1.00e+00

Pos logit contributions
 and+0.478
,+0.447
 the+0.272
.+0.243
 a+0.138
Neg logit contributions
umm-7.848
lor-7.766
ocol-7.746
 former-7.691
arians-7.646
Token
Token ID198
Feature activation+1.00e+00

Pos logit contributions
 and+2.992
,+2.955
.+2.740
 the+2.699
 a+2.534
Neg logit contributions
umm-5.440
lor-5.430
 former-5.347
ocol-5.304
arians-5.268
Token
Token ID198
Feature activation+1.00e+00

Pos logit contributions
 and+1.358
,+1.322
.+1.102
 the+1.061
 a+0.913
Neg logit contributions
lor-7.070
umm-6.971
 former-6.928
ocol-6.912
arians-6.845
TokenTom
Token ID13787
Feature activation+1.00e+00

Pos logit contributions
 and+3.613
,+3.572
.+3.374
 the+3.334
 a+3.174
Neg logit contributions
lor-4.887
umm-4.823
 former-4.745
ocol-4.730
arians-4.703
Token went
Token ID1816
Feature activation+1.00e+00

Pos logit contributions
,+2.868
 and+2.864
.+2.715
 the+2.684
 a+2.522
Neg logit contributions
lor-5.440
umm-5.354
arians-5.308
 former-5.307
ocol-5.243
Token to
Token ID284
Feature activation+1.00e+00

Pos logit contributions
 and+1.787
,+1.760
.+1.562
 the+1.528
 a+1.375
Neg logit contributions
lor-6.580
umm-6.527
 former-6.482
ocol-6.451
arians-6.428
Token his
Token ID465
Feature activation+1.00e+00

Pos logit contributions
 and+2.282
,+2.249
.+2.041
 the+1.910
 a+1.790
Neg logit contributions
lor-6.056
umm-6.045
 former-6.040
ocol-6.032
arians-5.941
Token was
Token ID373
Feature activation+1.00e+00

Pos logit contributions
,+2.908
 and+2.902
.+2.743
 the+2.710
 a+2.547
Neg logit contributions
lor-5.398
umm-5.341
 former-5.304
arians-5.269
ocol-5.231
Token scared
Token ID12008
Feature activation+1.00e+00

Pos logit contributions
 and+3.567
,+3.548
.+3.345
 the+3.276
 a+3.081
Neg logit contributions
lor-4.770
 former-4.758
umm-4.752
ocol-4.709
arians-4.641
Token.
Token ID13
Feature activation+1.00e+00

Pos logit contributions
 and+0.478
,+0.447
 the+0.272
.+0.243
 a+0.138
Neg logit contributions
umm-7.848
lor-7.766
ocol-7.746
 former-7.691
arians-7.646
Token
Token ID198
Feature activation+1.00e+00

Pos logit contributions
 and+2.992
,+2.955
.+2.740
 the+2.699
 a+2.534
Neg logit contributions
umm-5.440
lor-5.430
 former-5.347
ocol-5.304
arians-5.268
Token
Token ID198
Feature activation+1.00e+00

Pos logit contributions
 and+1.358
,+1.322
.+1.102
 the+1.061
 a+0.913
Neg logit contributions
lor-7.070
umm-6.971
 former-6.928
ocol-6.912
arians-6.845
TokenTom
Token ID13787
Feature activation+1.00e+00

Pos logit contributions
 and+3.613
,+3.572
.+3.374
 the+3.334
 a+3.174
Neg logit contributions
lor-4.887
umm-4.823
 former-4.745
ocol-4.730
arians-4.703
Token went
Token ID1816
Feature activation+1.00e+00

Pos logit contributions
,+2.868
 and+2.864
.+2.715
 the+2.684
 a+2.522
Neg logit contributions
lor-5.440
umm-5.354
arians-5.308
 former-5.307
ocol-5.243
Token to
Token ID284
Feature activation+1.00e+00

Pos logit contributions
 and+1.787
,+1.760
.+1.562
 the+1.528
 a+1.375
Neg logit contributions
lor-6.580
umm-6.527
 former-6.482
ocol-6.451
arians-6.428
Token his
Token ID465
Feature activation+1.00e+00

Pos logit contributions
 and+2.282
,+2.249
.+2.041
 the+1.910
 a+1.790
Neg logit contributions
lor-6.056
umm-6.045
 former-6.040
ocol-6.032
arians-5.941
Token bed
Token ID3996
Feature activation+1.00e+00

Pos logit contributions
 and+3.694
,+3.665
.+3.474
 the+3.445
 a+3.274
Neg logit contributions
umm-4.693
 former-4.681
lor-4.647
arians-4.565
ocol-4.536
Token and
Token ID290
Feature activation+1.00e+00

Pos logit contributions
 and+1.096
,+1.073
 the+0.906
.+0.874
 a+0.735
Neg logit contributions
lor-7.260
umm-7.214
 former-7.169
arians-7.074
ocol-7.051
Token scared
Token ID12008
Feature activation+1.00e+00

Pos logit contributions
 and+3.567
,+3.548
.+3.345
 the+3.276
 a+3.081
Neg logit contributions
lor-4.770
 former-4.758
umm-4.752
ocol-4.709
arians-4.641
Token.
Token ID13
Feature activation+1.00e+00

Pos logit contributions
 and+0.478
,+0.447
 the+0.272
.+0.243
 a+0.138
Neg logit contributions
umm-7.848
lor-7.766
ocol-7.746
 former-7.691
arians-7.646
Token
Token ID198
Feature activation+1.00e+00

Pos logit contributions
 and+2.992
,+2.955
.+2.740
 the+2.699
 a+2.534
Neg logit contributions
umm-5.440
lor-5.430
 former-5.347
ocol-5.304
arians-5.268
Token
Token ID198
Feature activation+1.00e+00

Pos logit contributions
 and+1.358
,+1.322
.+1.102
 the+1.061
 a+0.913
Neg logit contributions
lor-7.070
umm-6.971
 former-6.928
ocol-6.912
arians-6.845
TokenTom
Token ID13787
Feature activation+1.00e+00

Pos logit contributions
 and+3.613
,+3.572
.+3.374
 the+3.334
 a+3.174
Neg logit contributions
lor-4.887
umm-4.823
 former-4.745
ocol-4.730
arians-4.703
Token went
Token ID1816
Feature activation+1.00e+00

Pos logit contributions
,+2.868
 and+2.864
.+2.715
 the+2.684
 a+2.522
Neg logit contributions
lor-5.440
umm-5.354
arians-5.308
 former-5.307
ocol-5.243
Token to
Token ID284
Feature activation+1.00e+00

Pos logit contributions
 and+1.787
,+1.760
.+1.562
 the+1.528
 a+1.375
Neg logit contributions
lor-6.580
umm-6.527
 former-6.482
ocol-6.451
arians-6.428
Token his
Token ID465
Feature activation+1.00e+00

Pos logit contributions
 and+2.282
,+2.249
.+2.041
 the+1.910
 a+1.790
Neg logit contributions
lor-6.056
umm-6.045
 former-6.040
ocol-6.032
arians-5.941
Token bed
Token ID3996
Feature activation+1.00e+00

Pos logit contributions
 and+3.694
,+3.665
.+3.474
 the+3.445
 a+3.274
Neg logit contributions
umm-4.693
 former-4.681
lor-4.647
arians-4.565
ocol-4.536
Token and
Token ID290
Feature activation+1.00e+00

Pos logit contributions
 and+1.096
,+1.073
 the+0.906
.+0.874
 a+0.735
Neg logit contributions
lor-7.260
umm-7.214
 former-7.169
arians-7.074
ocol-7.051
Token hid
Token ID24519
Feature activation+1.00e+00

Pos logit contributions
 and+3.485
,+3.462
.+3.277
 the+3.180
 a+3.024
Neg logit contributions
lor-4.871
 former-4.818
umm-4.776
ocol-4.726
arians-4.710
Token did
Token ID750
Feature activation+1.00e+00

Pos logit contributions
 and+3.665
,+3.632
.+3.453
 the+3.443
 a+3.274
Neg logit contributions
lor-4.683
umm-4.664
 former-4.567
ocol-4.556
arians-4.502
Token not
Token ID407
Feature activation+1.00e+00

Pos logit contributions
 and+2.268
,+2.221
.+2.029
 the+2.002
 a+1.829
Neg logit contributions
lor-6.089
umm-6.036
 former-6.032
ocol-5.985
arians-5.927
Token say
Token ID910
Feature activation+1.00e+00

Pos logit contributions
 and+3.793
,+3.758
.+3.557
 the+3.542
 a+3.366
Neg logit contributions
umm-4.589
lor-4.535
ocol-4.522
 former-4.512
arians-4.432
Token anything
Token ID1997
Feature activation+1.00e+00

Pos logit contributions
 and+4.064
,+3.968
.+3.795
 the+3.748
 a+3.585
Neg logit contributions
lor-4.377
umm-4.318
ocol-4.264
 former-4.230
arians-4.176
Token either
Token ID2035
Feature activation+1.00e+00

Pos logit contributions
 and+1.779
,+1.737
 the+1.538
.+1.530
 a+1.393
Neg logit contributions
lor-6.547
umm-6.529
 former-6.460
ocol-6.444
arians-6.411
Token.
Token ID13
Feature activation+1.00e+00

Pos logit contributions
 and+0.833
,+0.783
 the+0.612
.+0.580
 a+0.464
Neg logit contributions
lor-7.493
umm-7.479
ocol-7.387
 former-7.354
arians-7.321
Token Tom
Token ID4186
Feature activation+1.00e+00

Pos logit contributions
 and+2.864
,+2.834
.+2.627
 the+2.577
 a+2.416
Neg logit contributions
lor-5.563
umm-5.531
 former-5.472
ocol-5.440
arians-5.389
Token was
Token ID373
Feature activation+1.00e+00

Pos logit contributions
,+2.908
 and+2.902
.+2.743
 the+2.710
 a+2.547
Neg logit contributions
lor-5.398
umm-5.341
 former-5.304
arians-5.269
ocol-5.231
Token scared
Token ID12008
Feature activation+1.00e+00

Pos logit contributions
 and+3.567
,+3.548
.+3.345
 the+3.276
 a+3.081
Neg logit contributions
lor-4.770
 former-4.758
umm-4.752
ocol-4.709
arians-4.641
Token.
Token ID13
Feature activation+1.00e+00

Pos logit contributions
 and+0.478
,+0.447
 the+0.272
.+0.243
 a+0.138
Neg logit contributions
umm-7.848
lor-7.766
ocol-7.746
 former-7.691
arians-7.646
Token
Token ID198
Feature activation+1.00e+00

Pos logit contributions
 and+2.992
,+2.955
.+2.740
 the+2.699
 a+2.534
Neg logit contributions
umm-5.440
lor-5.430
 former-5.347
ocol-5.304
arians-5.268
Token table
Token ID3084
Feature activation+1.00e+00

Pos logit contributions
 and+4.103
,+4.077
.+3.873
 the+3.829
 a+3.657
Neg logit contributions
umm-4.311
lor-4.243
 former-4.229
ocol-4.228
arians-4.144
Token did
Token ID750
Feature activation+1.00e+00

Pos logit contributions
 and+3.665
,+3.632
.+3.453
 the+3.443
 a+3.274
Neg logit contributions
lor-4.683
umm-4.664
 former-4.567
ocol-4.556
arians-4.502
Token not
Token ID407
Feature activation+1.00e+00

Pos logit contributions
 and+2.268
,+2.221
.+2.029
 the+2.002
 a+1.829
Neg logit contributions
lor-6.089
umm-6.036
 former-6.032
ocol-5.985
arians-5.927
Token say
Token ID910
Feature activation+1.00e+00

Pos logit contributions
 and+3.793
,+3.758
.+3.557
 the+3.542
 a+3.366
Neg logit contributions
umm-4.589
lor-4.535
ocol-4.522
 former-4.512
arians-4.432
Token anything
Token ID1997
Feature activation+1.00e+00

Pos logit contributions
 and+4.064
,+3.968
.+3.795
 the+3.748
 a+3.585
Neg logit contributions
lor-4.377
umm-4.318
ocol-4.264
 former-4.230
arians-4.176
Token either
Token ID2035
Feature activation+1.00e+00

Pos logit contributions
 and+1.779
,+1.737
 the+1.538
.+1.530
 a+1.393
Neg logit contributions
lor-6.547
umm-6.529
 former-6.460
ocol-6.444
arians-6.411
Token.
Token ID13
Feature activation+1.00e+00

Pos logit contributions
 and+0.833
,+0.783
 the+0.612
.+0.580
 a+0.464
Neg logit contributions
lor-7.493
umm-7.479
ocol-7.387
 former-7.354
arians-7.321
Token Tom
Token ID4186
Feature activation+1.00e+00

Pos logit contributions
 and+2.864
,+2.834
.+2.627
 the+2.577
 a+2.416
Neg logit contributions
lor-5.563
umm-5.531
 former-5.472
ocol-5.440
arians-5.389
Token was
Token ID373
Feature activation+1.00e+00

Pos logit contributions
,+2.908
 and+2.902
.+2.743
 the+2.710
 a+2.547
Neg logit contributions
lor-5.398
umm-5.341
 former-5.304
arians-5.269
ocol-5.231
Token scared
Token ID12008
Feature activation+1.00e+00

Pos logit contributions
 and+3.567
,+3.548
.+3.345
 the+3.276
 a+3.081
Neg logit contributions
lor-4.770
 former-4.758
umm-4.752
ocol-4.709
arians-4.641
Token.
Token ID13
Feature activation+1.00e+00

Pos logit contributions
 and+0.478
,+0.447
 the+0.272
.+0.243
 a+0.138
Neg logit contributions
umm-7.848
lor-7.766
ocol-7.746
 former-7.691
arians-7.646
Token?"
Token ID1701
Feature activation+1.00e+00

Pos logit contributions
 and+2.056
,+2.020
 the+1.860
.+1.835
 a+1.686
Neg logit contributions
umm-6.270
lor-6.266
 former-6.171
ocol-6.146
arians-6.102
Token The
Token ID383
Feature activation+1.00e+00

Pos logit contributions
 and+3.038
,+2.992
.+2.806
 the+2.737
 a+2.598
Neg logit contributions
lor-5.312
umm-5.295
 former-5.254
ocol-5.246
arians-5.169
Token table
Token ID3084
Feature activation+1.00e+00

Pos logit contributions
 and+4.103
,+4.077
.+3.873
 the+3.829
 a+3.657
Neg logit contributions
umm-4.311
lor-4.243
 former-4.229
ocol-4.228
arians-4.144
Token did
Token ID750
Feature activation+1.00e+00

Pos logit contributions
 and+3.665
,+3.632
.+3.453
 the+3.443
 a+3.274
Neg logit contributions
lor-4.683
umm-4.664
 former-4.567
ocol-4.556
arians-4.502
Token not
Token ID407
Feature activation+1.00e+00

Pos logit contributions
 and+2.268
,+2.221
.+2.029
 the+2.002
 a+1.829
Neg logit contributions
lor-6.089
umm-6.036
 former-6.032
ocol-5.985
arians-5.927
Token say
Token ID910
Feature activation+1.00e+00

Pos logit contributions
 and+3.793
,+3.758
.+3.557
 the+3.542
 a+3.366
Neg logit contributions
umm-4.589
lor-4.535
ocol-4.522
 former-4.512
arians-4.432
Token anything
Token ID1997
Feature activation+1.00e+00

Pos logit contributions
 and+4.064
,+3.968
.+3.795
 the+3.748
 a+3.585
Neg logit contributions
lor-4.377
umm-4.318
ocol-4.264
 former-4.230
arians-4.176
Token either
Token ID2035
Feature activation+1.00e+00

Pos logit contributions
 and+1.779
,+1.737
 the+1.538
.+1.530
 a+1.393
Neg logit contributions
lor-6.547
umm-6.529
 former-6.460
ocol-6.444
arians-6.411
Token.
Token ID13
Feature activation+1.00e+00

Pos logit contributions
 and+0.833
,+0.783
 the+0.612
.+0.580
 a+0.464
Neg logit contributions
lor-7.493
umm-7.479
ocol-7.387
 former-7.354
arians-7.321
Token Tom
Token ID4186
Feature activation+1.00e+00

Pos logit contributions
 and+2.864
,+2.834
.+2.627
 the+2.577
 a+2.416
Neg logit contributions
lor-5.563
umm-5.531
 former-5.472
ocol-5.440
arians-5.389
Token was
Token ID373
Feature activation+1.00e+00

Pos logit contributions
,+2.908
 and+2.902
.+2.743
 the+2.710
 a+2.547
Neg logit contributions
lor-5.398
umm-5.341
 former-5.304
arians-5.269
ocol-5.231
Token The
Token ID383
Feature activation+1.00e+00

Pos logit contributions
 and+3.038
,+2.992
.+2.806
 the+2.737
 a+2.598
Neg logit contributions
lor-5.312
umm-5.295
 former-5.254
ocol-5.246
arians-5.169
Token table
Token ID3084
Feature activation+1.00e+00

Pos logit contributions
 and+4.103
,+4.077
.+3.873
 the+3.829
 a+3.657
Neg logit contributions
umm-4.311
lor-4.243
 former-4.229
ocol-4.228
arians-4.144
Token did
Token ID750
Feature activation+1.00e+00

Pos logit contributions
 and+3.665
,+3.632
.+3.453
 the+3.443
 a+3.274
Neg logit contributions
lor-4.683
umm-4.664
 former-4.567
ocol-4.556
arians-4.502
Token not
Token ID407
Feature activation+1.00e+00

Pos logit contributions
 and+2.268
,+2.221
.+2.029
 the+2.002
 a+1.829
Neg logit contributions
lor-6.089
umm-6.036
 former-6.032
ocol-5.985
arians-5.927
Token say
Token ID910
Feature activation+1.00e+00

Pos logit contributions
 and+3.793
,+3.758
.+3.557
 the+3.542
 a+3.366
Neg logit contributions
umm-4.589
lor-4.535
ocol-4.522
 former-4.512
arians-4.432
Token anything
Token ID1997
Feature activation+1.00e+00

Pos logit contributions
 and+4.064
,+3.968
.+3.795
 the+3.748
 a+3.585
Neg logit contributions
lor-4.377
umm-4.318
ocol-4.264
 former-4.230
arians-4.176
Token either
Token ID2035
Feature activation+1.00e+00

Pos logit contributions
 and+1.779
,+1.737
 the+1.538
.+1.530
 a+1.393
Neg logit contributions
lor-6.547
umm-6.529
 former-6.460
ocol-6.444
arians-6.411
Token.
Token ID13
Feature activation+1.00e+00

Pos logit contributions
 and+0.833
,+0.783
 the+0.612
.+0.580
 a+0.464
Neg logit contributions
lor-7.493
umm-7.479
ocol-7.387
 former-7.354
arians-7.321
Token Tom
Token ID4186
Feature activation+1.00e+00

Pos logit contributions
 and+2.864
,+2.834
.+2.627
 the+2.577
 a+2.416
Neg logit contributions
lor-5.563
umm-5.531
 former-5.472
ocol-5.440
arians-5.389
Token was
Token ID373
Feature activation+1.00e+00

Pos logit contributions
,+2.908
 and+2.902
.+2.743
 the+2.710
 a+2.547
Neg logit contributions
lor-5.398
umm-5.341
 former-5.304
arians-5.269
ocol-5.231
Token scared
Token ID12008
Feature activation+1.00e+00

Pos logit contributions
 and+3.567
,+3.548
.+3.345
 the+3.276
 a+3.081
Neg logit contributions
lor-4.770
 former-4.758
umm-4.752
ocol-4.709
arians-4.641