TOP ACTIVATIONS
MAX = 1.000

Tom went to his bed and hid under the covers .
Tom went to his bed and hid under the covers
. Tom went to his bed and hid under
Tom went to his bed and hid under the
talking , table ?" The table did not say anything either
you talking , table ?" The table did not say anything
, table ?" The table did not say anything either .
table ?" The table did not say anything either . Tom
either . Tom was scared . Tom went to
anything either . Tom was scared . Tom went
not say anything either . Tom was scared .
say anything either . Tom was scared . Tom
Tom was scared . Tom went to his bed
. Tom was scared . Tom went to his
was scared . Tom went to his bed and
scared . Tom went to his bed and hid
did not say anything either . Tom was scared .
table did not say anything either . Tom was scared .
?" The table did not say anything either . Tom was
The table did not say anything either . Tom was scared

MIN ACTIVATIONS
MIN = 1.000

Tom went to his bed and hid under the covers .
Tom went to his bed and hid under the covers
. Tom went to his bed and hid under
Tom went to his bed and hid under the
talking , table ?" The table did not say anything either
you talking , table ?" The table did not say anything
, table ?" The table did not say anything either .
table ?" The table did not say anything either . Tom
either . Tom was scared . Tom went to
anything either . Tom was scared . Tom went
not say anything either . Tom was scared .
say anything either . Tom was scared . Tom
Tom was scared . Tom went to his bed
. Tom was scared . Tom went to his
was scared . Tom went to his bed and
scared . Tom went to his bed and hid
did not say anything either . Tom was scared .
table did not say anything either . Tom was scared .
?" The table did not say anything either . Tom was
The table did not say anything either . Tom was scared

INTERVAL 1.000 - 1.000
CONTAINS 0.000%

TokenTom
Token ID13787
Feature activation+1.00e+00

Pos logit contributions
 and+3.683
,+3.615
.+3.452
 the+3.444
 a+3.212
Neg logit contributions
lor-5.003
umm-4.859
arians-4.809
 former-4.796
bek-4.730
Token went
Token ID1816
Feature activation+1.00e+00

Pos logit contributions
 and+2.920
,+2.892
 the+2.770
.+2.769
 a+2.536
Neg logit contributions
lor-5.587
arians-5.442
umm-5.432
 former-5.394
osity-5.307
Token to
Token ID284
Feature activation+1.00e+00

Pos logit contributions
 and+1.828
,+1.777
 the+1.612
.+1.610
 a+1.389
Neg logit contributions
lor-6.703
umm-6.554
arians-6.538
 former-6.535
rol-6.454
Token his
Token ID465
Feature activation+1.00e+00

Pos logit contributions
 and+2.346
,+2.285
.+2.112
 the+2.018
 a+1.824
Neg logit contributions
lor-6.184
 former-6.099
umm-6.093
arians-6.057
ocol-6.025
Token bed
Token ID3996
Feature activation+1.00e+00

Pos logit contributions
 and+3.820
,+3.762
 the+3.609
.+3.606
 a+3.367
Neg logit contributions
lor-4.715
umm-4.678
 former-4.675
arians-4.618
ocol-4.473
Token and
Token ID290
Feature activation+1.00e+00

Pos logit contributions
 and+1.127
,+1.079
 the+0.991
.+0.912
 a+0.746
Neg logit contributions
lor-7.402
umm-7.265
 former-7.243
arians-7.198
osity-7.076
Token hid
Token ID24519
Feature activation+1.00e+00

Pos logit contributions
 and+3.565
,+3.514
.+3.363
 the+3.304
 a+3.075
Neg logit contributions
lor-4.983
 former-4.861
umm-4.815
arians-4.811
osity-4.710
Token under
Token ID739
Feature activation+1.00e+00

Pos logit contributions
 and+2.961
,+2.916
 the+2.690
.+2.689
 a+2.492
Neg logit contributions
lor-5.536
umm-5.473
ocol-5.414
arians-5.409
 former-5.398
Token the
Token ID262
Feature activation+1.00e+00

Pos logit contributions
 and+1.476
,+1.413
.+1.229
 the+1.136
 a+0.913
Neg logit contributions
lor-7.066
umm-6.989
 former-6.947
ocol-6.920
arians-6.883
Token covers
Token ID8698
Feature activation+1.00e+00

Pos logit contributions
 and+4.460
,+4.419
 the+4.259
.+4.233
 a+4.042
Neg logit contributions
lor-4.081
umm-4.021
 former-3.920
ocol-3.888
arians-3.887
Token.
Token ID13
Feature activation+1.00e+00

Pos logit contributions
 and+0.748
,+0.696
 the+0.537
.+0.477
 a+0.335
Neg logit contributions
lor-7.807
umm-7.699
arians-7.581
 former-7.579
ocol-7.540
Token
Token ID198
Feature activation+1.00e+00

Pos logit contributions
 and+1.372
,+1.309
.+1.121
 the+1.113
 a+0.894
Neg logit contributions
lor-7.241
umm-7.056
 former-7.029
arians-7.001
ocol-6.948
TokenTom
Token ID13787
Feature activation+1.00e+00

Pos logit contributions
 and+3.683
,+3.615
.+3.452
 the+3.444
 a+3.212
Neg logit contributions
lor-5.003
umm-4.859
arians-4.809
 former-4.796
bek-4.730
Token went
Token ID1816
Feature activation+1.00e+00

Pos logit contributions
 and+2.920
,+2.892
 the+2.770
.+2.769
 a+2.536
Neg logit contributions
lor-5.587
arians-5.442
umm-5.432
 former-5.394
osity-5.307
Token to
Token ID284
Feature activation+1.00e+00

Pos logit contributions
 and+1.828
,+1.777
 the+1.612
.+1.610
 a+1.389
Neg logit contributions
lor-6.703
umm-6.554
arians-6.538
 former-6.535
rol-6.454
Token his
Token ID465
Feature activation+1.00e+00

Pos logit contributions
 and+2.346
,+2.285
.+2.112
 the+2.018
 a+1.824
Neg logit contributions
lor-6.184
 former-6.099
umm-6.093
arians-6.057
ocol-6.025
Token bed
Token ID3996
Feature activation+1.00e+00

Pos logit contributions
 and+3.820
,+3.762
 the+3.609
.+3.606
 a+3.367
Neg logit contributions
lor-4.715
umm-4.678
 former-4.675
arians-4.618
ocol-4.473
Token and
Token ID290
Feature activation+1.00e+00

Pos logit contributions
 and+1.127
,+1.079
 the+0.991
.+0.912
 a+0.746
Neg logit contributions
lor-7.402
umm-7.265
 former-7.243
arians-7.198
osity-7.076
Token hid
Token ID24519
Feature activation+1.00e+00

Pos logit contributions
 and+3.565
,+3.514
.+3.363
 the+3.304
 a+3.075
Neg logit contributions
lor-4.983
 former-4.861
umm-4.815
arians-4.811
osity-4.710
Token under
Token ID739
Feature activation+1.00e+00

Pos logit contributions
 and+2.961
,+2.916
 the+2.690
.+2.689
 a+2.492
Neg logit contributions
lor-5.536
umm-5.473
ocol-5.414
arians-5.409
 former-5.398
Token the
Token ID262
Feature activation+1.00e+00

Pos logit contributions
 and+1.476
,+1.413
.+1.229
 the+1.136
 a+0.913
Neg logit contributions
lor-7.066
umm-6.989
 former-6.947
ocol-6.920
arians-6.883
Token covers
Token ID8698
Feature activation+1.00e+00

Pos logit contributions
 and+4.460
,+4.419
 the+4.259
.+4.233
 a+4.042
Neg logit contributions
lor-4.081
umm-4.021
 former-3.920
ocol-3.888
arians-3.887
Token.
Token ID13
Feature activation+1.00e+00

Pos logit contributions
 and+0.464
,+0.409
 the+0.320
.+0.233
 a+0.123
Neg logit contributions
umm-7.913
lor-7.888
arians-7.766
ocol-7.763
 former-7.749
Token
Token ID198
Feature activation+1.00e+00

Pos logit contributions
 and+3.015
,+2.951
.+2.771
 the+2.762
 a+2.526
Neg logit contributions
lor-5.593
umm-5.522
 former-5.444
arians-5.420
osity-5.352
Token
Token ID198
Feature activation+1.00e+00

Pos logit contributions
 and+1.372
,+1.309
.+1.121
 the+1.113
 a+0.894
Neg logit contributions
lor-7.241
umm-7.056
 former-7.029
arians-7.001
ocol-6.948
TokenTom
Token ID13787
Feature activation+1.00e+00

Pos logit contributions
 and+3.683
,+3.615
.+3.452
 the+3.444
 a+3.212
Neg logit contributions
lor-5.003
umm-4.859
arians-4.809
 former-4.796
bek-4.730
Token went
Token ID1816
Feature activation+1.00e+00

Pos logit contributions
 and+2.920
,+2.892
 the+2.770
.+2.769
 a+2.536
Neg logit contributions
lor-5.587
arians-5.442
umm-5.432
 former-5.394
osity-5.307
Token to
Token ID284
Feature activation+1.00e+00

Pos logit contributions
 and+1.828
,+1.777
 the+1.612
.+1.610
 a+1.389
Neg logit contributions
lor-6.703
umm-6.554
arians-6.538
 former-6.535
rol-6.454
Token his
Token ID465
Feature activation+1.00e+00

Pos logit contributions
 and+2.346
,+2.285
.+2.112
 the+2.018
 a+1.824
Neg logit contributions
lor-6.184
 former-6.099
umm-6.093
arians-6.057
ocol-6.025
Token bed
Token ID3996
Feature activation+1.00e+00

Pos logit contributions
 and+3.820
,+3.762
 the+3.609
.+3.606
 a+3.367
Neg logit contributions
lor-4.715
umm-4.678
 former-4.675
arians-4.618
ocol-4.473
Token and
Token ID290
Feature activation+1.00e+00

Pos logit contributions
 and+1.127
,+1.079
 the+0.991
.+0.912
 a+0.746
Neg logit contributions
lor-7.402
umm-7.265
 former-7.243
arians-7.198
osity-7.076
Token hid
Token ID24519
Feature activation+1.00e+00

Pos logit contributions
 and+3.565
,+3.514
.+3.363
 the+3.304
 a+3.075
Neg logit contributions
lor-4.983
 former-4.861
umm-4.815
arians-4.811
osity-4.710
Token under
Token ID739
Feature activation+1.00e+00

Pos logit contributions
 and+2.961
,+2.916
 the+2.690
.+2.689
 a+2.492
Neg logit contributions
lor-5.536
umm-5.473
ocol-5.414
arians-5.409
 former-5.398
Token
Token ID198
Feature activation+1.00e+00

Pos logit contributions
 and+3.015
,+2.951
.+2.771
 the+2.762
 a+2.526
Neg logit contributions
lor-5.593
umm-5.522
 former-5.444
arians-5.420
osity-5.352
Token
Token ID198
Feature activation+1.00e+00

Pos logit contributions
 and+1.372
,+1.309
.+1.121
 the+1.113
 a+0.894
Neg logit contributions
lor-7.241
umm-7.056
 former-7.029
arians-7.001
ocol-6.948
TokenTom
Token ID13787
Feature activation+1.00e+00

Pos logit contributions
 and+3.683
,+3.615
.+3.452
 the+3.444
 a+3.212
Neg logit contributions
lor-5.003
umm-4.859
arians-4.809
 former-4.796
bek-4.730
Token went
Token ID1816
Feature activation+1.00e+00

Pos logit contributions
 and+2.920
,+2.892
 the+2.770
.+2.769
 a+2.536
Neg logit contributions
lor-5.587
arians-5.442
umm-5.432
 former-5.394
osity-5.307
Token to
Token ID284
Feature activation+1.00e+00

Pos logit contributions
 and+1.828
,+1.777
 the+1.612
.+1.610
 a+1.389
Neg logit contributions
lor-6.703
umm-6.554
arians-6.538
 former-6.535
rol-6.454
Token his
Token ID465
Feature activation+1.00e+00

Pos logit contributions
 and+2.346
,+2.285
.+2.112
 the+2.018
 a+1.824
Neg logit contributions
lor-6.184
 former-6.099
umm-6.093
arians-6.057
ocol-6.025
Token bed
Token ID3996
Feature activation+1.00e+00

Pos logit contributions
 and+3.820
,+3.762
 the+3.609
.+3.606
 a+3.367
Neg logit contributions
lor-4.715
umm-4.678
 former-4.675
arians-4.618
ocol-4.473
Token and
Token ID290
Feature activation+1.00e+00

Pos logit contributions
 and+1.127
,+1.079
 the+0.991
.+0.912
 a+0.746
Neg logit contributions
lor-7.402
umm-7.265
 former-7.243
arians-7.198
osity-7.076
Token hid
Token ID24519
Feature activation+1.00e+00

Pos logit contributions
 and+3.565
,+3.514
.+3.363
 the+3.304
 a+3.075
Neg logit contributions
lor-4.983
 former-4.861
umm-4.815
arians-4.811
osity-4.710
Token under
Token ID739
Feature activation+1.00e+00

Pos logit contributions
 and+2.961
,+2.916
 the+2.690
.+2.689
 a+2.492
Neg logit contributions
lor-5.536
umm-5.473
ocol-5.414
arians-5.409
 former-5.398
Token the
Token ID262
Feature activation+1.00e+00

Pos logit contributions
 and+1.476
,+1.413
.+1.229
 the+1.136
 a+0.913
Neg logit contributions
lor-7.066
umm-6.989
 former-6.947
ocol-6.920
arians-6.883
Token talking
Token ID3375
Feature activation+1.00e+00

Pos logit contributions
 and+3.724
,+3.635
 the+3.508
.+3.483
 a+3.262
Neg logit contributions
lor-4.826
umm-4.701
 former-4.698
arians-4.670
osity-4.589
Token,
Token ID11
Feature activation+1.00e+00

Pos logit contributions
 and+2.775
,+2.727
 the+2.589
.+2.554
 a+2.342
Neg logit contributions
lor-5.764
umm-5.656
 former-5.624
arians-5.599
osity-5.535
Token table
Token ID3084
Feature activation+1.00e+00

Pos logit contributions
 and+3.718
,+3.674
.+3.540
 the+3.507
 a+3.284
Neg logit contributions
lor-4.794
umm-4.741
 former-4.700
arians-4.671
ocol-4.595
Token?"
Token ID1701
Feature activation+1.00e+00

Pos logit contributions
 and+2.158
,+2.096
 the+1.995
.+1.944
 a+1.751
Neg logit contributions
lor-6.357
umm-6.282
 former-6.198
arians-6.185
ocol-6.109
Token The
Token ID383
Feature activation+1.00e+00

Pos logit contributions
 and+3.107
,+3.033
.+2.881
 the+2.844
 a+2.634
Neg logit contributions
lor-5.425
umm-5.326
 former-5.303
arians-5.271
ocol-5.230
Token table
Token ID3084
Feature activation+1.00e+00

Pos logit contributions
 and+4.176
,+4.120
.+3.955
 the+3.944
 a+3.704
Neg logit contributions
lor-4.374
umm-4.346
 former-4.276
arians-4.251
ocol-4.206
Token did
Token ID750
Feature activation+1.00e+00

Pos logit contributions
 and+3.728
,+3.668
 the+3.549
.+3.524
 a+3.307
Neg logit contributions
lor-4.801
umm-4.700
 former-4.618
arians-4.607
ocol-4.542
Token not
Token ID407
Feature activation+1.00e+00

Pos logit contributions
 and+2.364
,+2.288
 the+2.138
.+2.129
 a+1.891
Neg logit contributions
lor-6.166
 former-6.047
umm-6.023
arians-5.990
ocol-5.930
Token say
Token ID910
Feature activation+1.00e+00

Pos logit contributions
 and+3.878
,+3.816
 the+3.669
.+3.647
 a+3.419
Neg logit contributions
lor-4.622
umm-4.601
 former-4.539
arians-4.513
ocol-4.487
Token anything
Token ID1997
Feature activation+1.00e+00

Pos logit contributions
 and+4.108
,+3.968
.+3.831
 the+3.818
 a+3.581
Neg logit contributions
lor-4.520
umm-4.359
arians-4.291
 former-4.286
ocol-4.271
Token either
Token ID2035
Feature activation+1.00e+00

Pos logit contributions
 and+1.804
,+1.734
 the+1.609
.+1.555
 a+1.395
Neg logit contributions
lor-6.686
umm-6.580
arians-6.541
 former-6.531
osity-6.468
Token you
Token ID345
Feature activation+1.00e+00

Pos logit contributions
 and+3.216
,+3.167
.+3.034
 the+3.023
 a+2.820
Neg logit contributions
lor-5.264
arians-5.180
umm-5.172
 former-5.111
osity-5.059
Token talking
Token ID3375
Feature activation+1.00e+00

Pos logit contributions
 and+3.724
,+3.635
 the+3.508
.+3.483
 a+3.262
Neg logit contributions
lor-4.826
umm-4.701
 former-4.698
arians-4.670
osity-4.589
Token,
Token ID11
Feature activation+1.00e+00

Pos logit contributions
 and+2.775
,+2.727
 the+2.589
.+2.554
 a+2.342
Neg logit contributions
lor-5.764
umm-5.656
 former-5.624
arians-5.599
osity-5.535
Token table
Token ID3084
Feature activation+1.00e+00

Pos logit contributions
 and+3.718
,+3.674
.+3.540
 the+3.507
 a+3.284
Neg logit contributions
lor-4.794
umm-4.741
 former-4.700
arians-4.671
ocol-4.595
Token?"
Token ID1701
Feature activation+1.00e+00

Pos logit contributions
 and+2.158
,+2.096
 the+1.995
.+1.944
 a+1.751
Neg logit contributions
lor-6.357
umm-6.282
 former-6.198
arians-6.185
ocol-6.109
Token The
Token ID383
Feature activation+1.00e+00

Pos logit contributions
 and+3.107
,+3.033
.+2.881
 the+2.844
 a+2.634
Neg logit contributions
lor-5.425
umm-5.326
 former-5.303
arians-5.271
ocol-5.230
Token table
Token ID3084
Feature activation+1.00e+00

Pos logit contributions
 and+4.176
,+4.120
.+3.955
 the+3.944
 a+3.704
Neg logit contributions
lor-4.374
umm-4.346
 former-4.276
arians-4.251
ocol-4.206
Token did
Token ID750
Feature activation+1.00e+00

Pos logit contributions
 and+3.728
,+3.668
 the+3.549
.+3.524
 a+3.307
Neg logit contributions
lor-4.801
umm-4.700
 former-4.618
arians-4.607
ocol-4.542
Token not
Token ID407
Feature activation+1.00e+00

Pos logit contributions
 and+2.364
,+2.288
 the+2.138
.+2.129
 a+1.891
Neg logit contributions
lor-6.166
 former-6.047
umm-6.023
arians-5.990
ocol-5.930
Token say
Token ID910
Feature activation+1.00e+00

Pos logit contributions
 and+3.878
,+3.816
 the+3.669
.+3.647
 a+3.419
Neg logit contributions
lor-4.622
umm-4.601
 former-4.539
arians-4.513
ocol-4.487
Token anything
Token ID1997
Feature activation+1.00e+00

Pos logit contributions
 and+4.108
,+3.968
.+3.831
 the+3.818
 a+3.581
Neg logit contributions
lor-4.520
umm-4.359
arians-4.291
 former-4.286
ocol-4.271
Token,
Token ID11
Feature activation+1.00e+00

Pos logit contributions
 and+2.775
,+2.727
 the+2.589
.+2.554
 a+2.342
Neg logit contributions
lor-5.764
umm-5.656
 former-5.624
arians-5.599
osity-5.535
Token table
Token ID3084
Feature activation+1.00e+00

Pos logit contributions
 and+3.718
,+3.674
.+3.540
 the+3.507
 a+3.284
Neg logit contributions
lor-4.794
umm-4.741
 former-4.700
arians-4.671
ocol-4.595
Token?"
Token ID1701
Feature activation+1.00e+00

Pos logit contributions
 and+2.158
,+2.096
 the+1.995
.+1.944
 a+1.751
Neg logit contributions
lor-6.357
umm-6.282
 former-6.198
arians-6.185
ocol-6.109
Token The
Token ID383
Feature activation+1.00e+00

Pos logit contributions
 and+3.107
,+3.033
.+2.881
 the+2.844
 a+2.634
Neg logit contributions
lor-5.425
umm-5.326
 former-5.303
arians-5.271
ocol-5.230
Token table
Token ID3084
Feature activation+1.00e+00

Pos logit contributions
 and+4.176
,+4.120
.+3.955
 the+3.944
 a+3.704
Neg logit contributions
lor-4.374
umm-4.346
 former-4.276
arians-4.251
ocol-4.206
Token did
Token ID750
Feature activation+1.00e+00

Pos logit contributions
 and+3.728
,+3.668
 the+3.549
.+3.524
 a+3.307
Neg logit contributions
lor-4.801
umm-4.700
 former-4.618
arians-4.607
ocol-4.542
Token not
Token ID407
Feature activation+1.00e+00

Pos logit contributions
 and+2.364
,+2.288
 the+2.138
.+2.129
 a+1.891
Neg logit contributions
lor-6.166
 former-6.047
umm-6.023
arians-5.990
ocol-5.930
Token say
Token ID910
Feature activation+1.00e+00

Pos logit contributions
 and+3.878
,+3.816
 the+3.669
.+3.647
 a+3.419
Neg logit contributions
lor-4.622
umm-4.601
 former-4.539
arians-4.513
ocol-4.487
Token anything
Token ID1997
Feature activation+1.00e+00

Pos logit contributions
 and+4.108
,+3.968
.+3.831
 the+3.818
 a+3.581
Neg logit contributions
lor-4.520
umm-4.359
arians-4.291
 former-4.286
ocol-4.271
Token either
Token ID2035
Feature activation+1.00e+00

Pos logit contributions
 and+1.804
,+1.734
 the+1.609
.+1.555
 a+1.395
Neg logit contributions
lor-6.686
umm-6.580
arians-6.541
 former-6.531
osity-6.468
Token.
Token ID13
Feature activation+1.00e+00

Pos logit contributions
 and+0.863
,+0.785
 the+0.689
.+0.614
 a+0.470
Neg logit contributions
lor-7.634
umm-7.537
arians-7.447
 former-7.422
osity-7.402
Token table
Token ID3084
Feature activation+1.00e+00

Pos logit contributions
 and+3.718
,+3.674
.+3.540
 the+3.507
 a+3.284
Neg logit contributions
lor-4.794
umm-4.741
 former-4.700
arians-4.671
ocol-4.595
Token?"
Token ID1701
Feature activation+1.00e+00

Pos logit contributions
 and+2.158
,+2.096
 the+1.995
.+1.944
 a+1.751
Neg logit contributions
lor-6.357
umm-6.282
 former-6.198
arians-6.185
ocol-6.109
Token The
Token ID383
Feature activation+1.00e+00

Pos logit contributions
 and+3.107
,+3.033
.+2.881
 the+2.844
 a+2.634
Neg logit contributions
lor-5.425
umm-5.326
 former-5.303
arians-5.271
ocol-5.230
Token table
Token ID3084
Feature activation+1.00e+00

Pos logit contributions
 and+4.176
,+4.120
.+3.955
 the+3.944
 a+3.704
Neg logit contributions
lor-4.374
umm-4.346
 former-4.276
arians-4.251
ocol-4.206
Token did
Token ID750
Feature activation+1.00e+00

Pos logit contributions
 and+3.728
,+3.668
 the+3.549
.+3.524
 a+3.307
Neg logit contributions
lor-4.801
umm-4.700
 former-4.618
arians-4.607
ocol-4.542
Token not
Token ID407
Feature activation+1.00e+00

Pos logit contributions
 and+2.364
,+2.288
 the+2.138
.+2.129
 a+1.891
Neg logit contributions
lor-6.166
 former-6.047
umm-6.023
arians-5.990
ocol-5.930
Token say
Token ID910
Feature activation+1.00e+00

Pos logit contributions
 and+3.878
,+3.816
 the+3.669
.+3.647
 a+3.419
Neg logit contributions
lor-4.622
umm-4.601
 former-4.539
arians-4.513
ocol-4.487
Token anything
Token ID1997
Feature activation+1.00e+00

Pos logit contributions
 and+4.108
,+3.968
.+3.831
 the+3.818
 a+3.581
Neg logit contributions
lor-4.520
umm-4.359
arians-4.291
 former-4.286
ocol-4.271
Token either
Token ID2035
Feature activation+1.00e+00

Pos logit contributions
 and+1.804
,+1.734
 the+1.609
.+1.555
 a+1.395
Neg logit contributions
lor-6.686
umm-6.580
arians-6.541
 former-6.531
osity-6.468
Token.
Token ID13
Feature activation+1.00e+00

Pos logit contributions
 and+0.863
,+0.785
 the+0.689
.+0.614
 a+0.470
Neg logit contributions
lor-7.634
umm-7.537
arians-7.447
 former-7.422
osity-7.402
Token Tom
Token ID4186
Feature activation+1.00e+00

Pos logit contributions
 and+2.916
,+2.859
.+2.687
 the+2.670
 a+2.437
Neg logit contributions
lor-5.695
umm-5.585
 former-5.539
arians-5.511
ocol-5.442
Token either
Token ID2035
Feature activation+1.00e+00

Pos logit contributions
 and+1.804
,+1.734
 the+1.609
.+1.555
 a+1.395
Neg logit contributions
lor-6.686
umm-6.580
arians-6.541
 former-6.531
osity-6.468
Token.
Token ID13
Feature activation+1.00e+00

Pos logit contributions
 and+0.863
,+0.785
 the+0.689
.+0.614
 a+0.470
Neg logit contributions
lor-7.634
umm-7.537
arians-7.447
 former-7.422
osity-7.402
Token Tom
Token ID4186
Feature activation+1.00e+00

Pos logit contributions
 and+2.916
,+2.859
.+2.687
 the+2.670
 a+2.437
Neg logit contributions
lor-5.695
umm-5.585
 former-5.539
arians-5.511
ocol-5.442
Token was
Token ID373
Feature activation+1.00e+00

Pos logit contributions
 and+2.955
,+2.931
.+2.798
 the+2.797
 a+2.563
Neg logit contributions
lor-5.540
umm-5.408
arians-5.399
 former-5.380
osity-5.267
Token scared
Token ID12008
Feature activation+1.00e+00

Pos logit contributions
 and+3.620
,+3.574
.+3.406
 the+3.369
 a+3.101
Neg logit contributions
lor-4.900
 former-4.824
umm-4.802
arians-4.761
ocol-4.709
Token.
Token ID13
Feature activation+1.00e+00

Pos logit contributions
 and+0.464
,+0.409
 the+0.320
.+0.233
 a+0.123
Neg logit contributions
umm-7.913
lor-7.888
arians-7.766
ocol-7.763
 former-7.749
Token
Token ID198
Feature activation+1.00e+00

Pos logit contributions
 and+3.015
,+2.951
.+2.771
 the+2.762
 a+2.526
Neg logit contributions
lor-5.593
umm-5.522
 former-5.444
arians-5.420
osity-5.352
Token
Token ID198
Feature activation+1.00e+00

Pos logit contributions
 and+1.372
,+1.309
.+1.121
 the+1.113
 a+0.894
Neg logit contributions
lor-7.241
umm-7.056
 former-7.029
arians-7.001
ocol-6.948
TokenTom
Token ID13787
Feature activation+1.00e+00

Pos logit contributions
 and+3.683
,+3.615
.+3.452
 the+3.444
 a+3.212
Neg logit contributions
lor-5.003
umm-4.859
arians-4.809
 former-4.796
bek-4.730
Token went
Token ID1816
Feature activation+1.00e+00

Pos logit contributions
 and+2.920
,+2.892
 the+2.770
.+2.769
 a+2.536
Neg logit contributions
lor-5.587
arians-5.442
umm-5.432
 former-5.394
osity-5.307
Token to
Token ID284
Feature activation+1.00e+00

Pos logit contributions
 and+1.828
,+1.777
 the+1.612
.+1.610
 a+1.389
Neg logit contributions
lor-6.703
umm-6.554
arians-6.538
 former-6.535
rol-6.454
Token anything
Token ID1997
Feature activation+1.00e+00

Pos logit contributions
 and+4.108
,+3.968
.+3.831
 the+3.818
 a+3.581
Neg logit contributions
lor-4.520
umm-4.359
arians-4.291
 former-4.286
ocol-4.271
Token either
Token ID2035
Feature activation+1.00e+00

Pos logit contributions
 and+1.804
,+1.734
 the+1.609
.+1.555
 a+1.395
Neg logit contributions
lor-6.686
umm-6.580
arians-6.541
 former-6.531
osity-6.468
Token.
Token ID13
Feature activation+1.00e+00

Pos logit contributions
 and+0.863
,+0.785
 the+0.689
.+0.614
 a+0.470
Neg logit contributions
lor-7.634
umm-7.537
arians-7.447
 former-7.422
osity-7.402
Token Tom
Token ID4186
Feature activation+1.00e+00

Pos logit contributions
 and+2.916
,+2.859
.+2.687
 the+2.670
 a+2.437
Neg logit contributions
lor-5.695
umm-5.585
 former-5.539
arians-5.511
ocol-5.442
Token was
Token ID373
Feature activation+1.00e+00

Pos logit contributions
 and+2.955
,+2.931
.+2.798
 the+2.797
 a+2.563
Neg logit contributions
lor-5.540
umm-5.408
arians-5.399
 former-5.380
osity-5.267
Token scared
Token ID12008
Feature activation+1.00e+00

Pos logit contributions
 and+3.620
,+3.574
.+3.406
 the+3.369
 a+3.101
Neg logit contributions
lor-4.900
 former-4.824
umm-4.802
arians-4.761
ocol-4.709
Token.
Token ID13
Feature activation+1.00e+00

Pos logit contributions
 and+0.464
,+0.409
 the+0.320
.+0.233
 a+0.123
Neg logit contributions
umm-7.913
lor-7.888
arians-7.766
ocol-7.763
 former-7.749
Token
Token ID198
Feature activation+1.00e+00

Pos logit contributions
 and+3.015
,+2.951
.+2.771
 the+2.762
 a+2.526
Neg logit contributions
lor-5.593
umm-5.522
 former-5.444
arians-5.420
osity-5.352
Token
Token ID198
Feature activation+1.00e+00

Pos logit contributions
 and+1.372
,+1.309
.+1.121
 the+1.113
 a+0.894
Neg logit contributions
lor-7.241
umm-7.056
 former-7.029
arians-7.001
ocol-6.948
TokenTom
Token ID13787
Feature activation+1.00e+00

Pos logit contributions
 and+3.683
,+3.615
.+3.452
 the+3.444
 a+3.212
Neg logit contributions
lor-5.003
umm-4.859
arians-4.809
 former-4.796
bek-4.730
Token went
Token ID1816
Feature activation+1.00e+00

Pos logit contributions
 and+2.920
,+2.892
 the+2.770
.+2.769
 a+2.536
Neg logit contributions
lor-5.587
arians-5.442
umm-5.432
 former-5.394
osity-5.307
Token not
Token ID407
Feature activation+1.00e+00

Pos logit contributions
 and+2.364
,+2.288
 the+2.138
.+2.129
 a+1.891
Neg logit contributions
lor-6.166
 former-6.047
umm-6.023
arians-5.990
ocol-5.930
Token say
Token ID910
Feature activation+1.00e+00

Pos logit contributions
 and+3.878
,+3.816
 the+3.669
.+3.647
 a+3.419
Neg logit contributions
lor-4.622
umm-4.601
 former-4.539
arians-4.513
ocol-4.487
Token anything
Token ID1997
Feature activation+1.00e+00

Pos logit contributions
 and+4.108
,+3.968
.+3.831
 the+3.818
 a+3.581
Neg logit contributions
lor-4.520
umm-4.359
arians-4.291
 former-4.286
ocol-4.271
Token either
Token ID2035
Feature activation+1.00e+00

Pos logit contributions
 and+1.804
,+1.734
 the+1.609
.+1.555
 a+1.395
Neg logit contributions
lor-6.686
umm-6.580
arians-6.541
 former-6.531
osity-6.468
Token.
Token ID13
Feature activation+1.00e+00

Pos logit contributions
 and+0.863
,+0.785
 the+0.689
.+0.614
 a+0.470
Neg logit contributions
lor-7.634
umm-7.537
arians-7.447
 former-7.422
osity-7.402
Token Tom
Token ID4186
Feature activation+1.00e+00

Pos logit contributions
 and+2.916
,+2.859
.+2.687
 the+2.670
 a+2.437
Neg logit contributions
lor-5.695
umm-5.585
 former-5.539
arians-5.511
ocol-5.442
Token was
Token ID373
Feature activation+1.00e+00

Pos logit contributions
 and+2.955
,+2.931
.+2.798
 the+2.797
 a+2.563
Neg logit contributions
lor-5.540
umm-5.408
arians-5.399
 former-5.380
osity-5.267
Token scared
Token ID12008
Feature activation+1.00e+00

Pos logit contributions
 and+3.620
,+3.574
.+3.406
 the+3.369
 a+3.101
Neg logit contributions
lor-4.900
 former-4.824
umm-4.802
arians-4.761
ocol-4.709
Token.
Token ID13
Feature activation+1.00e+00

Pos logit contributions
 and+0.464
,+0.409
 the+0.320
.+0.233
 a+0.123
Neg logit contributions
umm-7.913
lor-7.888
arians-7.766
ocol-7.763
 former-7.749
Token
Token ID198
Feature activation+1.00e+00

Pos logit contributions
 and+3.015
,+2.951
.+2.771
 the+2.762
 a+2.526
Neg logit contributions
lor-5.593
umm-5.522
 former-5.444
arians-5.420
osity-5.352
Token
Token ID198
Feature activation+1.00e+00

Pos logit contributions
 and+1.372
,+1.309
.+1.121
 the+1.113
 a+0.894
Neg logit contributions
lor-7.241
umm-7.056
 former-7.029
arians-7.001
ocol-6.948
Token say
Token ID910
Feature activation+1.00e+00

Pos logit contributions
 and+3.878
,+3.816
 the+3.669
.+3.647
 a+3.419
Neg logit contributions
lor-4.622
umm-4.601
 former-4.539
arians-4.513
ocol-4.487
Token anything
Token ID1997
Feature activation+1.00e+00

Pos logit contributions
 and+4.108
,+3.968
.+3.831
 the+3.818
 a+3.581
Neg logit contributions
lor-4.520
umm-4.359
arians-4.291
 former-4.286
ocol-4.271
Token either
Token ID2035
Feature activation+1.00e+00

Pos logit contributions
 and+1.804
,+1.734
 the+1.609
.+1.555
 a+1.395
Neg logit contributions
lor-6.686
umm-6.580
arians-6.541
 former-6.531
osity-6.468
Token.
Token ID13
Feature activation+1.00e+00

Pos logit contributions
 and+0.863
,+0.785
 the+0.689
.+0.614
 a+0.470
Neg logit contributions
lor-7.634
umm-7.537
arians-7.447
 former-7.422
osity-7.402
Token Tom
Token ID4186
Feature activation+1.00e+00

Pos logit contributions
 and+2.916
,+2.859
.+2.687
 the+2.670
 a+2.437
Neg logit contributions
lor-5.695
umm-5.585
 former-5.539
arians-5.511
ocol-5.442
Token was
Token ID373
Feature activation+1.00e+00

Pos logit contributions
 and+2.955
,+2.931
.+2.798
 the+2.797
 a+2.563
Neg logit contributions
lor-5.540
umm-5.408
arians-5.399
 former-5.380
osity-5.267
Token scared
Token ID12008
Feature activation+1.00e+00

Pos logit contributions
 and+3.620
,+3.574
.+3.406
 the+3.369
 a+3.101
Neg logit contributions
lor-4.900
 former-4.824
umm-4.802
arians-4.761
ocol-4.709
Token.
Token ID13
Feature activation+1.00e+00

Pos logit contributions
 and+0.464
,+0.409
 the+0.320
.+0.233
 a+0.123
Neg logit contributions
umm-7.913
lor-7.888
arians-7.766
ocol-7.763
 former-7.749
Token
Token ID198
Feature activation+1.00e+00

Pos logit contributions
 and+3.015
,+2.951
.+2.771
 the+2.762
 a+2.526
Neg logit contributions
lor-5.593
umm-5.522
 former-5.444
arians-5.420
osity-5.352
Token
Token ID198
Feature activation+1.00e+00

Pos logit contributions
 and+1.372
,+1.309
.+1.121
 the+1.113
 a+0.894
Neg logit contributions
lor-7.241
umm-7.056
 former-7.029
arians-7.001
ocol-6.948
TokenTom
Token ID13787
Feature activation+1.00e+00

Pos logit contributions
 and+3.683
,+3.615
.+3.452
 the+3.444
 a+3.212
Neg logit contributions
lor-5.003
umm-4.859
arians-4.809
 former-4.796
bek-4.730
Token Tom
Token ID4186
Feature activation+1.00e+00

Pos logit contributions
 and+2.916
,+2.859
.+2.687
 the+2.670
 a+2.437
Neg logit contributions
lor-5.695
umm-5.585
 former-5.539
arians-5.511
ocol-5.442
Token was
Token ID373
Feature activation+1.00e+00

Pos logit contributions
 and+2.955
,+2.931
.+2.798
 the+2.797
 a+2.563
Neg logit contributions
lor-5.540
umm-5.408
arians-5.399
 former-5.380
osity-5.267
Token scared
Token ID12008
Feature activation+1.00e+00

Pos logit contributions
 and+3.620
,+3.574
.+3.406
 the+3.369
 a+3.101
Neg logit contributions
lor-4.900
 former-4.824
umm-4.802
arians-4.761
ocol-4.709
Token.
Token ID13
Feature activation+1.00e+00

Pos logit contributions
 and+0.464
,+0.409
 the+0.320
.+0.233
 a+0.123
Neg logit contributions
umm-7.913
lor-7.888
arians-7.766
ocol-7.763
 former-7.749
Token
Token ID198
Feature activation+1.00e+00

Pos logit contributions
 and+3.015
,+2.951
.+2.771
 the+2.762
 a+2.526
Neg logit contributions
lor-5.593
umm-5.522
 former-5.444
arians-5.420
osity-5.352
Token
Token ID198
Feature activation+1.00e+00

Pos logit contributions
 and+1.372
,+1.309
.+1.121
 the+1.113
 a+0.894
Neg logit contributions
lor-7.241
umm-7.056
 former-7.029
arians-7.001
ocol-6.948
TokenTom
Token ID13787
Feature activation+1.00e+00

Pos logit contributions
 and+3.683
,+3.615
.+3.452
 the+3.444
 a+3.212
Neg logit contributions
lor-5.003
umm-4.859
arians-4.809
 former-4.796
bek-4.730
Token went
Token ID1816
Feature activation+1.00e+00

Pos logit contributions
 and+2.920
,+2.892
 the+2.770
.+2.769
 a+2.536
Neg logit contributions
lor-5.587
arians-5.442
umm-5.432
 former-5.394
osity-5.307
Token to
Token ID284
Feature activation+1.00e+00

Pos logit contributions
 and+1.828
,+1.777
 the+1.612
.+1.610
 a+1.389
Neg logit contributions
lor-6.703
umm-6.554
arians-6.538
 former-6.535
rol-6.454
Token his
Token ID465
Feature activation+1.00e+00

Pos logit contributions
 and+2.346
,+2.285
.+2.112
 the+2.018
 a+1.824
Neg logit contributions
lor-6.184
 former-6.099
umm-6.093
arians-6.057
ocol-6.025
Token bed
Token ID3996
Feature activation+1.00e+00

Pos logit contributions
 and+3.820
,+3.762
 the+3.609
.+3.606
 a+3.367
Neg logit contributions
lor-4.715
umm-4.678
 former-4.675
arians-4.618
ocol-4.473
Token.
Token ID13
Feature activation+1.00e+00

Pos logit contributions
 and+0.863
,+0.785
 the+0.689
.+0.614
 a+0.470
Neg logit contributions
lor-7.634
umm-7.537
arians-7.447
 former-7.422
osity-7.402
Token Tom
Token ID4186
Feature activation+1.00e+00

Pos logit contributions
 and+2.916
,+2.859
.+2.687
 the+2.670
 a+2.437
Neg logit contributions
lor-5.695
umm-5.585
 former-5.539
arians-5.511
ocol-5.442
Token was
Token ID373
Feature activation+1.00e+00

Pos logit contributions
 and+2.955
,+2.931
.+2.798
 the+2.797
 a+2.563
Neg logit contributions
lor-5.540
umm-5.408
arians-5.399
 former-5.380
osity-5.267
Token scared
Token ID12008
Feature activation+1.00e+00

Pos logit contributions
 and+3.620
,+3.574
.+3.406
 the+3.369
 a+3.101
Neg logit contributions
lor-4.900
 former-4.824
umm-4.802
arians-4.761
ocol-4.709
Token.
Token ID13
Feature activation+1.00e+00

Pos logit contributions
 and+0.464
,+0.409
 the+0.320
.+0.233
 a+0.123
Neg logit contributions
umm-7.913
lor-7.888
arians-7.766
ocol-7.763
 former-7.749
Token
Token ID198
Feature activation+1.00e+00

Pos logit contributions
 and+3.015
,+2.951
.+2.771
 the+2.762
 a+2.526
Neg logit contributions
lor-5.593
umm-5.522
 former-5.444
arians-5.420
osity-5.352
Token
Token ID198
Feature activation+1.00e+00

Pos logit contributions
 and+1.372
,+1.309
.+1.121
 the+1.113
 a+0.894
Neg logit contributions
lor-7.241
umm-7.056
 former-7.029
arians-7.001
ocol-6.948
TokenTom
Token ID13787
Feature activation+1.00e+00

Pos logit contributions
 and+3.683
,+3.615
.+3.452
 the+3.444
 a+3.212
Neg logit contributions
lor-5.003
umm-4.859
arians-4.809
 former-4.796
bek-4.730
Token went
Token ID1816
Feature activation+1.00e+00

Pos logit contributions
 and+2.920
,+2.892
 the+2.770
.+2.769
 a+2.536
Neg logit contributions
lor-5.587
arians-5.442
umm-5.432
 former-5.394
osity-5.307
Token to
Token ID284
Feature activation+1.00e+00

Pos logit contributions
 and+1.828
,+1.777
 the+1.612
.+1.610
 a+1.389
Neg logit contributions
lor-6.703
umm-6.554
arians-6.538
 former-6.535
rol-6.454
Token his
Token ID465
Feature activation+1.00e+00

Pos logit contributions
 and+2.346
,+2.285
.+2.112
 the+2.018
 a+1.824
Neg logit contributions
lor-6.184
 former-6.099
umm-6.093
arians-6.057
ocol-6.025
Token was
Token ID373
Feature activation+1.00e+00

Pos logit contributions
 and+2.955
,+2.931
.+2.798
 the+2.797
 a+2.563
Neg logit contributions
lor-5.540
umm-5.408
arians-5.399
 former-5.380
osity-5.267
Token scared
Token ID12008
Feature activation+1.00e+00

Pos logit contributions
 and+3.620
,+3.574
.+3.406
 the+3.369
 a+3.101
Neg logit contributions
lor-4.900
 former-4.824
umm-4.802
arians-4.761
ocol-4.709
Token.
Token ID13
Feature activation+1.00e+00

Pos logit contributions
 and+0.464
,+0.409
 the+0.320
.+0.233
 a+0.123
Neg logit contributions
umm-7.913
lor-7.888
arians-7.766
ocol-7.763
 former-7.749
Token
Token ID198
Feature activation+1.00e+00

Pos logit contributions
 and+3.015
,+2.951
.+2.771
 the+2.762
 a+2.526
Neg logit contributions
lor-5.593
umm-5.522
 former-5.444
arians-5.420
osity-5.352
Token
Token ID198
Feature activation+1.00e+00

Pos logit contributions
 and+1.372
,+1.309
.+1.121
 the+1.113
 a+0.894
Neg logit contributions
lor-7.241
umm-7.056
 former-7.029
arians-7.001
ocol-6.948
TokenTom
Token ID13787
Feature activation+1.00e+00

Pos logit contributions
 and+3.683
,+3.615
.+3.452
 the+3.444
 a+3.212
Neg logit contributions
lor-5.003
umm-4.859
arians-4.809
 former-4.796
bek-4.730
Token went
Token ID1816
Feature activation+1.00e+00

Pos logit contributions
 and+2.920
,+2.892
 the+2.770
.+2.769
 a+2.536
Neg logit contributions
lor-5.587
arians-5.442
umm-5.432
 former-5.394
osity-5.307
Token to
Token ID284
Feature activation+1.00e+00

Pos logit contributions
 and+1.828
,+1.777
 the+1.612
.+1.610
 a+1.389
Neg logit contributions
lor-6.703
umm-6.554
arians-6.538
 former-6.535
rol-6.454
Token his
Token ID465
Feature activation+1.00e+00

Pos logit contributions
 and+2.346
,+2.285
.+2.112
 the+2.018
 a+1.824
Neg logit contributions
lor-6.184
 former-6.099
umm-6.093
arians-6.057
ocol-6.025
Token bed
Token ID3996
Feature activation+1.00e+00

Pos logit contributions
 and+3.820
,+3.762
 the+3.609
.+3.606
 a+3.367
Neg logit contributions
lor-4.715
umm-4.678
 former-4.675
arians-4.618
ocol-4.473
Token and
Token ID290
Feature activation+1.00e+00

Pos logit contributions
 and+1.127
,+1.079
 the+0.991
.+0.912
 a+0.746
Neg logit contributions
lor-7.402
umm-7.265
 former-7.243
arians-7.198
osity-7.076
Token scared
Token ID12008
Feature activation+1.00e+00

Pos logit contributions
 and+3.620
,+3.574
.+3.406
 the+3.369
 a+3.101
Neg logit contributions
lor-4.900
 former-4.824
umm-4.802
arians-4.761
ocol-4.709
Token.
Token ID13
Feature activation+1.00e+00

Pos logit contributions
 and+0.464
,+0.409
 the+0.320
.+0.233
 a+0.123
Neg logit contributions
umm-7.913
lor-7.888
arians-7.766
ocol-7.763
 former-7.749
Token
Token ID198
Feature activation+1.00e+00

Pos logit contributions
 and+3.015
,+2.951
.+2.771
 the+2.762
 a+2.526
Neg logit contributions
lor-5.593
umm-5.522
 former-5.444
arians-5.420
osity-5.352
Token
Token ID198
Feature activation+1.00e+00

Pos logit contributions
 and+1.372
,+1.309
.+1.121
 the+1.113
 a+0.894
Neg logit contributions
lor-7.241
umm-7.056
 former-7.029
arians-7.001
ocol-6.948
TokenTom
Token ID13787
Feature activation+1.00e+00

Pos logit contributions
 and+3.683
,+3.615
.+3.452
 the+3.444
 a+3.212
Neg logit contributions
lor-5.003
umm-4.859
arians-4.809
 former-4.796
bek-4.730
Token went
Token ID1816
Feature activation+1.00e+00

Pos logit contributions
 and+2.920
,+2.892
 the+2.770
.+2.769
 a+2.536
Neg logit contributions
lor-5.587
arians-5.442
umm-5.432
 former-5.394
osity-5.307
Token to
Token ID284
Feature activation+1.00e+00

Pos logit contributions
 and+1.828
,+1.777
 the+1.612
.+1.610
 a+1.389
Neg logit contributions
lor-6.703
umm-6.554
arians-6.538
 former-6.535
rol-6.454
Token his
Token ID465
Feature activation+1.00e+00

Pos logit contributions
 and+2.346
,+2.285
.+2.112
 the+2.018
 a+1.824
Neg logit contributions
lor-6.184
 former-6.099
umm-6.093
arians-6.057
ocol-6.025
Token bed
Token ID3996
Feature activation+1.00e+00

Pos logit contributions
 and+3.820
,+3.762
 the+3.609
.+3.606
 a+3.367
Neg logit contributions
lor-4.715
umm-4.678
 former-4.675
arians-4.618
ocol-4.473
Token and
Token ID290
Feature activation+1.00e+00

Pos logit contributions
 and+1.127
,+1.079
 the+0.991
.+0.912
 a+0.746
Neg logit contributions
lor-7.402
umm-7.265
 former-7.243
arians-7.198
osity-7.076
Token hid
Token ID24519
Feature activation+1.00e+00

Pos logit contributions
 and+3.565
,+3.514
.+3.363
 the+3.304
 a+3.075
Neg logit contributions
lor-4.983
 former-4.861
umm-4.815
arians-4.811
osity-4.710
Token did
Token ID750
Feature activation+1.00e+00

Pos logit contributions
 and+3.728
,+3.668
 the+3.549
.+3.524
 a+3.307
Neg logit contributions
lor-4.801
umm-4.700
 former-4.618
arians-4.607
ocol-4.542
Token not
Token ID407
Feature activation+1.00e+00

Pos logit contributions
 and+2.364
,+2.288
 the+2.138
.+2.129
 a+1.891
Neg logit contributions
lor-6.166
 former-6.047
umm-6.023
arians-5.990
ocol-5.930
Token say
Token ID910
Feature activation+1.00e+00

Pos logit contributions
 and+3.878
,+3.816
 the+3.669
.+3.647
 a+3.419
Neg logit contributions
lor-4.622
umm-4.601
 former-4.539
arians-4.513
ocol-4.487
Token anything
Token ID1997
Feature activation+1.00e+00

Pos logit contributions
 and+4.108
,+3.968
.+3.831
 the+3.818
 a+3.581
Neg logit contributions
lor-4.520
umm-4.359
arians-4.291
 former-4.286
ocol-4.271
Token either
Token ID2035
Feature activation+1.00e+00

Pos logit contributions
 and+1.804
,+1.734
 the+1.609
.+1.555
 a+1.395
Neg logit contributions
lor-6.686
umm-6.580
arians-6.541
 former-6.531
osity-6.468
Token.
Token ID13
Feature activation+1.00e+00

Pos logit contributions
 and+0.863
,+0.785
 the+0.689
.+0.614
 a+0.470
Neg logit contributions
lor-7.634
umm-7.537
arians-7.447
 former-7.422
osity-7.402
Token Tom
Token ID4186
Feature activation+1.00e+00

Pos logit contributions
 and+2.916
,+2.859
.+2.687
 the+2.670
 a+2.437
Neg logit contributions
lor-5.695
umm-5.585
 former-5.539
arians-5.511
ocol-5.442
Token was
Token ID373
Feature activation+1.00e+00

Pos logit contributions
 and+2.955
,+2.931
.+2.798
 the+2.797
 a+2.563
Neg logit contributions
lor-5.540
umm-5.408
arians-5.399
 former-5.380
osity-5.267
Token scared
Token ID12008
Feature activation+1.00e+00

Pos logit contributions
 and+3.620
,+3.574
.+3.406
 the+3.369
 a+3.101
Neg logit contributions
lor-4.900
 former-4.824
umm-4.802
arians-4.761
ocol-4.709
Token.
Token ID13
Feature activation+1.00e+00

Pos logit contributions
 and+0.464
,+0.409
 the+0.320
.+0.233
 a+0.123
Neg logit contributions
umm-7.913
lor-7.888
arians-7.766
ocol-7.763
 former-7.749
Token
Token ID198
Feature activation+1.00e+00

Pos logit contributions
 and+3.015
,+2.951
.+2.771
 the+2.762
 a+2.526
Neg logit contributions
lor-5.593
umm-5.522
 former-5.444
arians-5.420
osity-5.352
Token table
Token ID3084
Feature activation+1.00e+00

Pos logit contributions
 and+4.176
,+4.120
.+3.955
 the+3.944
 a+3.704
Neg logit contributions
lor-4.374
umm-4.346
 former-4.276
arians-4.251
ocol-4.206
Token did
Token ID750
Feature activation+1.00e+00

Pos logit contributions
 and+3.728
,+3.668
 the+3.549
.+3.524
 a+3.307
Neg logit contributions
lor-4.801
umm-4.700
 former-4.618
arians-4.607
ocol-4.542
Token not
Token ID407
Feature activation+1.00e+00

Pos logit contributions
 and+2.364
,+2.288
 the+2.138
.+2.129
 a+1.891
Neg logit contributions
lor-6.166
 former-6.047
umm-6.023
arians-5.990
ocol-5.930
Token say
Token ID910
Feature activation+1.00e+00

Pos logit contributions
 and+3.878
,+3.816
 the+3.669
.+3.647
 a+3.419
Neg logit contributions
lor-4.622
umm-4.601
 former-4.539
arians-4.513
ocol-4.487
Token anything
Token ID1997
Feature activation+1.00e+00

Pos logit contributions
 and+4.108
,+3.968
.+3.831
 the+3.818
 a+3.581
Neg logit contributions
lor-4.520
umm-4.359
arians-4.291
 former-4.286
ocol-4.271
Token either
Token ID2035
Feature activation+1.00e+00

Pos logit contributions
 and+1.804
,+1.734
 the+1.609
.+1.555
 a+1.395
Neg logit contributions
lor-6.686
umm-6.580
arians-6.541
 former-6.531
osity-6.468
Token.
Token ID13
Feature activation+1.00e+00

Pos logit contributions
 and+0.863
,+0.785
 the+0.689
.+0.614
 a+0.470
Neg logit contributions
lor-7.634
umm-7.537
arians-7.447
 former-7.422
osity-7.402
Token Tom
Token ID4186
Feature activation+1.00e+00

Pos logit contributions
 and+2.916
,+2.859
.+2.687
 the+2.670
 a+2.437
Neg logit contributions
lor-5.695
umm-5.585
 former-5.539
arians-5.511
ocol-5.442
Token was
Token ID373
Feature activation+1.00e+00

Pos logit contributions
 and+2.955
,+2.931
.+2.798
 the+2.797
 a+2.563
Neg logit contributions
lor-5.540
umm-5.408
arians-5.399
 former-5.380
osity-5.267
Token scared
Token ID12008
Feature activation+1.00e+00

Pos logit contributions
 and+3.620
,+3.574
.+3.406
 the+3.369
 a+3.101
Neg logit contributions
lor-4.900
 former-4.824
umm-4.802
arians-4.761
ocol-4.709
Token.
Token ID13
Feature activation+1.00e+00

Pos logit contributions
 and+0.464
,+0.409
 the+0.320
.+0.233
 a+0.123
Neg logit contributions
umm-7.913
lor-7.888
arians-7.766
ocol-7.763
 former-7.749
Token?"
Token ID1701
Feature activation+1.00e+00

Pos logit contributions
 and+2.158
,+2.096
 the+1.995
.+1.944
 a+1.751
Neg logit contributions
lor-6.357
umm-6.282
 former-6.198
arians-6.185
ocol-6.109
Token The
Token ID383
Feature activation+1.00e+00

Pos logit contributions
 and+3.107
,+3.033
.+2.881
 the+2.844
 a+2.634
Neg logit contributions
lor-5.425
umm-5.326
 former-5.303
arians-5.271
ocol-5.230
Token table
Token ID3084
Feature activation+1.00e+00

Pos logit contributions
 and+4.176
,+4.120
.+3.955
 the+3.944
 a+3.704
Neg logit contributions
lor-4.374
umm-4.346
 former-4.276
arians-4.251
ocol-4.206
Token did
Token ID750
Feature activation+1.00e+00

Pos logit contributions
 and+3.728
,+3.668
 the+3.549
.+3.524
 a+3.307
Neg logit contributions
lor-4.801
umm-4.700
 former-4.618
arians-4.607
ocol-4.542
Token not
Token ID407
Feature activation+1.00e+00

Pos logit contributions
 and+2.364
,+2.288
 the+2.138
.+2.129
 a+1.891
Neg logit contributions
lor-6.166
 former-6.047
umm-6.023
arians-5.990
ocol-5.930
Token say
Token ID910
Feature activation+1.00e+00

Pos logit contributions
 and+3.878
,+3.816
 the+3.669
.+3.647
 a+3.419
Neg logit contributions
lor-4.622
umm-4.601
 former-4.539
arians-4.513
ocol-4.487
Token anything
Token ID1997
Feature activation+1.00e+00

Pos logit contributions
 and+4.108
,+3.968
.+3.831
 the+3.818
 a+3.581
Neg logit contributions
lor-4.520
umm-4.359
arians-4.291
 former-4.286
ocol-4.271
Token either
Token ID2035
Feature activation+1.00e+00

Pos logit contributions
 and+1.804
,+1.734
 the+1.609
.+1.555
 a+1.395
Neg logit contributions
lor-6.686
umm-6.580
arians-6.541
 former-6.531
osity-6.468
Token.
Token ID13
Feature activation+1.00e+00

Pos logit contributions
 and+0.863
,+0.785
 the+0.689
.+0.614
 a+0.470
Neg logit contributions
lor-7.634
umm-7.537
arians-7.447
 former-7.422
osity-7.402
Token Tom
Token ID4186
Feature activation+1.00e+00

Pos logit contributions
 and+2.916
,+2.859
.+2.687
 the+2.670
 a+2.437
Neg logit contributions
lor-5.695
umm-5.585
 former-5.539
arians-5.511
ocol-5.442
Token was
Token ID373
Feature activation+1.00e+00

Pos logit contributions
 and+2.955
,+2.931
.+2.798
 the+2.797
 a+2.563
Neg logit contributions
lor-5.540
umm-5.408
arians-5.399
 former-5.380
osity-5.267
Token The
Token ID383
Feature activation+1.00e+00

Pos logit contributions
 and+3.107
,+3.033
.+2.881
 the+2.844
 a+2.634
Neg logit contributions
lor-5.425
umm-5.326
 former-5.303
arians-5.271
ocol-5.230
Token table
Token ID3084
Feature activation+1.00e+00

Pos logit contributions
 and+4.176
,+4.120
.+3.955
 the+3.944
 a+3.704
Neg logit contributions
lor-4.374
umm-4.346
 former-4.276
arians-4.251
ocol-4.206
Token did
Token ID750
Feature activation+1.00e+00

Pos logit contributions
 and+3.728
,+3.668
 the+3.549
.+3.524
 a+3.307
Neg logit contributions
lor-4.801
umm-4.700
 former-4.618
arians-4.607
ocol-4.542
Token not
Token ID407
Feature activation+1.00e+00

Pos logit contributions
 and+2.364
,+2.288
 the+2.138
.+2.129
 a+1.891
Neg logit contributions
lor-6.166
 former-6.047
umm-6.023
arians-5.990
ocol-5.930
Token say
Token ID910
Feature activation+1.00e+00

Pos logit contributions
 and+3.878
,+3.816
 the+3.669
.+3.647
 a+3.419
Neg logit contributions
lor-4.622
umm-4.601
 former-4.539
arians-4.513
ocol-4.487
Token anything
Token ID1997
Feature activation+1.00e+00

Pos logit contributions
 and+4.108
,+3.968
.+3.831
 the+3.818
 a+3.581
Neg logit contributions
lor-4.520
umm-4.359
arians-4.291
 former-4.286
ocol-4.271
Token either
Token ID2035
Feature activation+1.00e+00

Pos logit contributions
 and+1.804
,+1.734
 the+1.609
.+1.555
 a+1.395
Neg logit contributions
lor-6.686
umm-6.580
arians-6.541
 former-6.531
osity-6.468
Token.
Token ID13
Feature activation+1.00e+00

Pos logit contributions
 and+0.863
,+0.785
 the+0.689
.+0.614
 a+0.470
Neg logit contributions
lor-7.634
umm-7.537
arians-7.447
 former-7.422
osity-7.402
Token Tom
Token ID4186
Feature activation+1.00e+00

Pos logit contributions
 and+2.916
,+2.859
.+2.687
 the+2.670
 a+2.437
Neg logit contributions
lor-5.695
umm-5.585
 former-5.539
arians-5.511
ocol-5.442
Token was
Token ID373
Feature activation+1.00e+00

Pos logit contributions
 and+2.955
,+2.931
.+2.798
 the+2.797
 a+2.563
Neg logit contributions
lor-5.540
umm-5.408
arians-5.399
 former-5.380
osity-5.267
Token scared
Token ID12008
Feature activation+1.00e+00

Pos logit contributions
 and+3.620
,+3.574
.+3.406
 the+3.369
 a+3.101
Neg logit contributions
lor-4.900
 former-4.824
umm-4.802
arians-4.761
ocol-4.709
TokenTom
Token ID13787
Feature activation+1.00e+00

Pos logit contributions
 and+3.683
,+3.615
.+3.452
 the+3.444
 a+3.212
Neg logit contributions
lor-5.003
umm-4.859
arians-4.809
 former-4.796
bek-4.730
Token went
Token ID1816
Feature activation+1.00e+00

Pos logit contributions
 and+2.920
,+2.892
 the+2.770
.+2.769
 a+2.536
Neg logit contributions
lor-5.587
arians-5.442
umm-5.432
 former-5.394
osity-5.307
Token to
Token ID284
Feature activation+1.00e+00

Pos logit contributions
 and+1.828
,+1.777
 the+1.612
.+1.610
 a+1.389
Neg logit contributions
lor-6.703
umm-6.554
arians-6.538
 former-6.535
rol-6.454
Token his
Token ID465
Feature activation+1.00e+00

Pos logit contributions
 and+2.346
,+2.285
.+2.112
 the+2.018
 a+1.824
Neg logit contributions
lor-6.184
 former-6.099
umm-6.093
arians-6.057
ocol-6.025
Token bed
Token ID3996
Feature activation+1.00e+00

Pos logit contributions
 and+3.820
,+3.762
 the+3.609
.+3.606
 a+3.367
Neg logit contributions
lor-4.715
umm-4.678
 former-4.675
arians-4.618
ocol-4.473
Token and
Token ID290
Feature activation+1.00e+00

Pos logit contributions
 and+1.127
,+1.079
 the+0.991
.+0.912
 a+0.746
Neg logit contributions
lor-7.402
umm-7.265
 former-7.243
arians-7.198
osity-7.076
Token hid
Token ID24519
Feature activation+1.00e+00

Pos logit contributions
 and+3.565
,+3.514
.+3.363
 the+3.304
 a+3.075
Neg logit contributions
lor-4.983
 former-4.861
umm-4.815
arians-4.811
osity-4.710
Token under
Token ID739
Feature activation+1.00e+00

Pos logit contributions
 and+2.961
,+2.916
 the+2.690
.+2.689
 a+2.492
Neg logit contributions
lor-5.536
umm-5.473
ocol-5.414
arians-5.409
 former-5.398
Token the
Token ID262
Feature activation+1.00e+00

Pos logit contributions
 and+1.476
,+1.413
.+1.229
 the+1.136
 a+0.913
Neg logit contributions
lor-7.066
umm-6.989
 former-6.947
ocol-6.920
arians-6.883
Token covers
Token ID8698
Feature activation+1.00e+00

Pos logit contributions
 and+4.460
,+4.419
 the+4.259
.+4.233
 a+4.042
Neg logit contributions
lor-4.081
umm-4.021
 former-3.920
ocol-3.888
arians-3.887
Token.
Token ID13
Feature activation+1.00e+00

Pos logit contributions
 and+0.748
,+0.696
 the+0.537
.+0.477
 a+0.335
Neg logit contributions
lor-7.807
umm-7.699
arians-7.581
 former-7.579
ocol-7.540
Token
Token ID198
Feature activation+1.00e+00

Pos logit contributions
 and+1.372
,+1.309
.+1.121
 the+1.113
 a+0.894
Neg logit contributions
lor-7.241
umm-7.056
 former-7.029
arians-7.001
ocol-6.948
TokenTom
Token ID13787
Feature activation+1.00e+00

Pos logit contributions
 and+3.683
,+3.615
.+3.452
 the+3.444
 a+3.212
Neg logit contributions
lor-5.003
umm-4.859
arians-4.809
 former-4.796
bek-4.730
Token went
Token ID1816
Feature activation+1.00e+00

Pos logit contributions
 and+2.920
,+2.892
 the+2.770
.+2.769
 a+2.536
Neg logit contributions
lor-5.587
arians-5.442
umm-5.432
 former-5.394
osity-5.307
Token to
Token ID284
Feature activation+1.00e+00

Pos logit contributions
 and+1.828
,+1.777
 the+1.612
.+1.610
 a+1.389
Neg logit contributions
lor-6.703
umm-6.554
arians-6.538
 former-6.535
rol-6.454
Token his
Token ID465
Feature activation+1.00e+00

Pos logit contributions
 and+2.346
,+2.285
.+2.112
 the+2.018
 a+1.824
Neg logit contributions
lor-6.184
 former-6.099
umm-6.093
arians-6.057
ocol-6.025
Token bed
Token ID3996
Feature activation+1.00e+00

Pos logit contributions
 and+3.820
,+3.762
 the+3.609
.+3.606
 a+3.367
Neg logit contributions
lor-4.715
umm-4.678
 former-4.675
arians-4.618
ocol-4.473
Token and
Token ID290
Feature activation+1.00e+00

Pos logit contributions
 and+1.127
,+1.079
 the+0.991
.+0.912
 a+0.746
Neg logit contributions
lor-7.402
umm-7.265
 former-7.243
arians-7.198
osity-7.076
Token hid
Token ID24519
Feature activation+1.00e+00

Pos logit contributions
 and+3.565
,+3.514
.+3.363
 the+3.304
 a+3.075
Neg logit contributions
lor-4.983
 former-4.861
umm-4.815
arians-4.811
osity-4.710
Token under
Token ID739
Feature activation+1.00e+00

Pos logit contributions
 and+2.961
,+2.916
 the+2.690
.+2.689
 a+2.492
Neg logit contributions
lor-5.536
umm-5.473
ocol-5.414
arians-5.409
 former-5.398
Token the
Token ID262
Feature activation+1.00e+00

Pos logit contributions
 and+1.476
,+1.413
.+1.229
 the+1.136
 a+0.913
Neg logit contributions
lor-7.066
umm-6.989
 former-6.947
ocol-6.920
arians-6.883
Token covers
Token ID8698
Feature activation+1.00e+00

Pos logit contributions
 and+4.460
,+4.419
 the+4.259
.+4.233
 a+4.042
Neg logit contributions
lor-4.081
umm-4.021
 former-3.920
ocol-3.888
arians-3.887
Token.
Token ID13
Feature activation+1.00e+00

Pos logit contributions
 and+0.464
,+0.409
 the+0.320
.+0.233
 a+0.123
Neg logit contributions
umm-7.913
lor-7.888
arians-7.766
ocol-7.763
 former-7.749
Token
Token ID198
Feature activation+1.00e+00

Pos logit contributions
 and+3.015
,+2.951
.+2.771
 the+2.762
 a+2.526
Neg logit contributions
lor-5.593
umm-5.522
 former-5.444
arians-5.420
osity-5.352
Token
Token ID198
Feature activation+1.00e+00

Pos logit contributions
 and+1.372
,+1.309
.+1.121
 the+1.113
 a+0.894
Neg logit contributions
lor-7.241
umm-7.056
 former-7.029
arians-7.001
ocol-6.948
TokenTom
Token ID13787
Feature activation+1.00e+00

Pos logit contributions
 and+3.683
,+3.615
.+3.452
 the+3.444
 a+3.212
Neg logit contributions
lor-5.003
umm-4.859
arians-4.809
 former-4.796
bek-4.730
Token went
Token ID1816
Feature activation+1.00e+00

Pos logit contributions
 and+2.920
,+2.892
 the+2.770
.+2.769
 a+2.536
Neg logit contributions
lor-5.587
arians-5.442
umm-5.432
 former-5.394
osity-5.307
Token to
Token ID284
Feature activation+1.00e+00

Pos logit contributions
 and+1.828
,+1.777
 the+1.612
.+1.610
 a+1.389
Neg logit contributions
lor-6.703
umm-6.554
arians-6.538
 former-6.535
rol-6.454
Token his
Token ID465
Feature activation+1.00e+00

Pos logit contributions
 and+2.346
,+2.285
.+2.112
 the+2.018
 a+1.824
Neg logit contributions
lor-6.184
 former-6.099
umm-6.093
arians-6.057
ocol-6.025
Token bed
Token ID3996
Feature activation+1.00e+00

Pos logit contributions
 and+3.820
,+3.762
 the+3.609
.+3.606
 a+3.367
Neg logit contributions
lor-4.715
umm-4.678
 former-4.675
arians-4.618
ocol-4.473
Token and
Token ID290
Feature activation+1.00e+00

Pos logit contributions
 and+1.127
,+1.079
 the+0.991
.+0.912
 a+0.746
Neg logit contributions
lor-7.402
umm-7.265
 former-7.243
arians-7.198
osity-7.076
Token hid
Token ID24519
Feature activation+1.00e+00

Pos logit contributions
 and+3.565
,+3.514
.+3.363
 the+3.304
 a+3.075
Neg logit contributions
lor-4.983
 former-4.861
umm-4.815
arians-4.811
osity-4.710
Token under
Token ID739
Feature activation+1.00e+00

Pos logit contributions
 and+2.961
,+2.916
 the+2.690
.+2.689
 a+2.492
Neg logit contributions
lor-5.536
umm-5.473
ocol-5.414
arians-5.409
 former-5.398
Token
Token ID198
Feature activation+1.00e+00

Pos logit contributions
 and+3.015
,+2.951
.+2.771
 the+2.762
 a+2.526
Neg logit contributions
lor-5.593
umm-5.522
 former-5.444
arians-5.420
osity-5.352
Token
Token ID198
Feature activation+1.00e+00

Pos logit contributions
 and+1.372
,+1.309
.+1.121
 the+1.113
 a+0.894
Neg logit contributions
lor-7.241
umm-7.056
 former-7.029
arians-7.001
ocol-6.948
TokenTom
Token ID13787
Feature activation+1.00e+00

Pos logit contributions
 and+3.683
,+3.615
.+3.452
 the+3.444
 a+3.212
Neg logit contributions
lor-5.003
umm-4.859
arians-4.809
 former-4.796
bek-4.730
Token went
Token ID1816
Feature activation+1.00e+00

Pos logit contributions
 and+2.920
,+2.892
 the+2.770
.+2.769
 a+2.536
Neg logit contributions
lor-5.587
arians-5.442
umm-5.432
 former-5.394
osity-5.307
Token to
Token ID284
Feature activation+1.00e+00

Pos logit contributions
 and+1.828
,+1.777
 the+1.612
.+1.610
 a+1.389
Neg logit contributions
lor-6.703
umm-6.554
arians-6.538
 former-6.535
rol-6.454
Token his
Token ID465
Feature activation+1.00e+00

Pos logit contributions
 and+2.346
,+2.285
.+2.112
 the+2.018
 a+1.824
Neg logit contributions
lor-6.184
 former-6.099
umm-6.093
arians-6.057
ocol-6.025
Token bed
Token ID3996
Feature activation+1.00e+00

Pos logit contributions
 and+3.820
,+3.762
 the+3.609
.+3.606
 a+3.367
Neg logit contributions
lor-4.715
umm-4.678
 former-4.675
arians-4.618
ocol-4.473
Token and
Token ID290
Feature activation+1.00e+00

Pos logit contributions
 and+1.127
,+1.079
 the+0.991
.+0.912
 a+0.746
Neg logit contributions
lor-7.402
umm-7.265
 former-7.243
arians-7.198
osity-7.076
Token hid
Token ID24519
Feature activation+1.00e+00

Pos logit contributions
 and+3.565
,+3.514
.+3.363
 the+3.304
 a+3.075
Neg logit contributions
lor-4.983
 former-4.861
umm-4.815
arians-4.811
osity-4.710
Token under
Token ID739
Feature activation+1.00e+00

Pos logit contributions
 and+2.961
,+2.916
 the+2.690
.+2.689
 a+2.492
Neg logit contributions
lor-5.536
umm-5.473
ocol-5.414
arians-5.409
 former-5.398
Token the
Token ID262
Feature activation+1.00e+00

Pos logit contributions
 and+1.476
,+1.413
.+1.229
 the+1.136
 a+0.913
Neg logit contributions
lor-7.066
umm-6.989
 former-6.947
ocol-6.920
arians-6.883
Token talking
Token ID3375
Feature activation+1.00e+00

Pos logit contributions
 and+3.724
,+3.635
 the+3.508
.+3.483
 a+3.262
Neg logit contributions
lor-4.826
umm-4.701
 former-4.698
arians-4.670
osity-4.589
Token,
Token ID11
Feature activation+1.00e+00

Pos logit contributions
 and+2.775
,+2.727
 the+2.589
.+2.554
 a+2.342
Neg logit contributions
lor-5.764
umm-5.656
 former-5.624
arians-5.599
osity-5.535
Token table
Token ID3084
Feature activation+1.00e+00

Pos logit contributions
 and+3.718
,+3.674
.+3.540
 the+3.507
 a+3.284
Neg logit contributions
lor-4.794
umm-4.741
 former-4.700
arians-4.671
ocol-4.595
Token?"
Token ID1701
Feature activation+1.00e+00

Pos logit contributions
 and+2.158
,+2.096
 the+1.995
.+1.944
 a+1.751
Neg logit contributions
lor-6.357
umm-6.282
 former-6.198
arians-6.185
ocol-6.109
Token The
Token ID383
Feature activation+1.00e+00

Pos logit contributions
 and+3.107
,+3.033
.+2.881
 the+2.844
 a+2.634
Neg logit contributions
lor-5.425
umm-5.326
 former-5.303
arians-5.271
ocol-5.230
Token table
Token ID3084
Feature activation+1.00e+00

Pos logit contributions
 and+4.176
,+4.120
.+3.955
 the+3.944
 a+3.704
Neg logit contributions
lor-4.374
umm-4.346
 former-4.276
arians-4.251
ocol-4.206
Token did
Token ID750
Feature activation+1.00e+00

Pos logit contributions
 and+3.728
,+3.668
 the+3.549
.+3.524
 a+3.307
Neg logit contributions
lor-4.801
umm-4.700
 former-4.618
arians-4.607
ocol-4.542
Token not
Token ID407
Feature activation+1.00e+00

Pos logit contributions
 and+2.364
,+2.288
 the+2.138
.+2.129
 a+1.891
Neg logit contributions
lor-6.166
 former-6.047
umm-6.023
arians-5.990
ocol-5.930
Token say
Token ID910
Feature activation+1.00e+00

Pos logit contributions
 and+3.878
,+3.816
 the+3.669
.+3.647
 a+3.419
Neg logit contributions
lor-4.622
umm-4.601
 former-4.539
arians-4.513
ocol-4.487
Token anything
Token ID1997
Feature activation+1.00e+00

Pos logit contributions
 and+4.108
,+3.968
.+3.831
 the+3.818
 a+3.581
Neg logit contributions
lor-4.520
umm-4.359
arians-4.291
 former-4.286
ocol-4.271
Token either
Token ID2035
Feature activation+1.00e+00

Pos logit contributions
 and+1.804
,+1.734
 the+1.609
.+1.555
 a+1.395
Neg logit contributions
lor-6.686
umm-6.580
arians-6.541
 former-6.531
osity-6.468
Token you
Token ID345
Feature activation+1.00e+00

Pos logit contributions
 and+3.216
,+3.167
.+3.034
 the+3.023
 a+2.820
Neg logit contributions
lor-5.264
arians-5.180
umm-5.172
 former-5.111
osity-5.059
Token talking
Token ID3375
Feature activation+1.00e+00

Pos logit contributions
 and+3.724
,+3.635
 the+3.508
.+3.483
 a+3.262
Neg logit contributions
lor-4.826
umm-4.701
 former-4.698
arians-4.670
osity-4.589
Token,
Token ID11
Feature activation+1.00e+00

Pos logit contributions
 and+2.775
,+2.727
 the+2.589
.+2.554
 a+2.342
Neg logit contributions
lor-5.764
umm-5.656
 former-5.624
arians-5.599
osity-5.535
Token table
Token ID3084
Feature activation+1.00e+00

Pos logit contributions
 and+3.718
,+3.674
.+3.540
 the+3.507
 a+3.284
Neg logit contributions
lor-4.794
umm-4.741
 former-4.700
arians-4.671
ocol-4.595
Token?"
Token ID1701
Feature activation+1.00e+00

Pos logit contributions
 and+2.158
,+2.096
 the+1.995
.+1.944
 a+1.751
Neg logit contributions
lor-6.357
umm-6.282
 former-6.198
arians-6.185
ocol-6.109
Token The
Token ID383
Feature activation+1.00e+00

Pos logit contributions
 and+3.107
,+3.033
.+2.881
 the+2.844
 a+2.634
Neg logit contributions
lor-5.425
umm-5.326
 former-5.303
arians-5.271
ocol-5.230
Token table
Token ID3084
Feature activation+1.00e+00

Pos logit contributions
 and+4.176
,+4.120
.+3.955
 the+3.944
 a+3.704
Neg logit contributions
lor-4.374
umm-4.346
 former-4.276
arians-4.251
ocol-4.206
Token did
Token ID750
Feature activation+1.00e+00

Pos logit contributions
 and+3.728
,+3.668
 the+3.549
.+3.524
 a+3.307
Neg logit contributions
lor-4.801
umm-4.700
 former-4.618
arians-4.607
ocol-4.542
Token not
Token ID407
Feature activation+1.00e+00

Pos logit contributions
 and+2.364
,+2.288
 the+2.138
.+2.129
 a+1.891
Neg logit contributions
lor-6.166
 former-6.047
umm-6.023
arians-5.990
ocol-5.930
Token say
Token ID910
Feature activation+1.00e+00

Pos logit contributions
 and+3.878
,+3.816
 the+3.669
.+3.647
 a+3.419
Neg logit contributions
lor-4.622
umm-4.601
 former-4.539
arians-4.513
ocol-4.487
Token anything
Token ID1997
Feature activation+1.00e+00

Pos logit contributions
 and+4.108
,+3.968
.+3.831
 the+3.818
 a+3.581
Neg logit contributions
lor-4.520
umm-4.359
arians-4.291
 former-4.286
ocol-4.271
Token,
Token ID11
Feature activation+1.00e+00

Pos logit contributions
 and+2.775
,+2.727
 the+2.589
.+2.554
 a+2.342
Neg logit contributions
lor-5.764
umm-5.656
 former-5.624
arians-5.599
osity-5.535
Token table
Token ID3084
Feature activation+1.00e+00

Pos logit contributions
 and+3.718
,+3.674
.+3.540
 the+3.507
 a+3.284
Neg logit contributions
lor-4.794
umm-4.741
 former-4.700
arians-4.671
ocol-4.595
Token?"
Token ID1701
Feature activation+1.00e+00

Pos logit contributions
 and+2.158
,+2.096
 the+1.995
.+1.944
 a+1.751
Neg logit contributions
lor-6.357
umm-6.282
 former-6.198
arians-6.185
ocol-6.109
Token The
Token ID383
Feature activation+1.00e+00

Pos logit contributions
 and+3.107
,+3.033
.+2.881
 the+2.844
 a+2.634
Neg logit contributions
lor-5.425
umm-5.326
 former-5.303
arians-5.271
ocol-5.230
Token table
Token ID3084
Feature activation+1.00e+00

Pos logit contributions
 and+4.176
,+4.120
.+3.955
 the+3.944
 a+3.704
Neg logit contributions
lor-4.374
umm-4.346
 former-4.276
arians-4.251
ocol-4.206
Token did
Token ID750
Feature activation+1.00e+00

Pos logit contributions
 and+3.728
,+3.668
 the+3.549
.+3.524
 a+3.307
Neg logit contributions
lor-4.801
umm-4.700
 former-4.618
arians-4.607
ocol-4.542
Token not
Token ID407
Feature activation+1.00e+00

Pos logit contributions
 and+2.364
,+2.288
 the+2.138
.+2.129
 a+1.891
Neg logit contributions
lor-6.166
 former-6.047
umm-6.023
arians-5.990
ocol-5.930
Token say
Token ID910
Feature activation+1.00e+00

Pos logit contributions
 and+3.878
,+3.816
 the+3.669
.+3.647
 a+3.419
Neg logit contributions
lor-4.622
umm-4.601
 former-4.539
arians-4.513
ocol-4.487
Token anything
Token ID1997
Feature activation+1.00e+00

Pos logit contributions
 and+4.108
,+3.968
.+3.831
 the+3.818
 a+3.581
Neg logit contributions
lor-4.520
umm-4.359
arians-4.291
 former-4.286
ocol-4.271
Token either
Token ID2035
Feature activation+1.00e+00

Pos logit contributions
 and+1.804
,+1.734
 the+1.609
.+1.555
 a+1.395
Neg logit contributions
lor-6.686
umm-6.580
arians-6.541
 former-6.531
osity-6.468
Token.
Token ID13
Feature activation+1.00e+00

Pos logit contributions
 and+0.863
,+0.785
 the+0.689
.+0.614
 a+0.470
Neg logit contributions
lor-7.634
umm-7.537
arians-7.447
 former-7.422
osity-7.402
Token table
Token ID3084
Feature activation+1.00e+00

Pos logit contributions
 and+3.718
,+3.674
.+3.540
 the+3.507
 a+3.284
Neg logit contributions
lor-4.794
umm-4.741
 former-4.700
arians-4.671
ocol-4.595
Token?"
Token ID1701
Feature activation+1.00e+00

Pos logit contributions
 and+2.158
,+2.096
 the+1.995
.+1.944
 a+1.751
Neg logit contributions
lor-6.357
umm-6.282
 former-6.198
arians-6.185
ocol-6.109
Token The
Token ID383
Feature activation+1.00e+00

Pos logit contributions
 and+3.107
,+3.033
.+2.881
 the+2.844
 a+2.634
Neg logit contributions
lor-5.425
umm-5.326
 former-5.303
arians-5.271
ocol-5.230
Token table
Token ID3084
Feature activation+1.00e+00

Pos logit contributions
 and+4.176
,+4.120
.+3.955
 the+3.944
 a+3.704
Neg logit contributions
lor-4.374
umm-4.346
 former-4.276
arians-4.251
ocol-4.206
Token did
Token ID750
Feature activation+1.00e+00

Pos logit contributions
 and+3.728
,+3.668
 the+3.549
.+3.524
 a+3.307
Neg logit contributions
lor-4.801
umm-4.700
 former-4.618
arians-4.607
ocol-4.542
Token not
Token ID407
Feature activation+1.00e+00

Pos logit contributions
 and+2.364
,+2.288
 the+2.138
.+2.129
 a+1.891
Neg logit contributions
lor-6.166
 former-6.047
umm-6.023
arians-5.990
ocol-5.930
Token say
Token ID910
Feature activation+1.00e+00

Pos logit contributions
 and+3.878
,+3.816
 the+3.669
.+3.647
 a+3.419
Neg logit contributions
lor-4.622
umm-4.601
 former-4.539
arians-4.513
ocol-4.487
Token anything
Token ID1997
Feature activation+1.00e+00

Pos logit contributions
 and+4.108
,+3.968
.+3.831
 the+3.818
 a+3.581
Neg logit contributions
lor-4.520
umm-4.359
arians-4.291
 former-4.286
ocol-4.271
Token either
Token ID2035
Feature activation+1.00e+00

Pos logit contributions
 and+1.804
,+1.734
 the+1.609
.+1.555
 a+1.395
Neg logit contributions
lor-6.686
umm-6.580
arians-6.541
 former-6.531
osity-6.468
Token.
Token ID13
Feature activation+1.00e+00

Pos logit contributions
 and+0.863
,+0.785
 the+0.689
.+0.614
 a+0.470
Neg logit contributions
lor-7.634
umm-7.537
arians-7.447
 former-7.422
osity-7.402
Token Tom
Token ID4186
Feature activation+1.00e+00

Pos logit contributions
 and+2.916
,+2.859
.+2.687
 the+2.670
 a+2.437
Neg logit contributions
lor-5.695
umm-5.585
 former-5.539
arians-5.511
ocol-5.442
Token either
Token ID2035
Feature activation+1.00e+00

Pos logit contributions
 and+1.804
,+1.734
 the+1.609
.+1.555
 a+1.395
Neg logit contributions
lor-6.686
umm-6.580
arians-6.541
 former-6.531
osity-6.468
Token.
Token ID13
Feature activation+1.00e+00

Pos logit contributions
 and+0.863
,+0.785
 the+0.689
.+0.614
 a+0.470
Neg logit contributions
lor-7.634
umm-7.537
arians-7.447
 former-7.422
osity-7.402
Token Tom
Token ID4186
Feature activation+1.00e+00

Pos logit contributions
 and+2.916
,+2.859
.+2.687
 the+2.670
 a+2.437
Neg logit contributions
lor-5.695
umm-5.585
 former-5.539
arians-5.511
ocol-5.442
Token was
Token ID373
Feature activation+1.00e+00

Pos logit contributions
 and+2.955
,+2.931
.+2.798
 the+2.797
 a+2.563
Neg logit contributions
lor-5.540
umm-5.408
arians-5.399
 former-5.380
osity-5.267
Token scared
Token ID12008
Feature activation+1.00e+00

Pos logit contributions
 and+3.620
,+3.574
.+3.406
 the+3.369
 a+3.101
Neg logit contributions
lor-4.900
 former-4.824
umm-4.802
arians-4.761
ocol-4.709
Token.
Token ID13
Feature activation+1.00e+00

Pos logit contributions
 and+0.464
,+0.409
 the+0.320
.+0.233
 a+0.123
Neg logit contributions
umm-7.913
lor-7.888
arians-7.766
ocol-7.763
 former-7.749
Token
Token ID198
Feature activation+1.00e+00

Pos logit contributions
 and+3.015
,+2.951
.+2.771
 the+2.762
 a+2.526
Neg logit contributions
lor-5.593
umm-5.522
 former-5.444
arians-5.420
osity-5.352
Token
Token ID198
Feature activation+1.00e+00

Pos logit contributions
 and+1.372
,+1.309
.+1.121
 the+1.113
 a+0.894
Neg logit contributions
lor-7.241
umm-7.056
 former-7.029
arians-7.001
ocol-6.948
TokenTom
Token ID13787
Feature activation+1.00e+00

Pos logit contributions
 and+3.683
,+3.615
.+3.452
 the+3.444
 a+3.212
Neg logit contributions
lor-5.003
umm-4.859
arians-4.809
 former-4.796
bek-4.730
Token went
Token ID1816
Feature activation+1.00e+00

Pos logit contributions
 and+2.920
,+2.892
 the+2.770
.+2.769
 a+2.536
Neg logit contributions
lor-5.587
arians-5.442
umm-5.432
 former-5.394
osity-5.307
Token to
Token ID284
Feature activation+1.00e+00

Pos logit contributions
 and+1.828
,+1.777
 the+1.612
.+1.610
 a+1.389
Neg logit contributions
lor-6.703
umm-6.554
arians-6.538
 former-6.535
rol-6.454
Token anything
Token ID1997
Feature activation+1.00e+00

Pos logit contributions
 and+4.108
,+3.968
.+3.831
 the+3.818
 a+3.581
Neg logit contributions
lor-4.520
umm-4.359
arians-4.291
 former-4.286
ocol-4.271
Token either
Token ID2035
Feature activation+1.00e+00

Pos logit contributions
 and+1.804
,+1.734
 the+1.609
.+1.555
 a+1.395
Neg logit contributions
lor-6.686
umm-6.580
arians-6.541
 former-6.531
osity-6.468
Token.
Token ID13
Feature activation+1.00e+00

Pos logit contributions
 and+0.863
,+0.785
 the+0.689
.+0.614
 a+0.470
Neg logit contributions
lor-7.634
umm-7.537
arians-7.447
 former-7.422
osity-7.402
Token Tom
Token ID4186
Feature activation+1.00e+00

Pos logit contributions
 and+2.916
,+2.859
.+2.687
 the+2.670
 a+2.437
Neg logit contributions
lor-5.695
umm-5.585
 former-5.539
arians-5.511
ocol-5.442
Token was
Token ID373
Feature activation+1.00e+00

Pos logit contributions
 and+2.955
,+2.931
.+2.798
 the+2.797
 a+2.563
Neg logit contributions
lor-5.540
umm-5.408
arians-5.399
 former-5.380
osity-5.267
Token scared
Token ID12008
Feature activation+1.00e+00

Pos logit contributions
 and+3.620
,+3.574
.+3.406
 the+3.369
 a+3.101
Neg logit contributions
lor-4.900
 former-4.824
umm-4.802
arians-4.761
ocol-4.709
Token.
Token ID13
Feature activation+1.00e+00

Pos logit contributions
 and+0.464
,+0.409
 the+0.320
.+0.233
 a+0.123
Neg logit contributions
umm-7.913
lor-7.888
arians-7.766
ocol-7.763
 former-7.749
Token
Token ID198
Feature activation+1.00e+00

Pos logit contributions
 and+3.015
,+2.951
.+2.771
 the+2.762
 a+2.526
Neg logit contributions
lor-5.593
umm-5.522
 former-5.444
arians-5.420
osity-5.352
Token
Token ID198
Feature activation+1.00e+00

Pos logit contributions
 and+1.372
,+1.309
.+1.121
 the+1.113
 a+0.894
Neg logit contributions
lor-7.241
umm-7.056
 former-7.029
arians-7.001
ocol-6.948
TokenTom
Token ID13787
Feature activation+1.00e+00

Pos logit contributions
 and+3.683
,+3.615
.+3.452
 the+3.444
 a+3.212
Neg logit contributions
lor-5.003
umm-4.859
arians-4.809
 former-4.796
bek-4.730
Token went
Token ID1816
Feature activation+1.00e+00

Pos logit contributions
 and+2.920
,+2.892
 the+2.770
.+2.769
 a+2.536
Neg logit contributions
lor-5.587
arians-5.442
umm-5.432
 former-5.394
osity-5.307
Token not
Token ID407
Feature activation+1.00e+00

Pos logit contributions
 and+2.364
,+2.288
 the+2.138
.+2.129
 a+1.891
Neg logit contributions
lor-6.166
 former-6.047
umm-6.023
arians-5.990
ocol-5.930
Token say
Token ID910
Feature activation+1.00e+00

Pos logit contributions
 and+3.878
,+3.816
 the+3.669
.+3.647
 a+3.419
Neg logit contributions
lor-4.622
umm-4.601
 former-4.539
arians-4.513
ocol-4.487
Token anything
Token ID1997
Feature activation+1.00e+00

Pos logit contributions
 and+4.108
,+3.968
.+3.831
 the+3.818
 a+3.581
Neg logit contributions
lor-4.520
umm-4.359
arians-4.291
 former-4.286
ocol-4.271
Token either
Token ID2035
Feature activation+1.00e+00

Pos logit contributions
 and+1.804
,+1.734
 the+1.609
.+1.555
 a+1.395
Neg logit contributions
lor-6.686
umm-6.580
arians-6.541
 former-6.531
osity-6.468
Token.
Token ID13
Feature activation+1.00e+00

Pos logit contributions
 and+0.863
,+0.785
 the+0.689
.+0.614
 a+0.470
Neg logit contributions
lor-7.634
umm-7.537
arians-7.447
 former-7.422
osity-7.402
Token Tom
Token ID4186
Feature activation+1.00e+00

Pos logit contributions
 and+2.916
,+2.859
.+2.687
 the+2.670
 a+2.437
Neg logit contributions
lor-5.695
umm-5.585
 former-5.539
arians-5.511
ocol-5.442
Token was
Token ID373
Feature activation+1.00e+00

Pos logit contributions
 and+2.955
,+2.931
.+2.798
 the+2.797
 a+2.563
Neg logit contributions
lor-5.540
umm-5.408
arians-5.399
 former-5.380
osity-5.267
Token scared
Token ID12008
Feature activation+1.00e+00

Pos logit contributions
 and+3.620
,+3.574
.+3.406
 the+3.369
 a+3.101
Neg logit contributions
lor-4.900
 former-4.824
umm-4.802
arians-4.761
ocol-4.709
Token.
Token ID13
Feature activation+1.00e+00

Pos logit contributions
 and+0.464
,+0.409
 the+0.320
.+0.233
 a+0.123
Neg logit contributions
umm-7.913
lor-7.888
arians-7.766
ocol-7.763
 former-7.749
Token
Token ID198
Feature activation+1.00e+00

Pos logit contributions
 and+3.015
,+2.951
.+2.771
 the+2.762
 a+2.526
Neg logit contributions
lor-5.593
umm-5.522
 former-5.444
arians-5.420
osity-5.352
Token
Token ID198
Feature activation+1.00e+00

Pos logit contributions
 and+1.372
,+1.309
.+1.121
 the+1.113
 a+0.894
Neg logit contributions
lor-7.241
umm-7.056
 former-7.029
arians-7.001
ocol-6.948
Token say
Token ID910
Feature activation+1.00e+00

Pos logit contributions
 and+3.878
,+3.816
 the+3.669
.+3.647
 a+3.419
Neg logit contributions
lor-4.622
umm-4.601
 former-4.539
arians-4.513
ocol-4.487
Token anything
Token ID1997
Feature activation+1.00e+00

Pos logit contributions
 and+4.108
,+3.968
.+3.831
 the+3.818
 a+3.581
Neg logit contributions
lor-4.520
umm-4.359
arians-4.291
 former-4.286
ocol-4.271
Token either
Token ID2035
Feature activation+1.00e+00

Pos logit contributions
 and+1.804
,+1.734
 the+1.609
.+1.555
 a+1.395
Neg logit contributions
lor-6.686
umm-6.580
arians-6.541
 former-6.531
osity-6.468
Token.
Token ID13
Feature activation+1.00e+00

Pos logit contributions
 and+0.863
,+0.785
 the+0.689
.+0.614
 a+0.470
Neg logit contributions
lor-7.634
umm-7.537
arians-7.447
 former-7.422
osity-7.402
Token Tom
Token ID4186
Feature activation+1.00e+00

Pos logit contributions
 and+2.916
,+2.859
.+2.687
 the+2.670
 a+2.437
Neg logit contributions
lor-5.695
umm-5.585
 former-5.539
arians-5.511
ocol-5.442
Token was
Token ID373
Feature activation+1.00e+00

Pos logit contributions
 and+2.955
,+2.931
.+2.798
 the+2.797
 a+2.563
Neg logit contributions
lor-5.540
umm-5.408
arians-5.399
 former-5.380
osity-5.267
Token scared
Token ID12008
Feature activation+1.00e+00

Pos logit contributions
 and+3.620
,+3.574
.+3.406
 the+3.369
 a+3.101
Neg logit contributions
lor-4.900
 former-4.824
umm-4.802
arians-4.761
ocol-4.709
Token.
Token ID13
Feature activation+1.00e+00

Pos logit contributions
 and+0.464
,+0.409
 the+0.320
.+0.233
 a+0.123
Neg logit contributions
umm-7.913
lor-7.888
arians-7.766
ocol-7.763
 former-7.749
Token
Token ID198
Feature activation+1.00e+00

Pos logit contributions
 and+3.015
,+2.951
.+2.771
 the+2.762
 a+2.526
Neg logit contributions
lor-5.593
umm-5.522
 former-5.444
arians-5.420
osity-5.352
Token
Token ID198
Feature activation+1.00e+00

Pos logit contributions
 and+1.372
,+1.309
.+1.121
 the+1.113
 a+0.894
Neg logit contributions
lor-7.241
umm-7.056
 former-7.029
arians-7.001
ocol-6.948
TokenTom
Token ID13787
Feature activation+1.00e+00

Pos logit contributions
 and+3.683
,+3.615
.+3.452
 the+3.444
 a+3.212
Neg logit contributions
lor-5.003
umm-4.859
arians-4.809
 former-4.796
bek-4.730
Token Tom
Token ID4186
Feature activation+1.00e+00

Pos logit contributions
 and+2.916
,+2.859
.+2.687
 the+2.670
 a+2.437
Neg logit contributions
lor-5.695
umm-5.585
 former-5.539
arians-5.511
ocol-5.442
Token was
Token ID373
Feature activation+1.00e+00

Pos logit contributions
 and+2.955
,+2.931
.+2.798
 the+2.797
 a+2.563
Neg logit contributions
lor-5.540
umm-5.408
arians-5.399
 former-5.380
osity-5.267
Token scared
Token ID12008
Feature activation+1.00e+00

Pos logit contributions
 and+3.620
,+3.574
.+3.406
 the+3.369
 a+3.101
Neg logit contributions
lor-4.900
 former-4.824
umm-4.802
arians-4.761
ocol-4.709
Token.
Token ID13
Feature activation+1.00e+00

Pos logit contributions
 and+0.464
,+0.409
 the+0.320
.+0.233
 a+0.123
Neg logit contributions
umm-7.913
lor-7.888
arians-7.766
ocol-7.763
 former-7.749
Token
Token ID198
Feature activation+1.00e+00

Pos logit contributions
 and+3.015
,+2.951
.+2.771
 the+2.762
 a+2.526
Neg logit contributions
lor-5.593
umm-5.522
 former-5.444
arians-5.420
osity-5.352
Token
Token ID198
Feature activation+1.00e+00

Pos logit contributions
 and+1.372
,+1.309
.+1.121
 the+1.113
 a+0.894
Neg logit contributions
lor-7.241
umm-7.056
 former-7.029
arians-7.001
ocol-6.948
TokenTom
Token ID13787
Feature activation+1.00e+00

Pos logit contributions
 and+3.683
,+3.615
.+3.452
 the+3.444
 a+3.212
Neg logit contributions
lor-5.003
umm-4.859
arians-4.809
 former-4.796
bek-4.730
Token went
Token ID1816
Feature activation+1.00e+00

Pos logit contributions
 and+2.920
,+2.892
 the+2.770
.+2.769
 a+2.536
Neg logit contributions
lor-5.587
arians-5.442
umm-5.432
 former-5.394
osity-5.307
Token to
Token ID284
Feature activation+1.00e+00

Pos logit contributions
 and+1.828
,+1.777
 the+1.612
.+1.610
 a+1.389
Neg logit contributions
lor-6.703
umm-6.554
arians-6.538
 former-6.535
rol-6.454
Token his
Token ID465
Feature activation+1.00e+00

Pos logit contributions
 and+2.346
,+2.285
.+2.112
 the+2.018
 a+1.824
Neg logit contributions
lor-6.184
 former-6.099
umm-6.093
arians-6.057
ocol-6.025
Token bed
Token ID3996
Feature activation+1.00e+00

Pos logit contributions
 and+3.820
,+3.762
 the+3.609
.+3.606
 a+3.367
Neg logit contributions
lor-4.715
umm-4.678
 former-4.675
arians-4.618
ocol-4.473
Token.
Token ID13
Feature activation+1.00e+00

Pos logit contributions
 and+0.863
,+0.785
 the+0.689
.+0.614
 a+0.470
Neg logit contributions
lor-7.634
umm-7.537
arians-7.447
 former-7.422
osity-7.402
Token Tom
Token ID4186
Feature activation+1.00e+00

Pos logit contributions
 and+2.916
,+2.859
.+2.687
 the+2.670
 a+2.437
Neg logit contributions
lor-5.695
umm-5.585
 former-5.539
arians-5.511
ocol-5.442
Token was
Token ID373
Feature activation+1.00e+00

Pos logit contributions
 and+2.955
,+2.931
.+2.798
 the+2.797
 a+2.563
Neg logit contributions
lor-5.540
umm-5.408
arians-5.399
 former-5.380
osity-5.267
Token scared
Token ID12008
Feature activation+1.00e+00

Pos logit contributions
 and+3.620
,+3.574
.+3.406
 the+3.369
 a+3.101
Neg logit contributions
lor-4.900
 former-4.824
umm-4.802
arians-4.761
ocol-4.709
Token.
Token ID13
Feature activation+1.00e+00

Pos logit contributions
 and+0.464
,+0.409
 the+0.320
.+0.233
 a+0.123
Neg logit contributions
umm-7.913
lor-7.888
arians-7.766
ocol-7.763
 former-7.749
Token
Token ID198
Feature activation+1.00e+00

Pos logit contributions
 and+3.015
,+2.951
.+2.771
 the+2.762
 a+2.526
Neg logit contributions
lor-5.593
umm-5.522
 former-5.444
arians-5.420
osity-5.352
Token
Token ID198
Feature activation+1.00e+00

Pos logit contributions
 and+1.372
,+1.309
.+1.121
 the+1.113
 a+0.894
Neg logit contributions
lor-7.241
umm-7.056
 former-7.029
arians-7.001
ocol-6.948
TokenTom
Token ID13787
Feature activation+1.00e+00

Pos logit contributions
 and+3.683
,+3.615
.+3.452
 the+3.444
 a+3.212
Neg logit contributions
lor-5.003
umm-4.859
arians-4.809
 former-4.796
bek-4.730
Token went
Token ID1816
Feature activation+1.00e+00

Pos logit contributions
 and+2.920
,+2.892
 the+2.770
.+2.769
 a+2.536
Neg logit contributions
lor-5.587
arians-5.442
umm-5.432
 former-5.394
osity-5.307
Token to
Token ID284
Feature activation+1.00e+00

Pos logit contributions
 and+1.828
,+1.777
 the+1.612
.+1.610
 a+1.389
Neg logit contributions
lor-6.703
umm-6.554
arians-6.538
 former-6.535
rol-6.454
Token his
Token ID465
Feature activation+1.00e+00

Pos logit contributions
 and+2.346
,+2.285
.+2.112
 the+2.018
 a+1.824
Neg logit contributions
lor-6.184
 former-6.099
umm-6.093
arians-6.057
ocol-6.025
Token was
Token ID373
Feature activation+1.00e+00

Pos logit contributions
 and+2.955
,+2.931
.+2.798
 the+2.797
 a+2.563
Neg logit contributions
lor-5.540
umm-5.408
arians-5.399
 former-5.380
osity-5.267
Token scared
Token ID12008
Feature activation+1.00e+00

Pos logit contributions
 and+3.620
,+3.574
.+3.406
 the+3.369
 a+3.101
Neg logit contributions
lor-4.900
 former-4.824
umm-4.802
arians-4.761
ocol-4.709
Token.
Token ID13
Feature activation+1.00e+00

Pos logit contributions
 and+0.464
,+0.409
 the+0.320
.+0.233
 a+0.123
Neg logit contributions
umm-7.913
lor-7.888
arians-7.766
ocol-7.763
 former-7.749
Token
Token ID198
Feature activation+1.00e+00

Pos logit contributions
 and+3.015
,+2.951
.+2.771
 the+2.762
 a+2.526
Neg logit contributions
lor-5.593
umm-5.522
 former-5.444
arians-5.420
osity-5.352
Token
Token ID198
Feature activation+1.00e+00

Pos logit contributions
 and+1.372
,+1.309
.+1.121
 the+1.113
 a+0.894
Neg logit contributions
lor-7.241
umm-7.056
 former-7.029
arians-7.001
ocol-6.948
TokenTom
Token ID13787
Feature activation+1.00e+00

Pos logit contributions
 and+3.683
,+3.615
.+3.452
 the+3.444
 a+3.212
Neg logit contributions
lor-5.003
umm-4.859
arians-4.809
 former-4.796
bek-4.730
Token went
Token ID1816
Feature activation+1.00e+00

Pos logit contributions
 and+2.920
,+2.892
 the+2.770
.+2.769
 a+2.536
Neg logit contributions
lor-5.587
arians-5.442
umm-5.432
 former-5.394
osity-5.307
Token to
Token ID284
Feature activation+1.00e+00

Pos logit contributions
 and+1.828
,+1.777
 the+1.612
.+1.610
 a+1.389
Neg logit contributions
lor-6.703
umm-6.554
arians-6.538
 former-6.535
rol-6.454
Token his
Token ID465
Feature activation+1.00e+00

Pos logit contributions
 and+2.346
,+2.285
.+2.112
 the+2.018
 a+1.824
Neg logit contributions
lor-6.184
 former-6.099
umm-6.093
arians-6.057
ocol-6.025
Token bed
Token ID3996
Feature activation+1.00e+00

Pos logit contributions
 and+3.820
,+3.762
 the+3.609
.+3.606
 a+3.367
Neg logit contributions
lor-4.715
umm-4.678
 former-4.675
arians-4.618
ocol-4.473
Token and
Token ID290
Feature activation+1.00e+00

Pos logit contributions
 and+1.127
,+1.079
 the+0.991
.+0.912
 a+0.746
Neg logit contributions
lor-7.402
umm-7.265
 former-7.243
arians-7.198
osity-7.076
Token scared
Token ID12008
Feature activation+1.00e+00

Pos logit contributions
 and+3.620
,+3.574
.+3.406
 the+3.369
 a+3.101
Neg logit contributions
lor-4.900
 former-4.824
umm-4.802
arians-4.761
ocol-4.709
Token.
Token ID13
Feature activation+1.00e+00

Pos logit contributions
 and+0.464
,+0.409
 the+0.320
.+0.233
 a+0.123
Neg logit contributions
umm-7.913
lor-7.888
arians-7.766
ocol-7.763
 former-7.749
Token
Token ID198
Feature activation+1.00e+00

Pos logit contributions
 and+3.015
,+2.951
.+2.771
 the+2.762
 a+2.526
Neg logit contributions
lor-5.593
umm-5.522
 former-5.444
arians-5.420
osity-5.352
Token
Token ID198
Feature activation+1.00e+00

Pos logit contributions
 and+1.372
,+1.309
.+1.121
 the+1.113
 a+0.894
Neg logit contributions
lor-7.241
umm-7.056
 former-7.029
arians-7.001
ocol-6.948
TokenTom
Token ID13787
Feature activation+1.00e+00

Pos logit contributions
 and+3.683
,+3.615
.+3.452
 the+3.444
 a+3.212
Neg logit contributions
lor-5.003
umm-4.859
arians-4.809
 former-4.796
bek-4.730
Token went
Token ID1816
Feature activation+1.00e+00

Pos logit contributions
 and+2.920
,+2.892
 the+2.770
.+2.769
 a+2.536
Neg logit contributions
lor-5.587
arians-5.442
umm-5.432
 former-5.394
osity-5.307
Token to
Token ID284
Feature activation+1.00e+00

Pos logit contributions
 and+1.828
,+1.777
 the+1.612
.+1.610
 a+1.389
Neg logit contributions
lor-6.703
umm-6.554
arians-6.538
 former-6.535
rol-6.454
Token his
Token ID465
Feature activation+1.00e+00

Pos logit contributions
 and+2.346
,+2.285
.+2.112
 the+2.018
 a+1.824
Neg logit contributions
lor-6.184
 former-6.099
umm-6.093
arians-6.057
ocol-6.025
Token bed
Token ID3996
Feature activation+1.00e+00

Pos logit contributions
 and+3.820
,+3.762
 the+3.609
.+3.606
 a+3.367
Neg logit contributions
lor-4.715
umm-4.678
 former-4.675
arians-4.618
ocol-4.473
Token and
Token ID290
Feature activation+1.00e+00

Pos logit contributions
 and+1.127
,+1.079
 the+0.991
.+0.912
 a+0.746
Neg logit contributions
lor-7.402
umm-7.265
 former-7.243
arians-7.198
osity-7.076
Token hid
Token ID24519
Feature activation+1.00e+00

Pos logit contributions
 and+3.565
,+3.514
.+3.363
 the+3.304
 a+3.075
Neg logit contributions
lor-4.983
 former-4.861
umm-4.815
arians-4.811
osity-4.710
Token did
Token ID750
Feature activation+1.00e+00

Pos logit contributions
 and+3.728
,+3.668
 the+3.549
.+3.524
 a+3.307
Neg logit contributions
lor-4.801
umm-4.700
 former-4.618
arians-4.607
ocol-4.542
Token not
Token ID407
Feature activation+1.00e+00

Pos logit contributions
 and+2.364
,+2.288
 the+2.138
.+2.129
 a+1.891
Neg logit contributions
lor-6.166
 former-6.047
umm-6.023
arians-5.990
ocol-5.930
Token say
Token ID910
Feature activation+1.00e+00

Pos logit contributions
 and+3.878
,+3.816
 the+3.669
.+3.647
 a+3.419
Neg logit contributions
lor-4.622
umm-4.601
 former-4.539
arians-4.513
ocol-4.487
Token anything
Token ID1997
Feature activation+1.00e+00

Pos logit contributions
 and+4.108
,+3.968
.+3.831
 the+3.818
 a+3.581
Neg logit contributions
lor-4.520
umm-4.359
arians-4.291
 former-4.286
ocol-4.271
Token either
Token ID2035
Feature activation+1.00e+00

Pos logit contributions
 and+1.804
,+1.734
 the+1.609
.+1.555
 a+1.395
Neg logit contributions
lor-6.686
umm-6.580
arians-6.541
 former-6.531
osity-6.468
Token.
Token ID13
Feature activation+1.00e+00

Pos logit contributions
 and+0.863
,+0.785
 the+0.689
.+0.614
 a+0.470
Neg logit contributions
lor-7.634
umm-7.537
arians-7.447
 former-7.422
osity-7.402
Token Tom
Token ID4186
Feature activation+1.00e+00

Pos logit contributions
 and+2.916
,+2.859
.+2.687
 the+2.670
 a+2.437
Neg logit contributions
lor-5.695
umm-5.585
 former-5.539
arians-5.511
ocol-5.442
Token was
Token ID373
Feature activation+1.00e+00

Pos logit contributions
 and+2.955
,+2.931
.+2.798
 the+2.797
 a+2.563
Neg logit contributions
lor-5.540
umm-5.408
arians-5.399
 former-5.380
osity-5.267
Token scared
Token ID12008
Feature activation+1.00e+00

Pos logit contributions
 and+3.620
,+3.574
.+3.406
 the+3.369
 a+3.101
Neg logit contributions
lor-4.900
 former-4.824
umm-4.802
arians-4.761
ocol-4.709
Token.
Token ID13
Feature activation+1.00e+00

Pos logit contributions
 and+0.464
,+0.409
 the+0.320
.+0.233
 a+0.123
Neg logit contributions
umm-7.913
lor-7.888
arians-7.766
ocol-7.763
 former-7.749
Token
Token ID198
Feature activation+1.00e+00

Pos logit contributions
 and+3.015
,+2.951
.+2.771
 the+2.762
 a+2.526
Neg logit contributions
lor-5.593
umm-5.522
 former-5.444
arians-5.420
osity-5.352
Token table
Token ID3084
Feature activation+1.00e+00

Pos logit contributions
 and+4.176
,+4.120
.+3.955
 the+3.944
 a+3.704
Neg logit contributions
lor-4.374
umm-4.346
 former-4.276
arians-4.251
ocol-4.206
Token did
Token ID750
Feature activation+1.00e+00

Pos logit contributions
 and+3.728
,+3.668
 the+3.549
.+3.524
 a+3.307
Neg logit contributions
lor-4.801
umm-4.700
 former-4.618
arians-4.607
ocol-4.542
Token not
Token ID407
Feature activation+1.00e+00

Pos logit contributions
 and+2.364
,+2.288
 the+2.138
.+2.129
 a+1.891
Neg logit contributions
lor-6.166
 former-6.047
umm-6.023
arians-5.990
ocol-5.930
Token say
Token ID910
Feature activation+1.00e+00

Pos logit contributions
 and+3.878
,+3.816
 the+3.669
.+3.647
 a+3.419
Neg logit contributions
lor-4.622
umm-4.601
 former-4.539
arians-4.513
ocol-4.487
Token anything
Token ID1997
Feature activation+1.00e+00

Pos logit contributions
 and+4.108
,+3.968
.+3.831
 the+3.818
 a+3.581
Neg logit contributions
lor-4.520
umm-4.359
arians-4.291
 former-4.286
ocol-4.271
Token either
Token ID2035
Feature activation+1.00e+00

Pos logit contributions
 and+1.804
,+1.734
 the+1.609
.+1.555
 a+1.395
Neg logit contributions
lor-6.686
umm-6.580
arians-6.541
 former-6.531
osity-6.468
Token.
Token ID13
Feature activation+1.00e+00

Pos logit contributions
 and+0.863
,+0.785
 the+0.689
.+0.614
 a+0.470
Neg logit contributions
lor-7.634
umm-7.537
arians-7.447
 former-7.422
osity-7.402
Token Tom
Token ID4186
Feature activation+1.00e+00

Pos logit contributions
 and+2.916
,+2.859
.+2.687
 the+2.670
 a+2.437
Neg logit contributions
lor-5.695
umm-5.585
 former-5.539
arians-5.511
ocol-5.442
Token was
Token ID373
Feature activation+1.00e+00

Pos logit contributions
 and+2.955
,+2.931
.+2.798
 the+2.797
 a+2.563
Neg logit contributions
lor-5.540
umm-5.408
arians-5.399
 former-5.380
osity-5.267
Token scared
Token ID12008
Feature activation+1.00e+00

Pos logit contributions
 and+3.620
,+3.574
.+3.406
 the+3.369
 a+3.101
Neg logit contributions
lor-4.900
 former-4.824
umm-4.802
arians-4.761
ocol-4.709
Token.
Token ID13
Feature activation+1.00e+00

Pos logit contributions
 and+0.464
,+0.409
 the+0.320
.+0.233
 a+0.123
Neg logit contributions
umm-7.913
lor-7.888
arians-7.766
ocol-7.763
 former-7.749
Token?"
Token ID1701
Feature activation+1.00e+00

Pos logit contributions
 and+2.158
,+2.096
 the+1.995
.+1.944
 a+1.751
Neg logit contributions
lor-6.357
umm-6.282
 former-6.198
arians-6.185
ocol-6.109
Token The
Token ID383
Feature activation+1.00e+00

Pos logit contributions
 and+3.107
,+3.033
.+2.881
 the+2.844
 a+2.634
Neg logit contributions
lor-5.425
umm-5.326
 former-5.303
arians-5.271
ocol-5.230
Token table
Token ID3084
Feature activation+1.00e+00

Pos logit contributions
 and+4.176
,+4.120
.+3.955
 the+3.944
 a+3.704
Neg logit contributions
lor-4.374
umm-4.346
 former-4.276
arians-4.251
ocol-4.206
Token did
Token ID750
Feature activation+1.00e+00

Pos logit contributions
 and+3.728
,+3.668
 the+3.549
.+3.524
 a+3.307
Neg logit contributions
lor-4.801
umm-4.700
 former-4.618
arians-4.607
ocol-4.542
Token not
Token ID407
Feature activation+1.00e+00

Pos logit contributions
 and+2.364
,+2.288
 the+2.138
.+2.129
 a+1.891
Neg logit contributions
lor-6.166
 former-6.047
umm-6.023
arians-5.990
ocol-5.930
Token say
Token ID910
Feature activation+1.00e+00

Pos logit contributions
 and+3.878
,+3.816
 the+3.669
.+3.647
 a+3.419
Neg logit contributions
lor-4.622
umm-4.601
 former-4.539
arians-4.513
ocol-4.487
Token anything
Token ID1997
Feature activation+1.00e+00

Pos logit contributions
 and+4.108
,+3.968
.+3.831
 the+3.818
 a+3.581
Neg logit contributions
lor-4.520
umm-4.359
arians-4.291
 former-4.286
ocol-4.271
Token either
Token ID2035
Feature activation+1.00e+00

Pos logit contributions
 and+1.804
,+1.734
 the+1.609
.+1.555
 a+1.395
Neg logit contributions
lor-6.686
umm-6.580
arians-6.541
 former-6.531
osity-6.468
Token.
Token ID13
Feature activation+1.00e+00

Pos logit contributions
 and+0.863
,+0.785
 the+0.689
.+0.614
 a+0.470
Neg logit contributions
lor-7.634
umm-7.537
arians-7.447
 former-7.422
osity-7.402
Token Tom
Token ID4186
Feature activation+1.00e+00

Pos logit contributions
 and+2.916
,+2.859
.+2.687
 the+2.670
 a+2.437
Neg logit contributions
lor-5.695
umm-5.585
 former-5.539
arians-5.511
ocol-5.442
Token was
Token ID373
Feature activation+1.00e+00

Pos logit contributions
 and+2.955
,+2.931
.+2.798
 the+2.797
 a+2.563
Neg logit contributions
lor-5.540
umm-5.408
arians-5.399
 former-5.380
osity-5.267
Token The
Token ID383
Feature activation+1.00e+00

Pos logit contributions
 and+3.107
,+3.033
.+2.881
 the+2.844
 a+2.634
Neg logit contributions
lor-5.425
umm-5.326
 former-5.303
arians-5.271
ocol-5.230
Token table
Token ID3084
Feature activation+1.00e+00

Pos logit contributions
 and+4.176
,+4.120
.+3.955
 the+3.944
 a+3.704
Neg logit contributions
lor-4.374
umm-4.346
 former-4.276
arians-4.251
ocol-4.206
Token did
Token ID750
Feature activation+1.00e+00

Pos logit contributions
 and+3.728
,+3.668
 the+3.549
.+3.524
 a+3.307
Neg logit contributions
lor-4.801
umm-4.700
 former-4.618
arians-4.607
ocol-4.542
Token not
Token ID407
Feature activation+1.00e+00

Pos logit contributions
 and+2.364
,+2.288
 the+2.138
.+2.129
 a+1.891
Neg logit contributions
lor-6.166
 former-6.047
umm-6.023
arians-5.990
ocol-5.930
Token say
Token ID910
Feature activation+1.00e+00

Pos logit contributions
 and+3.878
,+3.816
 the+3.669
.+3.647
 a+3.419
Neg logit contributions
lor-4.622
umm-4.601
 former-4.539
arians-4.513
ocol-4.487
Token anything
Token ID1997
Feature activation+1.00e+00

Pos logit contributions
 and+4.108
,+3.968
.+3.831
 the+3.818
 a+3.581
Neg logit contributions
lor-4.520
umm-4.359
arians-4.291
 former-4.286
ocol-4.271
Token either
Token ID2035
Feature activation+1.00e+00

Pos logit contributions
 and+1.804
,+1.734
 the+1.609
.+1.555
 a+1.395
Neg logit contributions
lor-6.686
umm-6.580
arians-6.541
 former-6.531
osity-6.468
Token.
Token ID13
Feature activation+1.00e+00

Pos logit contributions
 and+0.863
,+0.785
 the+0.689
.+0.614
 a+0.470
Neg logit contributions
lor-7.634
umm-7.537
arians-7.447
 former-7.422
osity-7.402
Token Tom
Token ID4186
Feature activation+1.00e+00

Pos logit contributions
 and+2.916
,+2.859
.+2.687
 the+2.670
 a+2.437
Neg logit contributions
lor-5.695
umm-5.585
 former-5.539
arians-5.511
ocol-5.442
Token was
Token ID373
Feature activation+1.00e+00

Pos logit contributions
 and+2.955
,+2.931
.+2.798
 the+2.797
 a+2.563
Neg logit contributions
lor-5.540
umm-5.408
arians-5.399
 former-5.380
osity-5.267
Token scared
Token ID12008
Feature activation+1.00e+00

Pos logit contributions
 and+3.620
,+3.574
.+3.406
 the+3.369
 a+3.101
Neg logit contributions
lor-4.900
 former-4.824
umm-4.802
arians-4.761
ocol-4.709