TOP ACTIVATIONS
MAX = -1.000

Tom went to his bed and hid under the covers .
Tom went to his bed and hid under the covers
. Tom went to his bed and hid under
Tom went to his bed and hid under the
talking , table ?" The table did not say anything either
you talking , table ?" The table did not say anything
, table ?" The table did not say anything either .
table ?" The table did not say anything either . Tom
either . Tom was scared . Tom went to
anything either . Tom was scared . Tom went
not say anything either . Tom was scared .
say anything either . Tom was scared . Tom
Tom was scared . Tom went to his bed
. Tom was scared . Tom went to his
was scared . Tom went to his bed and
scared . Tom went to his bed and hid
did not say anything either . Tom was scared .
table did not say anything either . Tom was scared .
?" The table did not say anything either . Tom was
The table did not say anything either . Tom was scared

MIN ACTIVATIONS
MIN = -1.000

Tom went to his bed and hid under the covers .
Tom went to his bed and hid under the covers
. Tom went to his bed and hid under
Tom went to his bed and hid under the
talking , table ?" The table did not say anything either
you talking , table ?" The table did not say anything
, table ?" The table did not say anything either .
table ?" The table did not say anything either . Tom
either . Tom was scared . Tom went to
anything either . Tom was scared . Tom went
not say anything either . Tom was scared .
say anything either . Tom was scared . Tom
Tom was scared . Tom went to his bed
. Tom was scared . Tom went to his
was scared . Tom went to his bed and
scared . Tom went to his bed and hid
did not say anything either . Tom was scared .
table did not say anything either . Tom was scared .
?" The table did not say anything either . Tom was
The table did not say anything either . Tom was scared

INTERVAL -1.000 to -1.000
CONTAINS 0.000%

TokenTom
Token ID13787
Feature activation-1.00e+00

Pos logit contributions
 and+3.749
,+3.726
.+3.575
 the+3.482
 a+3.344
Neg logit contributions
umm-5.023
lor-5.005
ocol-4.899
arians-4.851
rol-4.849
Token went
Token ID1816
Feature activation-1.00e+00

Pos logit contributions
,+2.995
 and+2.975
.+2.886
 the+2.802
 a+2.662
Neg logit contributions
lor-5.590
umm-5.590
arians-5.486
rol-5.456
ocol-5.448
Token to
Token ID284
Feature activation-1.00e+00

Pos logit contributions
 and+1.878
,+1.871
.+1.718
 the+1.632
 a+1.501
Neg logit contributions
umm-6.763
lor-6.739
ocol-6.658
rol-6.655
arians-6.615
Token his
Token ID465
Feature activation-1.00e+00

Pos logit contributions
 and+2.410
,+2.396
.+2.230
 the+2.033
 a+1.941
Neg logit contributions
umm-6.231
ocol-6.202
lor-6.165
 former-6.112
arians-6.083
Token bed
Token ID3996
Feature activation-1.00e+00

Pos logit contributions
 and+3.812
,+3.802
.+3.656
 the+3.576
 a+3.427
Neg logit contributions
umm-4.906
lor-4.777
 former-4.770
arians-4.728
ocol-4.715
Token and
Token ID290
Feature activation-1.00e+00

Pos logit contributions
,+1.184
 and+1.183
 the+1.035
.+1.026
 a+0.882
Neg logit contributions
umm-7.412
lor-7.400
 former-7.258
arians-7.228
rol-7.205
Token hid
Token ID24519
Feature activation-1.00e+00

Pos logit contributions
 and+3.637
,+3.636
.+3.497
 the+3.338
 a+3.204
Neg logit contributions
lor-4.959
umm-4.925
 former-4.863
ocol-4.854
arians-4.823
Token under
Token ID739
Feature activation-1.00e+00

Pos logit contributions
,+3.041
 and+3.036
.+2.809
 the+2.728
 a+2.632
Neg logit contributions
umm-5.597
ocol-5.581
lor-5.497
rol-5.437
arians-5.417
Token the
Token ID262
Feature activation-1.00e+00

Pos logit contributions
 and+1.538
,+1.520
.+1.350
 the+1.174
 a+1.045
Neg logit contributions
umm-7.161
ocol-7.108
lor-7.077
 former-6.975
arians-6.934
Token covers
Token ID8698
Feature activation-1.00e+00

Pos logit contributions
,+4.516
 and+4.512
.+4.342
 the+4.283
 a+4.161
Neg logit contributions
umm-4.196
lor-4.096
ocol-4.084
rol-3.956
 former-3.954
Token.
Token ID13
Feature activation-1.00e+00

Pos logit contributions
 and+0.768
,+0.762
.+0.552
 the+0.530
 a+0.423
Neg logit contributions
umm-7.904
lor-7.853
ocol-7.765
rol-7.732
arians-7.664
Token\n
Token ID198
Feature activation-1.00e+00

Pos logit contributions
 and+1.361
,+1.345
.+1.158
 the+1.066
 a+0.944
Neg logit contributions
lor-7.317
umm-7.261
ocol-7.188
rol-7.159
 former-7.107
TokenTom
Token ID13787
Feature activation-1.00e+00

Pos logit contributions
 and+3.749
,+3.726
.+3.575
 the+3.482
 a+3.344
Neg logit contributions
umm-5.023
lor-5.005
ocol-4.899
arians-4.851
rol-4.849
Token went
Token ID1816
Feature activation-1.00e+00

Pos logit contributions
,+2.995
 and+2.975
.+2.886
 the+2.802
 a+2.662
Neg logit contributions
lor-5.590
umm-5.590
arians-5.486
rol-5.456
ocol-5.448
Token to
Token ID284
Feature activation-1.00e+00

Pos logit contributions
 and+1.878
,+1.871
.+1.718
 the+1.632
 a+1.501
Neg logit contributions
umm-6.763
lor-6.739
ocol-6.658
rol-6.655
arians-6.615
Token his
Token ID465
Feature activation-1.00e+00

Pos logit contributions
 and+2.410
,+2.396
.+2.230
 the+2.033
 a+1.941
Neg logit contributions
umm-6.231
ocol-6.202
lor-6.165
 former-6.112
arians-6.083
Token bed
Token ID3996
Feature activation-1.00e+00

Pos logit contributions
 and+3.812
,+3.802
.+3.656
 the+3.576
 a+3.427
Neg logit contributions
umm-4.906
lor-4.777
 former-4.770
arians-4.728
ocol-4.715
Token and
Token ID290
Feature activation-1.00e+00

Pos logit contributions
,+1.184
 and+1.183
 the+1.035
.+1.026
 a+0.882
Neg logit contributions
umm-7.412
lor-7.400
 former-7.258
arians-7.228
rol-7.205
Token hid
Token ID24519
Feature activation-1.00e+00

Pos logit contributions
 and+3.637
,+3.636
.+3.497
 the+3.338
 a+3.204
Neg logit contributions
lor-4.959
umm-4.925
 former-4.863
ocol-4.854
arians-4.823
Token under
Token ID739
Feature activation-1.00e+00

Pos logit contributions
,+3.041
 and+3.036
.+2.809
 the+2.728
 a+2.632
Neg logit contributions
umm-5.597
ocol-5.581
lor-5.497
rol-5.437
arians-5.417
Token the
Token ID262
Feature activation-1.00e+00

Pos logit contributions
 and+1.538
,+1.520
.+1.350
 the+1.174
 a+1.045
Neg logit contributions
umm-7.161
ocol-7.108
lor-7.077
 former-6.975
arians-6.934
Token covers
Token ID8698
Feature activation-1.00e+00

Pos logit contributions
,+4.516
 and+4.512
.+4.342
 the+4.283
 a+4.161
Neg logit contributions
umm-4.196
lor-4.096
ocol-4.084
rol-3.956
 former-3.954
Token.
Token ID13
Feature activation-1.00e+00

Pos logit contributions
 and+0.471
,+0.461
 the+0.298
.+0.297
 a+0.196
Neg logit contributions
umm-8.137
ocol-8.007
lor-7.950
rol-7.899
arians-7.868
Token\n
Token ID198
Feature activation-1.00e+00

Pos logit contributions
 and+3.122
,+3.105
.+2.928
 the+2.834
 a+2.691
Neg logit contributions
umm-5.637
lor-5.546
ocol-5.464
rol-5.417
 former-5.417
Token\n
Token ID198
Feature activation-1.00e+00

Pos logit contributions
 and+1.361
,+1.345
.+1.158
 the+1.066
 a+0.944
Neg logit contributions
lor-7.317
umm-7.261
ocol-7.188
rol-7.159
 former-7.107
TokenTom
Token ID13787
Feature activation-1.00e+00

Pos logit contributions
 and+3.749
,+3.726
.+3.575
 the+3.482
 a+3.344
Neg logit contributions
umm-5.023
lor-5.005
ocol-4.899
arians-4.851
rol-4.849
Token went
Token ID1816
Feature activation-1.00e+00

Pos logit contributions
,+2.995
 and+2.975
.+2.886
 the+2.802
 a+2.662
Neg logit contributions
lor-5.590
umm-5.590
arians-5.486
rol-5.456
ocol-5.448
Token to
Token ID284
Feature activation-1.00e+00

Pos logit contributions
 and+1.878
,+1.871
.+1.718
 the+1.632
 a+1.501
Neg logit contributions
umm-6.763
lor-6.739
ocol-6.658
rol-6.655
arians-6.615
Token his
Token ID465
Feature activation-1.00e+00

Pos logit contributions
 and+2.410
,+2.396
.+2.230
 the+2.033
 a+1.941
Neg logit contributions
umm-6.231
ocol-6.202
lor-6.165
 former-6.112
arians-6.083
Token bed
Token ID3996
Feature activation-1.00e+00

Pos logit contributions
 and+3.812
,+3.802
.+3.656
 the+3.576
 a+3.427
Neg logit contributions
umm-4.906
lor-4.777
 former-4.770
arians-4.728
ocol-4.715
Token and
Token ID290
Feature activation-1.00e+00

Pos logit contributions
,+1.184
 and+1.183
 the+1.035
.+1.026
 a+0.882
Neg logit contributions
umm-7.412
lor-7.400
 former-7.258
arians-7.228
rol-7.205
Token hid
Token ID24519
Feature activation-1.00e+00

Pos logit contributions
 and+3.637
,+3.636
.+3.497
 the+3.338
 a+3.204
Neg logit contributions
lor-4.959
umm-4.925
 former-4.863
ocol-4.854
arians-4.823
Token under
Token ID739
Feature activation-1.00e+00

Pos logit contributions
,+3.041
 and+3.036
.+2.809
 the+2.728
 a+2.632
Neg logit contributions
umm-5.597
ocol-5.581
lor-5.497
rol-5.437
arians-5.417
Token\n
Token ID198
Feature activation-1.00e+00

Pos logit contributions
 and+3.122
,+3.105
.+2.928
 the+2.834
 a+2.691
Neg logit contributions
umm-5.637
lor-5.546
ocol-5.464
rol-5.417
 former-5.417
Token\n
Token ID198
Feature activation-1.00e+00

Pos logit contributions
 and+1.361
,+1.345
.+1.158
 the+1.066
 a+0.944
Neg logit contributions
lor-7.317
umm-7.261
ocol-7.188
rol-7.159
 former-7.107
TokenTom
Token ID13787
Feature activation-1.00e+00

Pos logit contributions
 and+3.749
,+3.726
.+3.575
 the+3.482
 a+3.344
Neg logit contributions
umm-5.023
lor-5.005
ocol-4.899
arians-4.851
rol-4.849
Token went
Token ID1816
Feature activation-1.00e+00

Pos logit contributions
,+2.995
 and+2.975
.+2.886
 the+2.802
 a+2.662
Neg logit contributions
lor-5.590
umm-5.590
arians-5.486
rol-5.456
ocol-5.448
Token to
Token ID284
Feature activation-1.00e+00

Pos logit contributions
 and+1.878
,+1.871
.+1.718
 the+1.632
 a+1.501
Neg logit contributions
umm-6.763
lor-6.739
ocol-6.658
rol-6.655
arians-6.615
Token his
Token ID465
Feature activation-1.00e+00

Pos logit contributions
 and+2.410
,+2.396
.+2.230
 the+2.033
 a+1.941
Neg logit contributions
umm-6.231
ocol-6.202
lor-6.165
 former-6.112
arians-6.083
Token bed
Token ID3996
Feature activation-1.00e+00

Pos logit contributions
 and+3.812
,+3.802
.+3.656
 the+3.576
 a+3.427
Neg logit contributions
umm-4.906
lor-4.777
 former-4.770
arians-4.728
ocol-4.715
Token and
Token ID290
Feature activation-1.00e+00

Pos logit contributions
,+1.184
 and+1.183
 the+1.035
.+1.026
 a+0.882
Neg logit contributions
umm-7.412
lor-7.400
 former-7.258
arians-7.228
rol-7.205
Token hid
Token ID24519
Feature activation-1.00e+00

Pos logit contributions
 and+3.637
,+3.636
.+3.497
 the+3.338
 a+3.204
Neg logit contributions
lor-4.959
umm-4.925
 former-4.863
ocol-4.854
arians-4.823
Token under
Token ID739
Feature activation-1.00e+00

Pos logit contributions
,+3.041
 and+3.036
.+2.809
 the+2.728
 a+2.632
Neg logit contributions
umm-5.597
ocol-5.581
lor-5.497
rol-5.437
arians-5.417
Token the
Token ID262
Feature activation-1.00e+00

Pos logit contributions
 and+1.538
,+1.520
.+1.350
 the+1.174
 a+1.045
Neg logit contributions
umm-7.161
ocol-7.108
lor-7.077
 former-6.975
arians-6.934
Token talking
Token ID3375
Feature activation-1.00e+00

Pos logit contributions
 and+3.778
,+3.724
.+3.582
 the+3.540
 a+3.380
Neg logit contributions
umm-4.826
lor-4.819
ocol-4.752
 former-4.720
rol-4.708
Token,
Token ID11
Feature activation-1.00e+00

Pos logit contributions
,+2.824
 and+2.818
.+2.652
 the+2.624
 a+2.463
Neg logit contributions
umm-5.797
lor-5.763
ocol-5.653
 former-5.646
rol-5.639
Token table
Token ID3084
Feature activation-1.00e+00

Pos logit contributions
,+3.786
 and+3.772
.+3.674
 the+3.542
 a+3.418
Neg logit contributions
umm-4.873
ocol-4.757
lor-4.755
 former-4.715
arians-4.695
Token?"
Token ID1701
Feature activation-1.00e+00

Pos logit contributions
 and+2.159
,+2.143
.+2.003
 the+1.989
 a+1.835
Neg logit contributions
umm-6.482
lor-6.399
ocol-6.325
arians-6.261
 former-6.256
Token The
Token ID383
Feature activation-1.00e+00

Pos logit contributions
 and+3.191
,+3.162
.+3.021
 the+2.893
 a+2.782
Neg logit contributions
umm-5.452
lor-5.395
ocol-5.387
 former-5.297
arians-5.282
Token table
Token ID3084
Feature activation-1.00e+00

Pos logit contributions
 and+4.314
,+4.307
.+4.148
 the+4.051
 a+3.901
Neg logit contributions
umm-4.433
ocol-4.322
lor-4.282
 former-4.225
arians-4.215
Token did
Token ID750
Feature activation-1.00e+00

Pos logit contributions
 and+3.842
,+3.829
.+3.696
 the+3.638
 a+3.490
Neg logit contributions
umm-4.808
lor-4.750
ocol-4.670
rol-4.623
arians-4.594
Token not
Token ID407
Feature activation-1.00e+00

Pos logit contributions
 and+2.292
,+2.261
.+2.112
 the+2.038
 a+1.884
Neg logit contributions
umm-6.299
lor-6.292
ocol-6.236
 former-6.195
arians-6.153
Token say
Token ID910
Feature activation-1.00e+00

Pos logit contributions
 and+3.942
,+3.925
.+3.770
 the+3.703
 a+3.549
Neg logit contributions
umm-4.772
ocol-4.675
lor-4.636
 former-4.568
arians-4.563
Token anything
Token ID1997
Feature activation-1.00e+00

Pos logit contributions
 and+4.169
,+4.077
.+3.952
 the+3.853
 a+3.711
Neg logit contributions
umm-4.531
lor-4.527
ocol-4.460
arians-4.342
rol-4.336
Token either
Token ID2035
Feature activation-1.00e+00

Pos logit contributions
 and+1.866
,+1.842
.+1.675
 the+1.644
 a+1.525
Neg logit contributions
umm-6.743
lor-6.689
ocol-6.635
arians-6.584
rol-6.567
Token you
Token ID345
Feature activation-1.00e+00

Pos logit contributions
 and+3.274
,+3.271
.+3.151
 the+3.054
 a+2.945
Neg logit contributions
umm-5.337
lor-5.270
arians-5.228
ocol-5.204
rol-5.180
Token talking
Token ID3375
Feature activation-1.00e+00

Pos logit contributions
 and+3.778
,+3.724
.+3.582
 the+3.540
 a+3.380
Neg logit contributions
umm-4.826
lor-4.819
ocol-4.752
 former-4.720
rol-4.708
Token,
Token ID11
Feature activation-1.00e+00

Pos logit contributions
,+2.824
 and+2.818
.+2.652
 the+2.624
 a+2.463
Neg logit contributions
umm-5.797
lor-5.763
ocol-5.653
 former-5.646
rol-5.639
Token table
Token ID3084
Feature activation-1.00e+00

Pos logit contributions
,+3.786
 and+3.772
.+3.674
 the+3.542
 a+3.418
Neg logit contributions
umm-4.873
ocol-4.757
lor-4.755
 former-4.715
arians-4.695
Token?"
Token ID1701
Feature activation-1.00e+00

Pos logit contributions
 and+2.159
,+2.143
.+2.003
 the+1.989
 a+1.835
Neg logit contributions
umm-6.482
lor-6.399
ocol-6.325
arians-6.261
 former-6.256
Token The
Token ID383
Feature activation-1.00e+00

Pos logit contributions
 and+3.191
,+3.162
.+3.021
 the+2.893
 a+2.782
Neg logit contributions
umm-5.452
lor-5.395
ocol-5.387
 former-5.297
arians-5.282
Token table
Token ID3084
Feature activation-1.00e+00

Pos logit contributions
 and+4.314
,+4.307
.+4.148
 the+4.051
 a+3.901
Neg logit contributions
umm-4.433
ocol-4.322
lor-4.282
 former-4.225
arians-4.215
Token did
Token ID750
Feature activation-1.00e+00

Pos logit contributions
 and+3.842
,+3.829
.+3.696
 the+3.638
 a+3.490
Neg logit contributions
umm-4.808
lor-4.750
ocol-4.670
rol-4.623
arians-4.594
Token not
Token ID407
Feature activation-1.00e+00

Pos logit contributions
 and+2.292
,+2.261
.+2.112
 the+2.038
 a+1.884
Neg logit contributions
umm-6.299
lor-6.292
ocol-6.236
 former-6.195
arians-6.153
Token say
Token ID910
Feature activation-1.00e+00

Pos logit contributions
 and+3.942
,+3.925
.+3.770
 the+3.703
 a+3.549
Neg logit contributions
umm-4.772
ocol-4.675
lor-4.636
 former-4.568
arians-4.563
Token anything
Token ID1997
Feature activation-1.00e+00

Pos logit contributions
 and+4.169
,+4.077
.+3.952
 the+3.853
 a+3.711
Neg logit contributions
umm-4.531
lor-4.527
ocol-4.460
arians-4.342
rol-4.336
Token,
Token ID11
Feature activation-1.00e+00

Pos logit contributions
,+2.824
 and+2.818
.+2.652
 the+2.624
 a+2.463
Neg logit contributions
umm-5.797
lor-5.763
ocol-5.653
 former-5.646
rol-5.639
Token table
Token ID3084
Feature activation-1.00e+00

Pos logit contributions
,+3.786
 and+3.772
.+3.674
 the+3.542
 a+3.418
Neg logit contributions
umm-4.873
ocol-4.757
lor-4.755
 former-4.715
arians-4.695
Token?"
Token ID1701
Feature activation-1.00e+00

Pos logit contributions
 and+2.159
,+2.143
.+2.003
 the+1.989
 a+1.835
Neg logit contributions
umm-6.482
lor-6.399
ocol-6.325
arians-6.261
 former-6.256
Token The
Token ID383
Feature activation-1.00e+00

Pos logit contributions
 and+3.191
,+3.162
.+3.021
 the+2.893
 a+2.782
Neg logit contributions
umm-5.452
lor-5.395
ocol-5.387
 former-5.297
arians-5.282
Token table
Token ID3084
Feature activation-1.00e+00

Pos logit contributions
 and+4.314
,+4.307
.+4.148
 the+4.051
 a+3.901
Neg logit contributions
umm-4.433
ocol-4.322
lor-4.282
 former-4.225
arians-4.215
Token did
Token ID750
Feature activation-1.00e+00

Pos logit contributions
 and+3.842
,+3.829
.+3.696
 the+3.638
 a+3.490
Neg logit contributions
umm-4.808
lor-4.750
ocol-4.670
rol-4.623
arians-4.594
Token not
Token ID407
Feature activation-1.00e+00

Pos logit contributions
 and+2.292
,+2.261
.+2.112
 the+2.038
 a+1.884
Neg logit contributions
umm-6.299
lor-6.292
ocol-6.236
 former-6.195
arians-6.153
Token say
Token ID910
Feature activation-1.00e+00

Pos logit contributions
 and+3.942
,+3.925
.+3.770
 the+3.703
 a+3.549
Neg logit contributions
umm-4.772
ocol-4.675
lor-4.636
 former-4.568
arians-4.563
Token anything
Token ID1997
Feature activation-1.00e+00

Pos logit contributions
 and+4.169
,+4.077
.+3.952
 the+3.853
 a+3.711
Neg logit contributions
umm-4.531
lor-4.527
ocol-4.460
arians-4.342
rol-4.336
Token either
Token ID2035
Feature activation-1.00e+00

Pos logit contributions
 and+1.866
,+1.842
.+1.675
 the+1.644
 a+1.525
Neg logit contributions
umm-6.743
lor-6.689
ocol-6.635
arians-6.584
rol-6.567
Token.
Token ID13
Feature activation-1.00e+00

Pos logit contributions
 and+0.867
,+0.832
.+0.671
 the+0.670
 a+0.547
Neg logit contributions
umm-7.744
lor-7.683
ocol-7.626
rol-7.553
arians-7.534
Token table
Token ID3084
Feature activation-1.00e+00

Pos logit contributions
,+3.786
 and+3.772
.+3.674
 the+3.542
 a+3.418
Neg logit contributions
umm-4.873
ocol-4.757
lor-4.755
 former-4.715
arians-4.695
Token?"
Token ID1701
Feature activation-1.00e+00

Pos logit contributions
 and+2.159
,+2.143
.+2.003
 the+1.989
 a+1.835
Neg logit contributions
umm-6.482
lor-6.399
ocol-6.325
arians-6.261
 former-6.256
Token The
Token ID383
Feature activation-1.00e+00

Pos logit contributions
 and+3.191
,+3.162
.+3.021
 the+2.893
 a+2.782
Neg logit contributions
umm-5.452
lor-5.395
ocol-5.387
 former-5.297
arians-5.282
Token table
Token ID3084
Feature activation-1.00e+00

Pos logit contributions
 and+4.314
,+4.307
.+4.148
 the+4.051
 a+3.901
Neg logit contributions
umm-4.433
ocol-4.322
lor-4.282
 former-4.225
arians-4.215
Token did
Token ID750
Feature activation-1.00e+00

Pos logit contributions
 and+3.842
,+3.829
.+3.696
 the+3.638
 a+3.490
Neg logit contributions
umm-4.808
lor-4.750
ocol-4.670
rol-4.623
arians-4.594
Token not
Token ID407
Feature activation-1.00e+00

Pos logit contributions
 and+2.292
,+2.261
.+2.112
 the+2.038
 a+1.884
Neg logit contributions
umm-6.299
lor-6.292
ocol-6.236
 former-6.195
arians-6.153
Token say
Token ID910
Feature activation-1.00e+00

Pos logit contributions
 and+3.942
,+3.925
.+3.770
 the+3.703
 a+3.549
Neg logit contributions
umm-4.772
ocol-4.675
lor-4.636
 former-4.568
arians-4.563
Token anything
Token ID1997
Feature activation-1.00e+00

Pos logit contributions
 and+4.169
,+4.077
.+3.952
 the+3.853
 a+3.711
Neg logit contributions
umm-4.531
lor-4.527
ocol-4.460
arians-4.342
rol-4.336
Token either
Token ID2035
Feature activation-1.00e+00

Pos logit contributions
 and+1.866
,+1.842
.+1.675
 the+1.644
 a+1.525
Neg logit contributions
umm-6.743
lor-6.689
ocol-6.635
arians-6.584
rol-6.567
Token.
Token ID13
Feature activation-1.00e+00

Pos logit contributions
 and+0.867
,+0.832
.+0.671
 the+0.670
 a+0.547
Neg logit contributions
umm-7.744
lor-7.683
ocol-7.626
rol-7.553
arians-7.534
Token Tom
Token ID4186
Feature activation-1.00e+00

Pos logit contributions
 and+2.988
,+2.978
.+2.812
 the+2.709
 a+2.569
Neg logit contributions
umm-5.733
lor-5.694
ocol-5.615
 former-5.554
rol-5.549
Token either
Token ID2035
Feature activation-1.00e+00

Pos logit contributions
 and+1.866
,+1.842
.+1.675
 the+1.644
 a+1.525
Neg logit contributions
umm-6.743
lor-6.689
ocol-6.635
arians-6.584
rol-6.567
Token.
Token ID13
Feature activation-1.00e+00

Pos logit contributions
 and+0.867
,+0.832
.+0.671
 the+0.670
 a+0.547
Neg logit contributions
umm-7.744
lor-7.683
ocol-7.626
rol-7.553
arians-7.534
Token Tom
Token ID4186
Feature activation-1.00e+00

Pos logit contributions
 and+2.988
,+2.978
.+2.812
 the+2.709
 a+2.569
Neg logit contributions
umm-5.733
lor-5.694
ocol-5.615
 former-5.554
rol-5.549
Token was
Token ID373
Feature activation-1.00e+00

Pos logit contributions
,+3.043
 and+3.020
.+2.924
 the+2.838
 a+2.698
Neg logit contributions
umm-5.559
lor-5.535
arians-5.435
ocol-5.418
rol-5.404
Token scared
Token ID12008
Feature activation-1.00e+00

Pos logit contributions
,+3.713
 and+3.708
.+3.552
 the+3.422
 a+3.240
Neg logit contributions
umm-4.908
lor-4.853
ocol-4.853
 former-4.815
rol-4.811
Token.
Token ID13
Feature activation-1.00e+00

Pos logit contributions
 and+0.471
,+0.461
 the+0.298
.+0.297
 a+0.196
Neg logit contributions
umm-8.137
ocol-8.007
lor-7.950
rol-7.899
arians-7.868
Token\n
Token ID198
Feature activation-1.00e+00

Pos logit contributions
 and+3.122
,+3.105
.+2.928
 the+2.834
 a+2.691
Neg logit contributions
umm-5.637
lor-5.546
ocol-5.464
rol-5.417
 former-5.417
Token\n
Token ID198
Feature activation-1.00e+00

Pos logit contributions
 and+1.361
,+1.345
.+1.158
 the+1.066
 a+0.944
Neg logit contributions
lor-7.317
umm-7.261
ocol-7.188
rol-7.159
 former-7.107
TokenTom
Token ID13787
Feature activation-1.00e+00

Pos logit contributions
 and+3.749
,+3.726
.+3.575
 the+3.482
 a+3.344
Neg logit contributions
umm-5.023
lor-5.005
ocol-4.899
arians-4.851
rol-4.849
Token went
Token ID1816
Feature activation-1.00e+00

Pos logit contributions
,+2.995
 and+2.975
.+2.886
 the+2.802
 a+2.662
Neg logit contributions
lor-5.590
umm-5.590
arians-5.486
rol-5.456
ocol-5.448
Token to
Token ID284
Feature activation-1.00e+00

Pos logit contributions
 and+1.878
,+1.871
.+1.718
 the+1.632
 a+1.501
Neg logit contributions
umm-6.763
lor-6.739
ocol-6.658
rol-6.655
arians-6.615
Token anything
Token ID1997
Feature activation-1.00e+00

Pos logit contributions
 and+4.169
,+4.077
.+3.952
 the+3.853
 a+3.711
Neg logit contributions
umm-4.531
lor-4.527
ocol-4.460
arians-4.342
rol-4.336
Token either
Token ID2035
Feature activation-1.00e+00

Pos logit contributions
 and+1.866
,+1.842
.+1.675
 the+1.644
 a+1.525
Neg logit contributions
umm-6.743
lor-6.689
ocol-6.635
arians-6.584
rol-6.567
Token.
Token ID13
Feature activation-1.00e+00

Pos logit contributions
 and+0.867
,+0.832
.+0.671
 the+0.670
 a+0.547
Neg logit contributions
umm-7.744
lor-7.683
ocol-7.626
rol-7.553
arians-7.534
Token Tom
Token ID4186
Feature activation-1.00e+00

Pos logit contributions
 and+2.988
,+2.978
.+2.812
 the+2.709
 a+2.569
Neg logit contributions
umm-5.733
lor-5.694
ocol-5.615
 former-5.554
rol-5.549
Token was
Token ID373
Feature activation-1.00e+00

Pos logit contributions
,+3.043
 and+3.020
.+2.924
 the+2.838
 a+2.698
Neg logit contributions
umm-5.559
lor-5.535
arians-5.435
ocol-5.418
rol-5.404
Token scared
Token ID12008
Feature activation-1.00e+00

Pos logit contributions
,+3.713
 and+3.708
.+3.552
 the+3.422
 a+3.240
Neg logit contributions
umm-4.908
lor-4.853
ocol-4.853
 former-4.815
rol-4.811
Token.
Token ID13
Feature activation-1.00e+00

Pos logit contributions
 and+0.471
,+0.461
 the+0.298
.+0.297
 a+0.196
Neg logit contributions
umm-8.137
ocol-8.007
lor-7.950
rol-7.899
arians-7.868
Token\n
Token ID198
Feature activation-1.00e+00

Pos logit contributions
 and+3.122
,+3.105
.+2.928
 the+2.834
 a+2.691
Neg logit contributions
umm-5.637
lor-5.546
ocol-5.464
rol-5.417
 former-5.417
Token\n
Token ID198
Feature activation-1.00e+00

Pos logit contributions
 and+1.361
,+1.345
.+1.158
 the+1.066
 a+0.944
Neg logit contributions
lor-7.317
umm-7.261
ocol-7.188
rol-7.159
 former-7.107
TokenTom
Token ID13787
Feature activation-1.00e+00

Pos logit contributions
 and+3.749
,+3.726
.+3.575
 the+3.482
 a+3.344
Neg logit contributions
umm-5.023
lor-5.005
ocol-4.899
arians-4.851
rol-4.849
Token went
Token ID1816
Feature activation-1.00e+00

Pos logit contributions
,+2.995
 and+2.975
.+2.886
 the+2.802
 a+2.662
Neg logit contributions
lor-5.590
umm-5.590
arians-5.486
rol-5.456
ocol-5.448
Token not
Token ID407
Feature activation-1.00e+00

Pos logit contributions
 and+2.292
,+2.261
.+2.112
 the+2.038
 a+1.884
Neg logit contributions
umm-6.299
lor-6.292
ocol-6.236
 former-6.195
arians-6.153
Token say
Token ID910
Feature activation-1.00e+00

Pos logit contributions
 and+3.942
,+3.925
.+3.770
 the+3.703
 a+3.549
Neg logit contributions
umm-4.772
ocol-4.675
lor-4.636
 former-4.568
arians-4.563
Token anything
Token ID1997
Feature activation-1.00e+00

Pos logit contributions
 and+4.169
,+4.077
.+3.952
 the+3.853
 a+3.711
Neg logit contributions
umm-4.531
lor-4.527
ocol-4.460
arians-4.342
rol-4.336
Token either
Token ID2035
Feature activation-1.00e+00

Pos logit contributions
 and+1.866
,+1.842
.+1.675
 the+1.644
 a+1.525
Neg logit contributions
umm-6.743
lor-6.689
ocol-6.635
arians-6.584
rol-6.567
Token.
Token ID13
Feature activation-1.00e+00

Pos logit contributions
 and+0.867
,+0.832
.+0.671
 the+0.670
 a+0.547
Neg logit contributions
umm-7.744
lor-7.683
ocol-7.626
rol-7.553
arians-7.534
Token Tom
Token ID4186
Feature activation-1.00e+00

Pos logit contributions
 and+2.988
,+2.978
.+2.812
 the+2.709
 a+2.569
Neg logit contributions
umm-5.733
lor-5.694
ocol-5.615
 former-5.554
rol-5.549
Token was
Token ID373
Feature activation-1.00e+00

Pos logit contributions
,+3.043
 and+3.020
.+2.924
 the+2.838
 a+2.698
Neg logit contributions
umm-5.559
lor-5.535
arians-5.435
ocol-5.418
rol-5.404
Token scared
Token ID12008
Feature activation-1.00e+00

Pos logit contributions
,+3.713
 and+3.708
.+3.552
 the+3.422
 a+3.240
Neg logit contributions
umm-4.908
lor-4.853
ocol-4.853
 former-4.815
rol-4.811
Token.
Token ID13
Feature activation-1.00e+00

Pos logit contributions
 and+0.471
,+0.461
 the+0.298
.+0.297
 a+0.196
Neg logit contributions
umm-8.137
ocol-8.007
lor-7.950
rol-7.899
arians-7.868
Token\n
Token ID198
Feature activation-1.00e+00

Pos logit contributions
 and+3.122
,+3.105
.+2.928
 the+2.834
 a+2.691
Neg logit contributions
umm-5.637
lor-5.546
ocol-5.464
rol-5.417
 former-5.417
Token\n
Token ID198
Feature activation-1.00e+00

Pos logit contributions
 and+1.361
,+1.345
.+1.158
 the+1.066
 a+0.944
Neg logit contributions
lor-7.317
umm-7.261
ocol-7.188
rol-7.159
 former-7.107
Token say
Token ID910
Feature activation-1.00e+00

Pos logit contributions
 and+3.942
,+3.925
.+3.770
 the+3.703
 a+3.549
Neg logit contributions
umm-4.772
ocol-4.675
lor-4.636
 former-4.568
arians-4.563
Token anything
Token ID1997
Feature activation-1.00e+00

Pos logit contributions
 and+4.169
,+4.077
.+3.952
 the+3.853
 a+3.711
Neg logit contributions
umm-4.531
lor-4.527
ocol-4.460
arians-4.342
rol-4.336
Token either
Token ID2035
Feature activation-1.00e+00

Pos logit contributions
 and+1.866
,+1.842
.+1.675
 the+1.644
 a+1.525
Neg logit contributions
umm-6.743
lor-6.689
ocol-6.635
arians-6.584
rol-6.567
Token.
Token ID13
Feature activation-1.00e+00

Pos logit contributions
 and+0.867
,+0.832
.+0.671
 the+0.670
 a+0.547
Neg logit contributions
umm-7.744
lor-7.683
ocol-7.626
rol-7.553
arians-7.534
Token Tom
Token ID4186
Feature activation-1.00e+00

Pos logit contributions
 and+2.988
,+2.978
.+2.812
 the+2.709
 a+2.569
Neg logit contributions
umm-5.733
lor-5.694
ocol-5.615
 former-5.554
rol-5.549
Token was
Token ID373
Feature activation-1.00e+00

Pos logit contributions
,+3.043
 and+3.020
.+2.924
 the+2.838
 a+2.698
Neg logit contributions
umm-5.559
lor-5.535
arians-5.435
ocol-5.418
rol-5.404
Token scared
Token ID12008
Feature activation-1.00e+00

Pos logit contributions
,+3.713
 and+3.708
.+3.552
 the+3.422
 a+3.240
Neg logit contributions
umm-4.908
lor-4.853
ocol-4.853
 former-4.815
rol-4.811
Token.
Token ID13
Feature activation-1.00e+00

Pos logit contributions
 and+0.471
,+0.461
 the+0.298
.+0.297
 a+0.196
Neg logit contributions
umm-8.137
ocol-8.007
lor-7.950
rol-7.899
arians-7.868
Token\n
Token ID198
Feature activation-1.00e+00

Pos logit contributions
 and+3.122
,+3.105
.+2.928
 the+2.834
 a+2.691
Neg logit contributions
umm-5.637
lor-5.546
ocol-5.464
rol-5.417
 former-5.417
Token\n
Token ID198
Feature activation-1.00e+00

Pos logit contributions
 and+1.361
,+1.345
.+1.158
 the+1.066
 a+0.944
Neg logit contributions
lor-7.317
umm-7.261
ocol-7.188
rol-7.159
 former-7.107
TokenTom
Token ID13787
Feature activation-1.00e+00

Pos logit contributions
 and+3.749
,+3.726
.+3.575
 the+3.482
 a+3.344
Neg logit contributions
umm-5.023
lor-5.005
ocol-4.899
arians-4.851
rol-4.849
Token Tom
Token ID4186
Feature activation-1.00e+00

Pos logit contributions
 and+2.988
,+2.978
.+2.812
 the+2.709
 a+2.569
Neg logit contributions
umm-5.733
lor-5.694
ocol-5.615
 former-5.554
rol-5.549
Token was
Token ID373
Feature activation-1.00e+00

Pos logit contributions
,+3.043
 and+3.020
.+2.924
 the+2.838
 a+2.698
Neg logit contributions
umm-5.559
lor-5.535
arians-5.435
ocol-5.418
rol-5.404
Token scared
Token ID12008
Feature activation-1.00e+00

Pos logit contributions
,+3.713
 and+3.708
.+3.552
 the+3.422
 a+3.240
Neg logit contributions
umm-4.908
lor-4.853
ocol-4.853
 former-4.815
rol-4.811
Token.
Token ID13
Feature activation-1.00e+00

Pos logit contributions
 and+0.471
,+0.461
 the+0.298
.+0.297
 a+0.196
Neg logit contributions
umm-8.137
ocol-8.007
lor-7.950
rol-7.899
arians-7.868
Token\n
Token ID198
Feature activation-1.00e+00

Pos logit contributions
 and+3.122
,+3.105
.+2.928
 the+2.834
 a+2.691
Neg logit contributions
umm-5.637
lor-5.546
ocol-5.464
rol-5.417
 former-5.417
Token\n
Token ID198
Feature activation-1.00e+00

Pos logit contributions
 and+1.361
,+1.345
.+1.158
 the+1.066
 a+0.944
Neg logit contributions
lor-7.317
umm-7.261
ocol-7.188
rol-7.159
 former-7.107
TokenTom
Token ID13787
Feature activation-1.00e+00

Pos logit contributions
 and+3.749
,+3.726
.+3.575
 the+3.482
 a+3.344
Neg logit contributions
umm-5.023
lor-5.005
ocol-4.899
arians-4.851
rol-4.849
Token went
Token ID1816
Feature activation-1.00e+00

Pos logit contributions
,+2.995
 and+2.975
.+2.886
 the+2.802
 a+2.662
Neg logit contributions
lor-5.590
umm-5.590
arians-5.486
rol-5.456
ocol-5.448
Token to
Token ID284
Feature activation-1.00e+00

Pos logit contributions
 and+1.878
,+1.871
.+1.718
 the+1.632
 a+1.501
Neg logit contributions
umm-6.763
lor-6.739
ocol-6.658
rol-6.655
arians-6.615
Token his
Token ID465
Feature activation-1.00e+00

Pos logit contributions
 and+2.410
,+2.396
.+2.230
 the+2.033
 a+1.941
Neg logit contributions
umm-6.231
ocol-6.202
lor-6.165
 former-6.112
arians-6.083
Token bed
Token ID3996
Feature activation-1.00e+00

Pos logit contributions
 and+3.812
,+3.802
.+3.656
 the+3.576
 a+3.427
Neg logit contributions
umm-4.906
lor-4.777
 former-4.770
arians-4.728
ocol-4.715
Token.
Token ID13
Feature activation-1.00e+00

Pos logit contributions
 and+0.867
,+0.832
.+0.671
 the+0.670
 a+0.547
Neg logit contributions
umm-7.744
lor-7.683
ocol-7.626
rol-7.553
arians-7.534
Token Tom
Token ID4186
Feature activation-1.00e+00

Pos logit contributions
 and+2.988
,+2.978
.+2.812
 the+2.709
 a+2.569
Neg logit contributions
umm-5.733
lor-5.694
ocol-5.615
 former-5.554
rol-5.549
Token was
Token ID373
Feature activation-1.00e+00

Pos logit contributions
,+3.043
 and+3.020
.+2.924
 the+2.838
 a+2.698
Neg logit contributions
umm-5.559
lor-5.535
arians-5.435
ocol-5.418
rol-5.404
Token scared
Token ID12008
Feature activation-1.00e+00

Pos logit contributions
,+3.713
 and+3.708
.+3.552
 the+3.422
 a+3.240
Neg logit contributions
umm-4.908
lor-4.853
ocol-4.853
 former-4.815
rol-4.811
Token.
Token ID13
Feature activation-1.00e+00

Pos logit contributions
 and+0.471
,+0.461
 the+0.298
.+0.297
 a+0.196
Neg logit contributions
umm-8.137
ocol-8.007
lor-7.950
rol-7.899
arians-7.868
Token\n
Token ID198
Feature activation-1.00e+00

Pos logit contributions
 and+3.122
,+3.105
.+2.928
 the+2.834
 a+2.691
Neg logit contributions
umm-5.637
lor-5.546
ocol-5.464
rol-5.417
 former-5.417
Token\n
Token ID198
Feature activation-1.00e+00

Pos logit contributions
 and+1.361
,+1.345
.+1.158
 the+1.066
 a+0.944
Neg logit contributions
lor-7.317
umm-7.261
ocol-7.188
rol-7.159
 former-7.107
TokenTom
Token ID13787
Feature activation-1.00e+00

Pos logit contributions
 and+3.749
,+3.726
.+3.575
 the+3.482
 a+3.344
Neg logit contributions
umm-5.023
lor-5.005
ocol-4.899
arians-4.851
rol-4.849
Token went
Token ID1816
Feature activation-1.00e+00

Pos logit contributions
,+2.995
 and+2.975
.+2.886
 the+2.802
 a+2.662
Neg logit contributions
lor-5.590
umm-5.590
arians-5.486
rol-5.456
ocol-5.448
Token to
Token ID284
Feature activation-1.00e+00

Pos logit contributions
 and+1.878
,+1.871
.+1.718
 the+1.632
 a+1.501
Neg logit contributions
umm-6.763
lor-6.739
ocol-6.658
rol-6.655
arians-6.615
Token his
Token ID465
Feature activation-1.00e+00

Pos logit contributions
 and+2.410
,+2.396
.+2.230
 the+2.033
 a+1.941
Neg logit contributions
umm-6.231
ocol-6.202
lor-6.165
 former-6.112
arians-6.083
Token was
Token ID373
Feature activation-1.00e+00

Pos logit contributions
,+3.043
 and+3.020
.+2.924
 the+2.838
 a+2.698
Neg logit contributions
umm-5.559
lor-5.535
arians-5.435
ocol-5.418
rol-5.404
Token scared
Token ID12008
Feature activation-1.00e+00

Pos logit contributions
,+3.713
 and+3.708
.+3.552
 the+3.422
 a+3.240
Neg logit contributions
umm-4.908
lor-4.853
ocol-4.853
 former-4.815
rol-4.811
Token.
Token ID13
Feature activation-1.00e+00

Pos logit contributions
 and+0.471
,+0.461
 the+0.298
.+0.297
 a+0.196
Neg logit contributions
umm-8.137
ocol-8.007
lor-7.950
rol-7.899
arians-7.868
Token
Token ID198
Feature activation-1.00e+00

Pos logit contributions
 and+3.122
,+3.105
.+2.928
 the+2.834
 a+2.691
Neg logit contributions
umm-5.637
lor-5.546
ocol-5.464
rol-5.417
 former-5.417
Token
Token ID198
Feature activation-1.00e+00

Pos logit contributions
 and+1.361
,+1.345
.+1.158
 the+1.066
 a+0.944
Neg logit contributions
lor-7.317
umm-7.261
ocol-7.188
rol-7.159
 former-7.107
TokenTom
Token ID13787
Feature activation-1.00e+00

Pos logit contributions
 and+3.749
,+3.726
.+3.575
 the+3.482
 a+3.344
Neg logit contributions
umm-5.023
lor-5.005
ocol-4.899
arians-4.851
rol-4.849
Token went
Token ID1816
Feature activation-1.00e+00

Pos logit contributions
,+2.995
 and+2.975
.+2.886
 the+2.802
 a+2.662
Neg logit contributions
lor-5.590
umm-5.590
arians-5.486
rol-5.456
ocol-5.448
Token to
Token ID284
Feature activation-1.00e+00

Pos logit contributions
 and+1.878
,+1.871
.+1.718
 the+1.632
 a+1.501
Neg logit contributions
umm-6.763
lor-6.739
ocol-6.658
rol-6.655
arians-6.615
Token his
Token ID465
Feature activation-1.00e+00

Pos logit contributions
 and+2.410
,+2.396
.+2.230
 the+2.033
 a+1.941
Neg logit contributions
umm-6.231
ocol-6.202
lor-6.165
 former-6.112
arians-6.083
Token bed
Token ID3996
Feature activation-1.00e+00

Pos logit contributions
 and+3.812
,+3.802
.+3.656
 the+3.576
 a+3.427
Neg logit contributions
umm-4.906
lor-4.777
 former-4.770
arians-4.728
ocol-4.715
Token and
Token ID290
Feature activation-1.00e+00

Pos logit contributions
,+1.184
 and+1.183
 the+1.035
.+1.026
 a+0.882
Neg logit contributions
umm-7.412
lor-7.400
 former-7.258
arians-7.228
rol-7.205
Token scared
Token ID12008
Feature activation-1.00e+00

Pos logit contributions
,+3.713
 and+3.708
.+3.552
 the+3.422
 a+3.240
Neg logit contributions
umm-4.908
lor-4.853
ocol-4.853
 former-4.815
rol-4.811
Token.
Token ID13
Feature activation-1.00e+00

Pos logit contributions
 and+0.471
,+0.461
 the+0.298
.+0.297
 a+0.196
Neg logit contributions
umm-8.137
ocol-8.007
lor-7.950
rol-7.899
arians-7.868
Token
Token ID198
Feature activation-1.00e+00

Pos logit contributions
 and+3.122
,+3.105
.+2.928
 the+2.834
 a+2.691
Neg logit contributions
umm-5.637
lor-5.546
ocol-5.464
rol-5.417
 former-5.417
Token
Token ID198
Feature activation-1.00e+00

Pos logit contributions
 and+1.361
,+1.345
.+1.158
 the+1.066
 a+0.944
Neg logit contributions
lor-7.317
umm-7.261
ocol-7.188
rol-7.159
 former-7.107
TokenTom
Token ID13787
Feature activation-1.00e+00

Pos logit contributions
 and+3.749
,+3.726
.+3.575
 the+3.482
 a+3.344
Neg logit contributions
umm-5.023
lor-5.005
ocol-4.899
arians-4.851
rol-4.849
Token went
Token ID1816
Feature activation-1.00e+00

Pos logit contributions
,+2.995
 and+2.975
.+2.886
 the+2.802
 a+2.662
Neg logit contributions
lor-5.590
umm-5.590
arians-5.486
rol-5.456
ocol-5.448
Token to
Token ID284
Feature activation-1.00e+00

Pos logit contributions
 and+1.878
,+1.871
.+1.718
 the+1.632
 a+1.501
Neg logit contributions
umm-6.763
lor-6.739
ocol-6.658
rol-6.655
arians-6.615
Token his
Token ID465
Feature activation-1.00e+00

Pos logit contributions
 and+2.410
,+2.396
.+2.230
 the+2.033
 a+1.941
Neg logit contributions
umm-6.231
ocol-6.202
lor-6.165
 former-6.112
arians-6.083
Token bed
Token ID3996
Feature activation-1.00e+00

Pos logit contributions
 and+3.812
,+3.802
.+3.656
 the+3.576
 a+3.427
Neg logit contributions
umm-4.906
lor-4.777
 former-4.770
arians-4.728
ocol-4.715
Token and
Token ID290
Feature activation-1.00e+00

Pos logit contributions
,+1.184
 and+1.183
 the+1.035
.+1.026
 a+0.882
Neg logit contributions
umm-7.412
lor-7.400
 former-7.258
arians-7.228
rol-7.205
Token hid
Token ID24519
Feature activation-1.00e+00

Pos logit contributions
 and+3.637
,+3.636
.+3.497
 the+3.338
 a+3.204
Neg logit contributions
lor-4.959
umm-4.925
 former-4.863
ocol-4.854
arians-4.823
Token did
Token ID750
Feature activation-1.00e+00

Pos logit contributions
 and+3.842
,+3.829
.+3.696
 the+3.638
 a+3.490
Neg logit contributions
umm-4.808
lor-4.750
ocol-4.670
rol-4.623
arians-4.594
Token not
Token ID407
Feature activation-1.00e+00

Pos logit contributions
 and+2.292
,+2.261
.+2.112
 the+2.038
 a+1.884
Neg logit contributions
umm-6.299
lor-6.292
ocol-6.236
 former-6.195
arians-6.153
Token say
Token ID910
Feature activation-1.00e+00

Pos logit contributions
 and+3.942
,+3.925
.+3.770
 the+3.703
 a+3.549
Neg logit contributions
umm-4.772
ocol-4.675
lor-4.636
 former-4.568
arians-4.563
Token anything
Token ID1997
Feature activation-1.00e+00

Pos logit contributions
 and+4.169
,+4.077
.+3.952
 the+3.853
 a+3.711
Neg logit contributions
umm-4.531
lor-4.527
ocol-4.460
arians-4.342
rol-4.336
Token either
Token ID2035
Feature activation-1.00e+00

Pos logit contributions
 and+1.866
,+1.842
.+1.675
 the+1.644
 a+1.525
Neg logit contributions
umm-6.743
lor-6.689
ocol-6.635
arians-6.584
rol-6.567
Token.
Token ID13
Feature activation-1.00e+00

Pos logit contributions
 and+0.867
,+0.832
.+0.671
 the+0.670
 a+0.547
Neg logit contributions
umm-7.744
lor-7.683
ocol-7.626
rol-7.553
arians-7.534
Token Tom
Token ID4186
Feature activation-1.00e+00

Pos logit contributions
 and+2.988
,+2.978
.+2.812
 the+2.709
 a+2.569
Neg logit contributions
umm-5.733
lor-5.694
ocol-5.615
 former-5.554
rol-5.549
Token was
Token ID373
Feature activation-1.00e+00

Pos logit contributions
,+3.043
 and+3.020
.+2.924
 the+2.838
 a+2.698
Neg logit contributions
umm-5.559
lor-5.535
arians-5.435
ocol-5.418
rol-5.404
Token scared
Token ID12008
Feature activation-1.00e+00

Pos logit contributions
,+3.713
 and+3.708
.+3.552
 the+3.422
 a+3.240
Neg logit contributions
umm-4.908
lor-4.853
ocol-4.853
 former-4.815
rol-4.811
Token.
Token ID13
Feature activation-1.00e+00

Pos logit contributions
 and+0.471
,+0.461
 the+0.298
.+0.297
 a+0.196
Neg logit contributions
umm-8.137
ocol-8.007
lor-7.950
rol-7.899
arians-7.868
Token
Token ID198
Feature activation-1.00e+00

Pos logit contributions
 and+3.122
,+3.105
.+2.928
 the+2.834
 a+2.691
Neg logit contributions
umm-5.637
lor-5.546
ocol-5.464
rol-5.417
 former-5.417
Token table
Token ID3084
Feature activation-1.00e+00

Pos logit contributions
 and+4.314
,+4.307
.+4.148
 the+4.051
 a+3.901
Neg logit contributions
umm-4.433
ocol-4.322
lor-4.282
 former-4.225
arians-4.215
Token did
Token ID750
Feature activation-1.00e+00

Pos logit contributions
 and+3.842
,+3.829
.+3.696
 the+3.638
 a+3.490
Neg logit contributions
umm-4.808
lor-4.750
ocol-4.670
rol-4.623
arians-4.594
Token not
Token ID407
Feature activation-1.00e+00

Pos logit contributions
 and+2.292
,+2.261
.+2.112
 the+2.038
 a+1.884
Neg logit contributions
umm-6.299
lor-6.292
ocol-6.236
 former-6.195
arians-6.153
Token say
Token ID910
Feature activation-1.00e+00

Pos logit contributions
 and+3.942
,+3.925
.+3.770
 the+3.703
 a+3.549
Neg logit contributions
umm-4.772
ocol-4.675
lor-4.636
 former-4.568
arians-4.563
Token anything
Token ID1997
Feature activation-1.00e+00

Pos logit contributions
 and+4.169
,+4.077
.+3.952
 the+3.853
 a+3.711
Neg logit contributions
umm-4.531
lor-4.527
ocol-4.460
arians-4.342
rol-4.336
Token either
Token ID2035
Feature activation-1.00e+00

Pos logit contributions
 and+1.866
,+1.842
.+1.675
 the+1.644
 a+1.525
Neg logit contributions
umm-6.743
lor-6.689
ocol-6.635
arians-6.584
rol-6.567
Token.
Token ID13
Feature activation-1.00e+00

Pos logit contributions
 and+0.867
,+0.832
.+0.671
 the+0.670
 a+0.547
Neg logit contributions
umm-7.744
lor-7.683
ocol-7.626
rol-7.553
arians-7.534
Token Tom
Token ID4186
Feature activation-1.00e+00

Pos logit contributions
 and+2.988
,+2.978
.+2.812
 the+2.709
 a+2.569
Neg logit contributions
umm-5.733
lor-5.694
ocol-5.615
 former-5.554
rol-5.549
Token was
Token ID373
Feature activation-1.00e+00

Pos logit contributions
,+3.043
 and+3.020
.+2.924
 the+2.838
 a+2.698
Neg logit contributions
umm-5.559
lor-5.535
arians-5.435
ocol-5.418
rol-5.404
Token scared
Token ID12008
Feature activation-1.00e+00

Pos logit contributions
,+3.713
 and+3.708
.+3.552
 the+3.422
 a+3.240
Neg logit contributions
umm-4.908
lor-4.853
ocol-4.853
 former-4.815
rol-4.811
Token.
Token ID13
Feature activation-1.00e+00

Pos logit contributions
 and+0.471
,+0.461
 the+0.298
.+0.297
 a+0.196
Neg logit contributions
umm-8.137
ocol-8.007
lor-7.950
rol-7.899
arians-7.868
Token?"
Token ID1701
Feature activation-1.00e+00

Pos logit contributions
 and+2.159
,+2.143
.+2.003
 the+1.989
 a+1.835
Neg logit contributions
umm-6.482
lor-6.399
ocol-6.325
arians-6.261
 former-6.256
Token The
Token ID383
Feature activation-1.00e+00

Pos logit contributions
 and+3.191
,+3.162
.+3.021
 the+2.893
 a+2.782
Neg logit contributions
umm-5.452
lor-5.395
ocol-5.387
 former-5.297
arians-5.282
Token table
Token ID3084
Feature activation-1.00e+00

Pos logit contributions
 and+4.314
,+4.307
.+4.148
 the+4.051
 a+3.901
Neg logit contributions
umm-4.433
ocol-4.322
lor-4.282
 former-4.225
arians-4.215
Token did
Token ID750
Feature activation-1.00e+00

Pos logit contributions
 and+3.842
,+3.829
.+3.696
 the+3.638
 a+3.490
Neg logit contributions
umm-4.808
lor-4.750
ocol-4.670
rol-4.623
arians-4.594
Token not
Token ID407
Feature activation-1.00e+00

Pos logit contributions
 and+2.292
,+2.261
.+2.112
 the+2.038
 a+1.884
Neg logit contributions
umm-6.299
lor-6.292
ocol-6.236
 former-6.195
arians-6.153
Token say
Token ID910
Feature activation-1.00e+00

Pos logit contributions
 and+3.942
,+3.925
.+3.770
 the+3.703
 a+3.549
Neg logit contributions
umm-4.772
ocol-4.675
lor-4.636
 former-4.568
arians-4.563
Token anything
Token ID1997
Feature activation-1.00e+00

Pos logit contributions
 and+4.169
,+4.077
.+3.952
 the+3.853
 a+3.711
Neg logit contributions
umm-4.531
lor-4.527
ocol-4.460
arians-4.342
rol-4.336
Token either
Token ID2035
Feature activation-1.00e+00

Pos logit contributions
 and+1.866
,+1.842
.+1.675
 the+1.644
 a+1.525
Neg logit contributions
umm-6.743
lor-6.689
ocol-6.635
arians-6.584
rol-6.567
Token.
Token ID13
Feature activation-1.00e+00

Pos logit contributions
 and+0.867
,+0.832
.+0.671
 the+0.670
 a+0.547
Neg logit contributions
umm-7.744
lor-7.683
ocol-7.626
rol-7.553
arians-7.534
Token Tom
Token ID4186
Feature activation-1.00e+00

Pos logit contributions
 and+2.988
,+2.978
.+2.812
 the+2.709
 a+2.569
Neg logit contributions
umm-5.733
lor-5.694
ocol-5.615
 former-5.554
rol-5.549
Token was
Token ID373
Feature activation-1.00e+00

Pos logit contributions
,+3.043
 and+3.020
.+2.924
 the+2.838
 a+2.698
Neg logit contributions
umm-5.559
lor-5.535
arians-5.435
ocol-5.418
rol-5.404
Token The
Token ID383
Feature activation-1.00e+00

Pos logit contributions
 and+3.191
,+3.162
.+3.021
 the+2.893
 a+2.782
Neg logit contributions
umm-5.452
lor-5.395
ocol-5.387
 former-5.297
arians-5.282
Token table
Token ID3084
Feature activation-1.00e+00

Pos logit contributions
 and+4.314
,+4.307
.+4.148
 the+4.051
 a+3.901
Neg logit contributions
umm-4.433
ocol-4.322
lor-4.282
 former-4.225
arians-4.215
Token did
Token ID750
Feature activation-1.00e+00

Pos logit contributions
 and+3.842
,+3.829
.+3.696
 the+3.638
 a+3.490
Neg logit contributions
umm-4.808
lor-4.750
ocol-4.670
rol-4.623
arians-4.594
Token not
Token ID407
Feature activation-1.00e+00

Pos logit contributions
 and+2.292
,+2.261
.+2.112
 the+2.038
 a+1.884
Neg logit contributions
umm-6.299
lor-6.292
ocol-6.236
 former-6.195
arians-6.153
Token say
Token ID910
Feature activation-1.00e+00

Pos logit contributions
 and+3.942
,+3.925
.+3.770
 the+3.703
 a+3.549
Neg logit contributions
umm-4.772
ocol-4.675
lor-4.636
 former-4.568
arians-4.563
Token anything
Token ID1997
Feature activation-1.00e+00

Pos logit contributions
 and+4.169
,+4.077
.+3.952
 the+3.853
 a+3.711
Neg logit contributions
umm-4.531
lor-4.527
ocol-4.460
arians-4.342
rol-4.336
Token either
Token ID2035
Feature activation-1.00e+00

Pos logit contributions
 and+1.866
,+1.842
.+1.675
 the+1.644
 a+1.525
Neg logit contributions
umm-6.743
lor-6.689
ocol-6.635
arians-6.584
rol-6.567
Token.
Token ID13
Feature activation-1.00e+00

Pos logit contributions
 and+0.867
,+0.832
.+0.671
 the+0.670
 a+0.547
Neg logit contributions
umm-7.744
lor-7.683
ocol-7.626
rol-7.553
arians-7.534
Token Tom
Token ID4186
Feature activation-1.00e+00

Pos logit contributions
 and+2.988
,+2.978
.+2.812
 the+2.709
 a+2.569
Neg logit contributions
umm-5.733
lor-5.694
ocol-5.615
 former-5.554
rol-5.549
Token was
Token ID373
Feature activation-1.00e+00

Pos logit contributions
,+3.043
 and+3.020
.+2.924
 the+2.838
 a+2.698
Neg logit contributions
umm-5.559
lor-5.535
arians-5.435
ocol-5.418
rol-5.404
Token scared
Token ID12008
Feature activation-1.00e+00

Pos logit contributions
,+3.713
 and+3.708
.+3.552
 the+3.422
 a+3.240
Neg logit contributions
umm-4.908
lor-4.853
ocol-4.853
 former-4.815
rol-4.811
TokenTom
Token ID13787
Feature activation-1.00e+00

Pos logit contributions
 and+3.749
,+3.726
.+3.575
 the+3.482
 a+3.344
Neg logit contributions
umm-5.023
lor-5.005
ocol-4.899
arians-4.851
rol-4.849
Token went
Token ID1816
Feature activation-1.00e+00

Pos logit contributions
,+2.995
 and+2.975
.+2.886
 the+2.802
 a+2.662
Neg logit contributions
lor-5.590
umm-5.590
arians-5.486
rol-5.456
ocol-5.448
Token to
Token ID284
Feature activation-1.00e+00

Pos logit contributions
 and+1.878
,+1.871
.+1.718
 the+1.632
 a+1.501
Neg logit contributions
umm-6.763
lor-6.739
ocol-6.658
rol-6.655
arians-6.615
Token his
Token ID465
Feature activation-1.00e+00

Pos logit contributions
 and+2.410
,+2.396
.+2.230
 the+2.033
 a+1.941
Neg logit contributions
umm-6.231
ocol-6.202
lor-6.165
 former-6.112
arians-6.083
Token bed
Token ID3996
Feature activation-1.00e+00

Pos logit contributions
 and+3.812
,+3.802
.+3.656
 the+3.576
 a+3.427
Neg logit contributions
umm-4.906
lor-4.777
 former-4.770
arians-4.728
ocol-4.715
Token and
Token ID290
Feature activation-1.00e+00

Pos logit contributions
,+1.184
 and+1.183
 the+1.035
.+1.026
 a+0.882
Neg logit contributions
umm-7.412
lor-7.400
 former-7.258
arians-7.228
rol-7.205
Token hid
Token ID24519
Feature activation-1.00e+00

Pos logit contributions
 and+3.637
,+3.636
.+3.497
 the+3.338
 a+3.204
Neg logit contributions
lor-4.959
umm-4.925
 former-4.863
ocol-4.854
arians-4.823
Token under
Token ID739
Feature activation-1.00e+00

Pos logit contributions
,+3.041
 and+3.036
.+2.809
 the+2.728
 a+2.632
Neg logit contributions
umm-5.597
ocol-5.581
lor-5.497
rol-5.437
arians-5.417
Token the
Token ID262
Feature activation-1.00e+00

Pos logit contributions
 and+1.538
,+1.520
.+1.350
 the+1.174
 a+1.045
Neg logit contributions
umm-7.161
ocol-7.108
lor-7.077
 former-6.975
arians-6.934
Token covers
Token ID8698
Feature activation-1.00e+00

Pos logit contributions
,+4.516
 and+4.512
.+4.342
 the+4.283
 a+4.161
Neg logit contributions
umm-4.196
lor-4.096
ocol-4.084
rol-3.956
 former-3.954
Token.
Token ID13
Feature activation-1.00e+00

Pos logit contributions
 and+0.768
,+0.762
.+0.552
 the+0.530
 a+0.423
Neg logit contributions
umm-7.904
lor-7.853
ocol-7.765
rol-7.732
arians-7.664
Token
Token ID198
Feature activation-1.00e+00

Pos logit contributions
 and+1.361
,+1.345
.+1.158
 the+1.066
 a+0.944
Neg logit contributions
lor-7.317
umm-7.261
ocol-7.188
rol-7.159
 former-7.107
TokenTom
Token ID13787
Feature activation-1.00e+00

Pos logit contributions
 and+3.749
,+3.726
.+3.575
 the+3.482
 a+3.344
Neg logit contributions
umm-5.023
lor-5.005
ocol-4.899
arians-4.851
rol-4.849
Token went
Token ID1816
Feature activation-1.00e+00

Pos logit contributions
,+2.995
 and+2.975
.+2.886
 the+2.802
 a+2.662
Neg logit contributions
lor-5.590
umm-5.590
arians-5.486
rol-5.456
ocol-5.448
Token to
Token ID284
Feature activation-1.00e+00

Pos logit contributions
 and+1.878
,+1.871
.+1.718
 the+1.632
 a+1.501
Neg logit contributions
umm-6.763
lor-6.739
ocol-6.658
rol-6.655
arians-6.615
Token his
Token ID465
Feature activation-1.00e+00

Pos logit contributions
 and+2.410
,+2.396
.+2.230
 the+2.033
 a+1.941
Neg logit contributions
umm-6.231
ocol-6.202
lor-6.165
 former-6.112
arians-6.083
Token bed
Token ID3996
Feature activation-1.00e+00

Pos logit contributions
 and+3.812
,+3.802
.+3.656
 the+3.576
 a+3.427
Neg logit contributions
umm-4.906
lor-4.777
 former-4.770
arians-4.728
ocol-4.715
Token and
Token ID290
Feature activation-1.00e+00

Pos logit contributions
,+1.184
 and+1.183
 the+1.035
.+1.026
 a+0.882
Neg logit contributions
umm-7.412
lor-7.400
 former-7.258
arians-7.228
rol-7.205
Token hid
Token ID24519
Feature activation-1.00e+00

Pos logit contributions
 and+3.637
,+3.636
.+3.497
 the+3.338
 a+3.204
Neg logit contributions
lor-4.959
umm-4.925
 former-4.863
ocol-4.854
arians-4.823
Token under
Token ID739
Feature activation-1.00e+00

Pos logit contributions
,+3.041
 and+3.036
.+2.809
 the+2.728
 a+2.632
Neg logit contributions
umm-5.597
ocol-5.581
lor-5.497
rol-5.437
arians-5.417
Token the
Token ID262
Feature activation-1.00e+00

Pos logit contributions
 and+1.538
,+1.520
.+1.350
 the+1.174
 a+1.045
Neg logit contributions
umm-7.161
ocol-7.108
lor-7.077
 former-6.975
arians-6.934
Token covers
Token ID8698
Feature activation-1.00e+00

Pos logit contributions
,+4.516
 and+4.512
.+4.342
 the+4.283
 a+4.161
Neg logit contributions
umm-4.196
lor-4.096
ocol-4.084
rol-3.956
 former-3.954
Token.
Token ID13
Feature activation-1.00e+00

Pos logit contributions
 and+0.471
,+0.461
 the+0.298
.+0.297
 a+0.196
Neg logit contributions
umm-8.137
ocol-8.007
lor-7.950
rol-7.899
arians-7.868
Token
Token ID198
Feature activation-1.00e+00

Pos logit contributions
 and+3.122
,+3.105
.+2.928
 the+2.834
 a+2.691
Neg logit contributions
umm-5.637
lor-5.546
ocol-5.464
rol-5.417
 former-5.417
Token
Token ID198
Feature activation-1.00e+00

Pos logit contributions
 and+1.361
,+1.345
.+1.158
 the+1.066
 a+0.944
Neg logit contributions
lor-7.317
umm-7.261
ocol-7.188
rol-7.159
 former-7.107
TokenTom
Token ID13787
Feature activation-1.00e+00

Pos logit contributions
 and+3.749
,+3.726
.+3.575
 the+3.482
 a+3.344
Neg logit contributions
umm-5.023
lor-5.005
ocol-4.899
arians-4.851
rol-4.849
Token went
Token ID1816
Feature activation-1.00e+00

Pos logit contributions
,+2.995
 and+2.975
.+2.886
 the+2.802
 a+2.662
Neg logit contributions
lor-5.590
umm-5.590
arians-5.486
rol-5.456
ocol-5.448
Token to
Token ID284
Feature activation-1.00e+00

Pos logit contributions
 and+1.878
,+1.871
.+1.718
 the+1.632
 a+1.501
Neg logit contributions
umm-6.763
lor-6.739
ocol-6.658
rol-6.655
arians-6.615
Token his
Token ID465
Feature activation-1.00e+00

Pos logit contributions
 and+2.410
,+2.396
.+2.230
 the+2.033
 a+1.941
Neg logit contributions
umm-6.231
ocol-6.202
lor-6.165
 former-6.112
arians-6.083
Token bed
Token ID3996
Feature activation-1.00e+00

Pos logit contributions
 and+3.812
,+3.802
.+3.656
 the+3.576
 a+3.427
Neg logit contributions
umm-4.906
lor-4.777
 former-4.770
arians-4.728
ocol-4.715
Token and
Token ID290
Feature activation-1.00e+00

Pos logit contributions
,+1.184
 and+1.183
 the+1.035
.+1.026
 a+0.882
Neg logit contributions
umm-7.412
lor-7.400
 former-7.258
arians-7.228
rol-7.205
Token hid
Token ID24519
Feature activation-1.00e+00

Pos logit contributions
 and+3.637
,+3.636
.+3.497
 the+3.338
 a+3.204
Neg logit contributions
lor-4.959
umm-4.925
 former-4.863
ocol-4.854
arians-4.823
Token under
Token ID739
Feature activation-1.00e+00

Pos logit contributions
,+3.041
 and+3.036
.+2.809
 the+2.728
 a+2.632
Neg logit contributions
umm-5.597
ocol-5.581
lor-5.497
rol-5.437
arians-5.417
Token
Token ID198
Feature activation-1.00e+00

Pos logit contributions
 and+3.122
,+3.105
.+2.928
 the+2.834
 a+2.691
Neg logit contributions
umm-5.637
lor-5.546
ocol-5.464
rol-5.417
 former-5.417
Token
Token ID198
Feature activation-1.00e+00

Pos logit contributions
 and+1.361
,+1.345
.+1.158
 the+1.066
 a+0.944
Neg logit contributions
lor-7.317
umm-7.261
ocol-7.188
rol-7.159
 former-7.107
TokenTom
Token ID13787
Feature activation-1.00e+00

Pos logit contributions
 and+3.749
,+3.726
.+3.575
 the+3.482
 a+3.344
Neg logit contributions
umm-5.023
lor-5.005
ocol-4.899
arians-4.851
rol-4.849
Token went
Token ID1816
Feature activation-1.00e+00

Pos logit contributions
,+2.995
 and+2.975
.+2.886
 the+2.802
 a+2.662
Neg logit contributions
lor-5.590
umm-5.590
arians-5.486
rol-5.456
ocol-5.448
Token to
Token ID284
Feature activation-1.00e+00

Pos logit contributions
 and+1.878
,+1.871
.+1.718
 the+1.632
 a+1.501
Neg logit contributions
umm-6.763
lor-6.739
ocol-6.658
rol-6.655
arians-6.615
Token his
Token ID465
Feature activation-1.00e+00

Pos logit contributions
 and+2.410
,+2.396
.+2.230
 the+2.033
 a+1.941
Neg logit contributions
umm-6.231
ocol-6.202
lor-6.165
 former-6.112
arians-6.083
Token bed
Token ID3996
Feature activation-1.00e+00

Pos logit contributions
 and+3.812
,+3.802
.+3.656
 the+3.576
 a+3.427
Neg logit contributions
umm-4.906
lor-4.777
 former-4.770
arians-4.728
ocol-4.715
Token and
Token ID290
Feature activation-1.00e+00

Pos logit contributions
,+1.184
 and+1.183
 the+1.035
.+1.026
 a+0.882
Neg logit contributions
umm-7.412
lor-7.400
 former-7.258
arians-7.228
rol-7.205
Token hid
Token ID24519
Feature activation-1.00e+00

Pos logit contributions
 and+3.637
,+3.636
.+3.497
 the+3.338
 a+3.204
Neg logit contributions
lor-4.959
umm-4.925
 former-4.863
ocol-4.854
arians-4.823
Token under
Token ID739
Feature activation-1.00e+00

Pos logit contributions
,+3.041
 and+3.036
.+2.809
 the+2.728
 a+2.632
Neg logit contributions
umm-5.597
ocol-5.581
lor-5.497
rol-5.437
arians-5.417
Token the
Token ID262
Feature activation-1.00e+00

Pos logit contributions
 and+1.538
,+1.520
.+1.350
 the+1.174
 a+1.045
Neg logit contributions
umm-7.161
ocol-7.108
lor-7.077
 former-6.975
arians-6.934
Token talking
Token ID3375
Feature activation-1.00e+00

Pos logit contributions
 and+3.778
,+3.724
.+3.582
 the+3.540
 a+3.380
Neg logit contributions
umm-4.826
lor-4.819
ocol-4.752
 former-4.720
rol-4.708
Token,
Token ID11
Feature activation-1.00e+00

Pos logit contributions
,+2.824
 and+2.818
.+2.652
 the+2.624
 a+2.463
Neg logit contributions
umm-5.797
lor-5.763
ocol-5.653
 former-5.646
rol-5.639
Token table
Token ID3084
Feature activation-1.00e+00

Pos logit contributions
,+3.786
 and+3.772
.+3.674
 the+3.542
 a+3.418
Neg logit contributions
umm-4.873
ocol-4.757
lor-4.755
 former-4.715
arians-4.695
Token?"
Token ID1701
Feature activation-1.00e+00

Pos logit contributions
 and+2.159
,+2.143
.+2.003
 the+1.989
 a+1.835
Neg logit contributions
umm-6.482
lor-6.399
ocol-6.325
arians-6.261
 former-6.256
Token The
Token ID383
Feature activation-1.00e+00

Pos logit contributions
 and+3.191
,+3.162
.+3.021
 the+2.893
 a+2.782
Neg logit contributions
umm-5.452
lor-5.395
ocol-5.387
 former-5.297
arians-5.282
Token table
Token ID3084
Feature activation-1.00e+00

Pos logit contributions
 and+4.314
,+4.307
.+4.148
 the+4.051
 a+3.901
Neg logit contributions
umm-4.433
ocol-4.322
lor-4.282
 former-4.225
arians-4.215
Token did
Token ID750
Feature activation-1.00e+00

Pos logit contributions
 and+3.842
,+3.829
.+3.696
 the+3.638
 a+3.490
Neg logit contributions
umm-4.808
lor-4.750
ocol-4.670
rol-4.623
arians-4.594
Token not
Token ID407
Feature activation-1.00e+00

Pos logit contributions
 and+2.292
,+2.261
.+2.112
 the+2.038
 a+1.884
Neg logit contributions
umm-6.299
lor-6.292
ocol-6.236
 former-6.195
arians-6.153
Token say
Token ID910
Feature activation-1.00e+00

Pos logit contributions
 and+3.942
,+3.925
.+3.770
 the+3.703
 a+3.549
Neg logit contributions
umm-4.772
ocol-4.675
lor-4.636
 former-4.568
arians-4.563
Token anything
Token ID1997
Feature activation-1.00e+00

Pos logit contributions
 and+4.169
,+4.077
.+3.952
 the+3.853
 a+3.711
Neg logit contributions
umm-4.531
lor-4.527
ocol-4.460
arians-4.342
rol-4.336
Token either
Token ID2035
Feature activation-1.00e+00

Pos logit contributions
 and+1.866
,+1.842
.+1.675
 the+1.644
 a+1.525
Neg logit contributions
umm-6.743
lor-6.689
ocol-6.635
arians-6.584
rol-6.567
Token you
Token ID345
Feature activation-1.00e+00

Pos logit contributions
 and+3.274
,+3.271
.+3.151
 the+3.054
 a+2.945
Neg logit contributions
umm-5.337
lor-5.270
arians-5.228
ocol-5.204
rol-5.180
Token talking
Token ID3375
Feature activation-1.00e+00

Pos logit contributions
 and+3.778
,+3.724
.+3.582
 the+3.540
 a+3.380
Neg logit contributions
umm-4.826
lor-4.819
ocol-4.752
 former-4.720
rol-4.708
Token,
Token ID11
Feature activation-1.00e+00

Pos logit contributions
,+2.824
 and+2.818
.+2.652
 the+2.624
 a+2.463
Neg logit contributions
umm-5.797
lor-5.763
ocol-5.653
 former-5.646
rol-5.639
Token table
Token ID3084
Feature activation-1.00e+00

Pos logit contributions
,+3.786
 and+3.772
.+3.674
 the+3.542
 a+3.418
Neg logit contributions
umm-4.873
ocol-4.757
lor-4.755
 former-4.715
arians-4.695
Token?"
Token ID1701
Feature activation-1.00e+00

Pos logit contributions
 and+2.159
,+2.143
.+2.003
 the+1.989
 a+1.835
Neg logit contributions
umm-6.482
lor-6.399
ocol-6.325
arians-6.261
 former-6.256
Token The
Token ID383
Feature activation-1.00e+00

Pos logit contributions
 and+3.191
,+3.162
.+3.021
 the+2.893
 a+2.782
Neg logit contributions
umm-5.452
lor-5.395
ocol-5.387
 former-5.297
arians-5.282
Token table
Token ID3084
Feature activation-1.00e+00

Pos logit contributions
 and+4.314
,+4.307
.+4.148
 the+4.051
 a+3.901
Neg logit contributions
umm-4.433
ocol-4.322
lor-4.282
 former-4.225
arians-4.215
Token did
Token ID750
Feature activation-1.00e+00

Pos logit contributions
 and+3.842
,+3.829
.+3.696
 the+3.638
 a+3.490
Neg logit contributions
umm-4.808
lor-4.750
ocol-4.670
rol-4.623
arians-4.594
Token not
Token ID407
Feature activation-1.00e+00

Pos logit contributions
 and+2.292
,+2.261
.+2.112
 the+2.038
 a+1.884
Neg logit contributions
umm-6.299
lor-6.292
ocol-6.236
 former-6.195
arians-6.153
Token say
Token ID910
Feature activation-1.00e+00

Pos logit contributions
 and+3.942
,+3.925
.+3.770
 the+3.703
 a+3.549
Neg logit contributions
umm-4.772
ocol-4.675
lor-4.636
 former-4.568
arians-4.563
Token anything
Token ID1997
Feature activation-1.00e+00

Pos logit contributions
 and+4.169
,+4.077
.+3.952
 the+3.853
 a+3.711
Neg logit contributions
umm-4.531
lor-4.527
ocol-4.460
arians-4.342
rol-4.336
Token,
Token ID11
Feature activation-1.00e+00

Pos logit contributions
,+2.824
 and+2.818
.+2.652
 the+2.624
 a+2.463
Neg logit contributions
umm-5.797
lor-5.763
ocol-5.653
 former-5.646
rol-5.639
Token table
Token ID3084
Feature activation-1.00e+00

Pos logit contributions
,+3.786
 and+3.772
.+3.674
 the+3.542
 a+3.418
Neg logit contributions
umm-4.873
ocol-4.757
lor-4.755
 former-4.715
arians-4.695
Token?"
Token ID1701
Feature activation-1.00e+00

Pos logit contributions
 and+2.159
,+2.143
.+2.003
 the+1.989
 a+1.835
Neg logit contributions
umm-6.482
lor-6.399
ocol-6.325
arians-6.261
 former-6.256
Token The
Token ID383
Feature activation-1.00e+00

Pos logit contributions
 and+3.191
,+3.162
.+3.021
 the+2.893
 a+2.782
Neg logit contributions
umm-5.452
lor-5.395
ocol-5.387
 former-5.297
arians-5.282
Token table
Token ID3084
Feature activation-1.00e+00

Pos logit contributions
 and+4.314
,+4.307
.+4.148
 the+4.051
 a+3.901
Neg logit contributions
umm-4.433
ocol-4.322
lor-4.282
 former-4.225
arians-4.215
Token did
Token ID750
Feature activation-1.00e+00

Pos logit contributions
 and+3.842
,+3.829
.+3.696
 the+3.638
 a+3.490
Neg logit contributions
umm-4.808
lor-4.750
ocol-4.670
rol-4.623
arians-4.594
Token not
Token ID407
Feature activation-1.00e+00

Pos logit contributions
 and+2.292
,+2.261
.+2.112
 the+2.038
 a+1.884
Neg logit contributions
umm-6.299
lor-6.292
ocol-6.236
 former-6.195
arians-6.153
Token say
Token ID910
Feature activation-1.00e+00

Pos logit contributions
 and+3.942
,+3.925
.+3.770
 the+3.703
 a+3.549
Neg logit contributions
umm-4.772
ocol-4.675
lor-4.636
 former-4.568
arians-4.563
Token anything
Token ID1997
Feature activation-1.00e+00

Pos logit contributions
 and+4.169
,+4.077
.+3.952
 the+3.853
 a+3.711
Neg logit contributions
umm-4.531
lor-4.527
ocol-4.460
arians-4.342
rol-4.336
Token either
Token ID2035
Feature activation-1.00e+00

Pos logit contributions
 and+1.866
,+1.842
.+1.675
 the+1.644
 a+1.525
Neg logit contributions
umm-6.743
lor-6.689
ocol-6.635
arians-6.584
rol-6.567
Token.
Token ID13
Feature activation-1.00e+00

Pos logit contributions
 and+0.867
,+0.832
.+0.671
 the+0.670
 a+0.547
Neg logit contributions
umm-7.744
lor-7.683
ocol-7.626
rol-7.553
arians-7.534
Token table
Token ID3084
Feature activation-1.00e+00

Pos logit contributions
,+3.786
 and+3.772
.+3.674
 the+3.542
 a+3.418
Neg logit contributions
umm-4.873
ocol-4.757
lor-4.755
 former-4.715
arians-4.695
Token?"
Token ID1701
Feature activation-1.00e+00

Pos logit contributions
 and+2.159
,+2.143
.+2.003
 the+1.989
 a+1.835
Neg logit contributions
umm-6.482
lor-6.399
ocol-6.325
arians-6.261
 former-6.256
Token The
Token ID383
Feature activation-1.00e+00

Pos logit contributions
 and+3.191
,+3.162
.+3.021
 the+2.893
 a+2.782
Neg logit contributions
umm-5.452
lor-5.395
ocol-5.387
 former-5.297
arians-5.282
Token table
Token ID3084
Feature activation-1.00e+00

Pos logit contributions
 and+4.314
,+4.307
.+4.148
 the+4.051
 a+3.901
Neg logit contributions
umm-4.433
ocol-4.322
lor-4.282
 former-4.225
arians-4.215
Token did
Token ID750
Feature activation-1.00e+00

Pos logit contributions
 and+3.842
,+3.829
.+3.696
 the+3.638
 a+3.490
Neg logit contributions
umm-4.808
lor-4.750
ocol-4.670
rol-4.623
arians-4.594
Token not
Token ID407
Feature activation-1.00e+00

Pos logit contributions
 and+2.292
,+2.261
.+2.112
 the+2.038
 a+1.884
Neg logit contributions
umm-6.299
lor-6.292
ocol-6.236
 former-6.195
arians-6.153
Token say
Token ID910
Feature activation-1.00e+00

Pos logit contributions
 and+3.942
,+3.925
.+3.770
 the+3.703
 a+3.549
Neg logit contributions
umm-4.772
ocol-4.675
lor-4.636
 former-4.568
arians-4.563
Token anything
Token ID1997
Feature activation-1.00e+00

Pos logit contributions
 and+4.169
,+4.077
.+3.952
 the+3.853
 a+3.711
Neg logit contributions
umm-4.531
lor-4.527
ocol-4.460
arians-4.342
rol-4.336
Token either
Token ID2035
Feature activation-1.00e+00

Pos logit contributions
 and+1.866
,+1.842
.+1.675
 the+1.644
 a+1.525
Neg logit contributions
umm-6.743
lor-6.689
ocol-6.635
arians-6.584
rol-6.567
Token.
Token ID13
Feature activation-1.00e+00

Pos logit contributions
 and+0.867
,+0.832
.+0.671
 the+0.670
 a+0.547
Neg logit contributions
umm-7.744
lor-7.683
ocol-7.626
rol-7.553
arians-7.534
Token Tom
Token ID4186
Feature activation-1.00e+00

Pos logit contributions
 and+2.988
,+2.978
.+2.812
 the+2.709
 a+2.569
Neg logit contributions
umm-5.733
lor-5.694
ocol-5.615
 former-5.554
rol-5.549
Token either
Token ID2035
Feature activation-1.00e+00

Pos logit contributions
 and+1.866
,+1.842
.+1.675
 the+1.644
 a+1.525
Neg logit contributions
umm-6.743
lor-6.689
ocol-6.635
arians-6.584
rol-6.567
Token.
Token ID13
Feature activation-1.00e+00

Pos logit contributions
 and+0.867
,+0.832
.+0.671
 the+0.670
 a+0.547
Neg logit contributions
umm-7.744
lor-7.683
ocol-7.626
rol-7.553
arians-7.534
Token Tom
Token ID4186
Feature activation-1.00e+00

Pos logit contributions
 and+2.988
,+2.978
.+2.812
 the+2.709
 a+2.569
Neg logit contributions
umm-5.733
lor-5.694
ocol-5.615
 former-5.554
rol-5.549
Token was
Token ID373
Feature activation-1.00e+00

Pos logit contributions
,+3.043
 and+3.020
.+2.924
 the+2.838
 a+2.698
Neg logit contributions
umm-5.559
lor-5.535
arians-5.435
ocol-5.418
rol-5.404
Token scared
Token ID12008
Feature activation-1.00e+00

Pos logit contributions
,+3.713
 and+3.708
.+3.552
 the+3.422
 a+3.240
Neg logit contributions
umm-4.908
lor-4.853
ocol-4.853
 former-4.815
rol-4.811
Token.
Token ID13
Feature activation-1.00e+00

Pos logit contributions
 and+0.471
,+0.461
 the+0.298
.+0.297
 a+0.196
Neg logit contributions
umm-8.137
ocol-8.007
lor-7.950
rol-7.899
arians-7.868
Token\n
Token ID198
Feature activation-1.00e+00

Pos logit contributions
 and+3.122
,+3.105
.+2.928
 the+2.834
 a+2.691
Neg logit contributions
umm-5.637
lor-5.546
ocol-5.464
rol-5.417
 former-5.417
Token\n
Token ID198
Feature activation-1.00e+00

Pos logit contributions
 and+1.361
,+1.345
.+1.158
 the+1.066
 a+0.944
Neg logit contributions
lor-7.317
umm-7.261
ocol-7.188
rol-7.159
 former-7.107
TokenTom
Token ID13787
Feature activation-1.00e+00

Pos logit contributions
 and+3.749
,+3.726
.+3.575
 the+3.482
 a+3.344
Neg logit contributions
umm-5.023
lor-5.005
ocol-4.899
arians-4.851
rol-4.849
Token went
Token ID1816
Feature activation-1.00e+00

Pos logit contributions
,+2.995
 and+2.975
.+2.886
 the+2.802
 a+2.662
Neg logit contributions
lor-5.590
umm-5.590
arians-5.486
rol-5.456
ocol-5.448
Token to
Token ID284
Feature activation-1.00e+00

Pos logit contributions
 and+1.878
,+1.871
.+1.718
 the+1.632
 a+1.501
Neg logit contributions
umm-6.763
lor-6.739
ocol-6.658
rol-6.655
arians-6.615
Token anything
Token ID1997
Feature activation-1.00e+00

Pos logit contributions
 and+4.169
,+4.077
.+3.952
 the+3.853
 a+3.711
Neg logit contributions
umm-4.531
lor-4.527
ocol-4.460
arians-4.342
rol-4.336
Token either
Token ID2035
Feature activation-1.00e+00

Pos logit contributions
 and+1.866
,+1.842
.+1.675
 the+1.644
 a+1.525
Neg logit contributions
umm-6.743
lor-6.689
ocol-6.635
arians-6.584
rol-6.567
Token.
Token ID13
Feature activation-1.00e+00

Pos logit contributions
 and+0.867
,+0.832
.+0.671
 the+0.670
 a+0.547
Neg logit contributions
umm-7.744
lor-7.683
ocol-7.626
rol-7.553
arians-7.534
Token Tom
Token ID4186
Feature activation-1.00e+00

Pos logit contributions
 and+2.988
,+2.978
.+2.812
 the+2.709
 a+2.569
Neg logit contributions
umm-5.733
lor-5.694
ocol-5.615
 former-5.554
rol-5.549
Token was
Token ID373
Feature activation-1.00e+00

Pos logit contributions
,+3.043
 and+3.020
.+2.924
 the+2.838
 a+2.698
Neg logit contributions
umm-5.559
lor-5.535
arians-5.435
ocol-5.418
rol-5.404
Token scared
Token ID12008
Feature activation-1.00e+00

Pos logit contributions
,+3.713
 and+3.708
.+3.552
 the+3.422
 a+3.240
Neg logit contributions
umm-4.908
lor-4.853
ocol-4.853
 former-4.815
rol-4.811
Token.
Token ID13
Feature activation-1.00e+00

Pos logit contributions
 and+0.471
,+0.461
 the+0.298
.+0.297
 a+0.196
Neg logit contributions
umm-8.137
ocol-8.007
lor-7.950
rol-7.899
arians-7.868
Token\n
Token ID198
Feature activation-1.00e+00

Pos logit contributions
 and+3.122
,+3.105
.+2.928
 the+2.834
 a+2.691
Neg logit contributions
umm-5.637
lor-5.546
ocol-5.464
rol-5.417
 former-5.417
Token\n
Token ID198
Feature activation-1.00e+00

Pos logit contributions
 and+1.361
,+1.345
.+1.158
 the+1.066
 a+0.944
Neg logit contributions
lor-7.317
umm-7.261
ocol-7.188
rol-7.159
 former-7.107
TokenTom
Token ID13787
Feature activation-1.00e+00

Pos logit contributions
 and+3.749
,+3.726
.+3.575
 the+3.482
 a+3.344
Neg logit contributions
umm-5.023
lor-5.005
ocol-4.899
arians-4.851
rol-4.849
Token went
Token ID1816
Feature activation-1.00e+00

Pos logit contributions
,+2.995
 and+2.975
.+2.886
 the+2.802
 a+2.662
Neg logit contributions
lor-5.590
umm-5.590
arians-5.486
rol-5.456
ocol-5.448
Token not
Token ID407
Feature activation-1.00e+00

Pos logit contributions
 and+2.292
,+2.261
.+2.112
 the+2.038
 a+1.884
Neg logit contributions
umm-6.299
lor-6.292
ocol-6.236
 former-6.195
arians-6.153
Token say
Token ID910
Feature activation-1.00e+00

Pos logit contributions
 and+3.942
,+3.925
.+3.770
 the+3.703
 a+3.549
Neg logit contributions
umm-4.772
ocol-4.675
lor-4.636
 former-4.568
arians-4.563
Token anything
Token ID1997
Feature activation-1.00e+00

Pos logit contributions
 and+4.169
,+4.077
.+3.952
 the+3.853
 a+3.711
Neg logit contributions
umm-4.531
lor-4.527
ocol-4.460
arians-4.342
rol-4.336
Token either
Token ID2035
Feature activation-1.00e+00

Pos logit contributions
 and+1.866
,+1.842
.+1.675
 the+1.644
 a+1.525
Neg logit contributions
umm-6.743
lor-6.689
ocol-6.635
arians-6.584
rol-6.567
Token.
Token ID13
Feature activation-1.00e+00

Pos logit contributions
 and+0.867
,+0.832
.+0.671
 the+0.670
 a+0.547
Neg logit contributions
umm-7.744
lor-7.683
ocol-7.626
rol-7.553
arians-7.534
Token Tom
Token ID4186
Feature activation-1.00e+00

Pos logit contributions
 and+2.988
,+2.978
.+2.812
 the+2.709
 a+2.569
Neg logit contributions
umm-5.733
lor-5.694
ocol-5.615
 former-5.554
rol-5.549
Token was
Token ID373
Feature activation-1.00e+00

Pos logit contributions
,+3.043
 and+3.020
.+2.924
 the+2.838
 a+2.698
Neg logit contributions
umm-5.559
lor-5.535
arians-5.435
ocol-5.418
rol-5.404
Token scared
Token ID12008
Feature activation-1.00e+00

Pos logit contributions
,+3.713
 and+3.708
.+3.552
 the+3.422
 a+3.240
Neg logit contributions
umm-4.908
lor-4.853
ocol-4.853
 former-4.815
rol-4.811
Token.
Token ID13
Feature activation-1.00e+00

Pos logit contributions
 and+0.471
,+0.461
 the+0.298
.+0.297
 a+0.196
Neg logit contributions
umm-8.137
ocol-8.007
lor-7.950
rol-7.899
arians-7.868
Token\n
Token ID198
Feature activation-1.00e+00

Pos logit contributions
 and+3.122
,+3.105
.+2.928
 the+2.834
 a+2.691
Neg logit contributions
umm-5.637
lor-5.546
ocol-5.464
rol-5.417
 former-5.417
Token\n
Token ID198
Feature activation-1.00e+00

Pos logit contributions
 and+1.361
,+1.345
.+1.158
 the+1.066
 a+0.944
Neg logit contributions
lor-7.317
umm-7.261
ocol-7.188
rol-7.159
 former-7.107
Token say
Token ID910
Feature activation-1.00e+00

Pos logit contributions
 and+3.942
,+3.925
.+3.770
 the+3.703
 a+3.549
Neg logit contributions
umm-4.772
ocol-4.675
lor-4.636
 former-4.568
arians-4.563
Token anything
Token ID1997
Feature activation-1.00e+00

Pos logit contributions
 and+4.169
,+4.077
.+3.952
 the+3.853
 a+3.711
Neg logit contributions
umm-4.531
lor-4.527
ocol-4.460
arians-4.342
rol-4.336
Token either
Token ID2035
Feature activation-1.00e+00

Pos logit contributions
 and+1.866
,+1.842
.+1.675
 the+1.644
 a+1.525
Neg logit contributions
umm-6.743
lor-6.689
ocol-6.635
arians-6.584
rol-6.567
Token.
Token ID13
Feature activation-1.00e+00

Pos logit contributions
 and+0.867
,+0.832
.+0.671
 the+0.670
 a+0.547
Neg logit contributions
umm-7.744
lor-7.683
ocol-7.626
rol-7.553
arians-7.534
Token Tom
Token ID4186
Feature activation-1.00e+00

Pos logit contributions
 and+2.988
,+2.978
.+2.812
 the+2.709
 a+2.569
Neg logit contributions
umm-5.733
lor-5.694
ocol-5.615
 former-5.554
rol-5.549
Token was
Token ID373
Feature activation-1.00e+00

Pos logit contributions
,+3.043
 and+3.020
.+2.924
 the+2.838
 a+2.698
Neg logit contributions
umm-5.559
lor-5.535
arians-5.435
ocol-5.418
rol-5.404
Token scared
Token ID12008
Feature activation-1.00e+00

Pos logit contributions
,+3.713
 and+3.708
.+3.552
 the+3.422
 a+3.240
Neg logit contributions
umm-4.908
lor-4.853
ocol-4.853
 former-4.815
rol-4.811
Token.
Token ID13
Feature activation-1.00e+00

Pos logit contributions
 and+0.471
,+0.461
 the+0.298
.+0.297
 a+0.196
Neg logit contributions
umm-8.137
ocol-8.007
lor-7.950
rol-7.899
arians-7.868
Token\n
Token ID198
Feature activation-1.00e+00

Pos logit contributions
 and+3.122
,+3.105
.+2.928
 the+2.834
 a+2.691
Neg logit contributions
umm-5.637
lor-5.546
ocol-5.464
rol-5.417
 former-5.417
Token\n
Token ID198
Feature activation-1.00e+00

Pos logit contributions
 and+1.361
,+1.345
.+1.158
 the+1.066
 a+0.944
Neg logit contributions
lor-7.317
umm-7.261
ocol-7.188
rol-7.159
 former-7.107
TokenTom
Token ID13787
Feature activation-1.00e+00

Pos logit contributions
 and+3.749
,+3.726
.+3.575
 the+3.482
 a+3.344
Neg logit contributions
umm-5.023
lor-5.005
ocol-4.899
arians-4.851
rol-4.849
Token Tom
Token ID4186
Feature activation-1.00e+00

Pos logit contributions
 and+2.988
,+2.978
.+2.812
 the+2.709
 a+2.569
Neg logit contributions
umm-5.733
lor-5.694
ocol-5.615
 former-5.554
rol-5.549
Token was
Token ID373
Feature activation-1.00e+00

Pos logit contributions
,+3.043
 and+3.020
.+2.924
 the+2.838
 a+2.698
Neg logit contributions
umm-5.559
lor-5.535
arians-5.435
ocol-5.418
rol-5.404
Token scared
Token ID12008
Feature activation-1.00e+00

Pos logit contributions
,+3.713
 and+3.708
.+3.552
 the+3.422
 a+3.240
Neg logit contributions
umm-4.908
lor-4.853
ocol-4.853
 former-4.815
rol-4.811
Token.
Token ID13
Feature activation-1.00e+00

Pos logit contributions
 and+0.471
,+0.461
 the+0.298
.+0.297
 a+0.196
Neg logit contributions
umm-8.137
ocol-8.007
lor-7.950
rol-7.899
arians-7.868
Token\n
Token ID198
Feature activation-1.00e+00

Pos logit contributions
 and+3.122
,+3.105
.+2.928
 the+2.834
 a+2.691
Neg logit contributions
umm-5.637
lor-5.546
ocol-5.464
rol-5.417
 former-5.417
Token\n
Token ID198
Feature activation-1.00e+00

Pos logit contributions
 and+1.361
,+1.345
.+1.158
 the+1.066
 a+0.944
Neg logit contributions
lor-7.317
umm-7.261
ocol-7.188
rol-7.159
 former-7.107
TokenTom
Token ID13787
Feature activation-1.00e+00

Pos logit contributions
 and+3.749
,+3.726
.+3.575
 the+3.482
 a+3.344
Neg logit contributions
umm-5.023
lor-5.005
ocol-4.899
arians-4.851
rol-4.849
Token went
Token ID1816
Feature activation-1.00e+00

Pos logit contributions
,+2.995
 and+2.975
.+2.886
 the+2.802
 a+2.662
Neg logit contributions
lor-5.590
umm-5.590
arians-5.486
rol-5.456
ocol-5.448
Token to
Token ID284
Feature activation-1.00e+00

Pos logit contributions
 and+1.878
,+1.871
.+1.718
 the+1.632
 a+1.501
Neg logit contributions
umm-6.763
lor-6.739
ocol-6.658
rol-6.655
arians-6.615
Token his
Token ID465
Feature activation-1.00e+00

Pos logit contributions
 and+2.410
,+2.396
.+2.230
 the+2.033
 a+1.941
Neg logit contributions
umm-6.231
ocol-6.202
lor-6.165
 former-6.112
arians-6.083
Token bed
Token ID3996
Feature activation-1.00e+00

Pos logit contributions
 and+3.812
,+3.802
.+3.656
 the+3.576
 a+3.427
Neg logit contributions
umm-4.906
lor-4.777
 former-4.770
arians-4.728
ocol-4.715
Token.
Token ID13
Feature activation-1.00e+00

Pos logit contributions
 and+0.867
,+0.832
.+0.671
 the+0.670
 a+0.547
Neg logit contributions
umm-7.744
lor-7.683
ocol-7.626
rol-7.553
arians-7.534
Token Tom
Token ID4186
Feature activation-1.00e+00

Pos logit contributions
 and+2.988
,+2.978
.+2.812
 the+2.709
 a+2.569
Neg logit contributions
umm-5.733
lor-5.694
ocol-5.615
 former-5.554
rol-5.549
Token was
Token ID373
Feature activation-1.00e+00

Pos logit contributions
,+3.043
 and+3.020
.+2.924
 the+2.838
 a+2.698
Neg logit contributions
umm-5.559
lor-5.535
arians-5.435
ocol-5.418
rol-5.404
Token scared
Token ID12008
Feature activation-1.00e+00

Pos logit contributions
,+3.713
 and+3.708
.+3.552
 the+3.422
 a+3.240
Neg logit contributions
umm-4.908
lor-4.853
ocol-4.853
 former-4.815
rol-4.811
Token.
Token ID13
Feature activation-1.00e+00

Pos logit contributions
 and+0.471
,+0.461
 the+0.298
.+0.297
 a+0.196
Neg logit contributions
umm-8.137
ocol-8.007
lor-7.950
rol-7.899
arians-7.868
Token\n
Token ID198
Feature activation-1.00e+00

Pos logit contributions
 and+3.122
,+3.105
.+2.928
 the+2.834
 a+2.691
Neg logit contributions
umm-5.637
lor-5.546
ocol-5.464
rol-5.417
 former-5.417
Token\n
Token ID198
Feature activation-1.00e+00

Pos logit contributions
 and+1.361
,+1.345
.+1.158
 the+1.066
 a+0.944
Neg logit contributions
lor-7.317
umm-7.261
ocol-7.188
rol-7.159
 former-7.107
TokenTom
Token ID13787
Feature activation-1.00e+00

Pos logit contributions
 and+3.749
,+3.726
.+3.575
 the+3.482
 a+3.344
Neg logit contributions
umm-5.023
lor-5.005
ocol-4.899
arians-4.851
rol-4.849
Token went
Token ID1816
Feature activation-1.00e+00

Pos logit contributions
,+2.995
 and+2.975
.+2.886
 the+2.802
 a+2.662
Neg logit contributions
lor-5.590
umm-5.590
arians-5.486
rol-5.456
ocol-5.448
Token to
Token ID284
Feature activation-1.00e+00

Pos logit contributions
 and+1.878
,+1.871
.+1.718
 the+1.632
 a+1.501
Neg logit contributions
umm-6.763
lor-6.739
ocol-6.658
rol-6.655
arians-6.615
Token his
Token ID465
Feature activation-1.00e+00

Pos logit contributions
 and+2.410
,+2.396
.+2.230
 the+2.033
 a+1.941
Neg logit contributions
umm-6.231
ocol-6.202
lor-6.165
 former-6.112
arians-6.083
Token was
Token ID373
Feature activation-1.00e+00

Pos logit contributions
,+3.043
 and+3.020
.+2.924
 the+2.838
 a+2.698
Neg logit contributions
umm-5.559
lor-5.535
arians-5.435
ocol-5.418
rol-5.404
Token scared
Token ID12008
Feature activation-1.00e+00

Pos logit contributions
,+3.713
 and+3.708
.+3.552
 the+3.422
 a+3.240
Neg logit contributions
umm-4.908
lor-4.853
ocol-4.853
 former-4.815
rol-4.811
Token.
Token ID13
Feature activation-1.00e+00

Pos logit contributions
 and+0.471
,+0.461
 the+0.298
.+0.297
 a+0.196
Neg logit contributions
umm-8.137
ocol-8.007
lor-7.950
rol-7.899
arians-7.868
Token\n
Token ID198
Feature activation-1.00e+00

Pos logit contributions
 and+3.122
,+3.105
.+2.928
 the+2.834
 a+2.691
Neg logit contributions
umm-5.637
lor-5.546
ocol-5.464
rol-5.417
 former-5.417
Token\n
Token ID198
Feature activation-1.00e+00

Pos logit contributions
 and+1.361
,+1.345
.+1.158
 the+1.066
 a+0.944
Neg logit contributions
lor-7.317
umm-7.261
ocol-7.188
rol-7.159
 former-7.107
TokenTom
Token ID13787
Feature activation-1.00e+00

Pos logit contributions
 and+3.749
,+3.726
.+3.575
 the+3.482
 a+3.344
Neg logit contributions
umm-5.023
lor-5.005
ocol-4.899
arians-4.851
rol-4.849
Token went
Token ID1816
Feature activation-1.00e+00

Pos logit contributions
,+2.995
 and+2.975
.+2.886
 the+2.802
 a+2.662
Neg logit contributions
lor-5.590
umm-5.590
arians-5.486
rol-5.456
ocol-5.448
Token to
Token ID284
Feature activation-1.00e+00

Pos logit contributions
 and+1.878
,+1.871
.+1.718
 the+1.632
 a+1.501
Neg logit contributions
umm-6.763
lor-6.739
ocol-6.658
rol-6.655
arians-6.615
Token his
Token ID465
Feature activation-1.00e+00

Pos logit contributions
 and+2.410
,+2.396
.+2.230
 the+2.033
 a+1.941
Neg logit contributions
umm-6.231
ocol-6.202
lor-6.165
 former-6.112
arians-6.083
Token bed
Token ID3996
Feature activation-1.00e+00

Pos logit contributions
 and+3.812
,+3.802
.+3.656
 the+3.576
 a+3.427
Neg logit contributions
umm-4.906
lor-4.777
 former-4.770
arians-4.728
ocol-4.715
Token and
Token ID290
Feature activation-1.00e+00

Pos logit contributions
,+1.184
 and+1.183
 the+1.035
.+1.026
 a+0.882
Neg logit contributions
umm-7.412
lor-7.400
 former-7.258
arians-7.228
rol-7.205
Token scared
Token ID12008
Feature activation-1.00e+00

Pos logit contributions
,+3.713
 and+3.708
.+3.552
 the+3.422
 a+3.240
Neg logit contributions
umm-4.908
lor-4.853
ocol-4.853
 former-4.815
rol-4.811
Token.
Token ID13
Feature activation-1.00e+00

Pos logit contributions
 and+0.471
,+0.461
 the+0.298
.+0.297
 a+0.196
Neg logit contributions
umm-8.137
ocol-8.007
lor-7.950
rol-7.899
arians-7.868
Token\n
Token ID198
Feature activation-1.00e+00

Pos logit contributions
 and+3.122
,+3.105
.+2.928
 the+2.834
 a+2.691
Neg logit contributions
umm-5.637
lor-5.546
ocol-5.464
rol-5.417
 former-5.417
Token\n
Token ID198
Feature activation-1.00e+00

Pos logit contributions
 and+1.361
,+1.345
.+1.158
 the+1.066
 a+0.944
Neg logit contributions
lor-7.317
umm-7.261
ocol-7.188
rol-7.159
 former-7.107
TokenTom
Token ID13787
Feature activation-1.00e+00

Pos logit contributions
 and+3.749
,+3.726
.+3.575
 the+3.482
 a+3.344
Neg logit contributions
umm-5.023
lor-5.005
ocol-4.899
arians-4.851
rol-4.849
Token went
Token ID1816
Feature activation-1.00e+00

Pos logit contributions
,+2.995
 and+2.975
.+2.886
 the+2.802
 a+2.662
Neg logit contributions
lor-5.590
umm-5.590
arians-5.486
rol-5.456
ocol-5.448
Token to
Token ID284
Feature activation-1.00e+00

Pos logit contributions
 and+1.878
,+1.871
.+1.718
 the+1.632
 a+1.501
Neg logit contributions
umm-6.763
lor-6.739
ocol-6.658
rol-6.655
arians-6.615
Token his
Token ID465
Feature activation-1.00e+00

Pos logit contributions
 and+2.410
,+2.396
.+2.230
 the+2.033
 a+1.941
Neg logit contributions
umm-6.231
ocol-6.202
lor-6.165
 former-6.112
arians-6.083
Token bed
Token ID3996
Feature activation-1.00e+00

Pos logit contributions
 and+3.812
,+3.802
.+3.656
 the+3.576
 a+3.427
Neg logit contributions
umm-4.906
lor-4.777
 former-4.770
arians-4.728
ocol-4.715
Token and
Token ID290
Feature activation-1.00e+00

Pos logit contributions
,+1.184
 and+1.183
 the+1.035
.+1.026
 a+0.882
Neg logit contributions
umm-7.412
lor-7.400
 former-7.258
arians-7.228
rol-7.205
Token hid
Token ID24519
Feature activation-1.00e+00

Pos logit contributions
 and+3.637
,+3.636
.+3.497
 the+3.338
 a+3.204
Neg logit contributions
lor-4.959
umm-4.925
 former-4.863
ocol-4.854
arians-4.823
Token did
Token ID750
Feature activation-1.00e+00

Pos logit contributions
 and+3.842
,+3.829
.+3.696
 the+3.638
 a+3.490
Neg logit contributions
umm-4.808
lor-4.750
ocol-4.670
rol-4.623
arians-4.594
Token not
Token ID407
Feature activation-1.00e+00

Pos logit contributions
 and+2.292
,+2.261
.+2.112
 the+2.038
 a+1.884
Neg logit contributions
umm-6.299
lor-6.292
ocol-6.236
 former-6.195
arians-6.153
Token say
Token ID910
Feature activation-1.00e+00

Pos logit contributions
 and+3.942
,+3.925
.+3.770
 the+3.703
 a+3.549
Neg logit contributions
umm-4.772
ocol-4.675
lor-4.636
 former-4.568
arians-4.563
Token anything
Token ID1997
Feature activation-1.00e+00

Pos logit contributions
 and+4.169
,+4.077
.+3.952
 the+3.853
 a+3.711
Neg logit contributions
umm-4.531
lor-4.527
ocol-4.460
arians-4.342
rol-4.336
Token either
Token ID2035
Feature activation-1.00e+00

Pos logit contributions
 and+1.866
,+1.842
.+1.675
 the+1.644
 a+1.525
Neg logit contributions
umm-6.743
lor-6.689
ocol-6.635
arians-6.584
rol-6.567
Token.
Token ID13
Feature activation-1.00e+00

Pos logit contributions
 and+0.867
,+0.832
.+0.671
 the+0.670
 a+0.547
Neg logit contributions
umm-7.744
lor-7.683
ocol-7.626
rol-7.553
arians-7.534
Token Tom
Token ID4186
Feature activation-1.00e+00

Pos logit contributions
 and+2.988
,+2.978
.+2.812
 the+2.709
 a+2.569
Neg logit contributions
umm-5.733
lor-5.694
ocol-5.615
 former-5.554
rol-5.549
Token was
Token ID373
Feature activation-1.00e+00

Pos logit contributions
,+3.043
 and+3.020
.+2.924
 the+2.838
 a+2.698
Neg logit contributions
umm-5.559
lor-5.535
arians-5.435
ocol-5.418
rol-5.404
Token scared
Token ID12008
Feature activation-1.00e+00

Pos logit contributions
,+3.713
 and+3.708
.+3.552
 the+3.422
 a+3.240
Neg logit contributions
umm-4.908
lor-4.853
ocol-4.853
 former-4.815
rol-4.811
Token.
Token ID13
Feature activation-1.00e+00

Pos logit contributions
 and+0.471
,+0.461
 the+0.298
.+0.297
 a+0.196
Neg logit contributions
umm-8.137
ocol-8.007
lor-7.950
rol-7.899
arians-7.868
Token\n
Token ID198
Feature activation-1.00e+00

Pos logit contributions
 and+3.122
,+3.105
.+2.928
 the+2.834
 a+2.691
Neg logit contributions
umm-5.637
lor-5.546
ocol-5.464
rol-5.417
 former-5.417
Token table
Token ID3084
Feature activation-1.00e+00

Pos logit contributions
 and+4.314
,+4.307
.+4.148
 the+4.051
 a+3.901
Neg logit contributions
umm-4.433
ocol-4.322
lor-4.282
 former-4.225
arians-4.215
Token did
Token ID750
Feature activation-1.00e+00

Pos logit contributions
 and+3.842
,+3.829
.+3.696
 the+3.638
 a+3.490
Neg logit contributions
umm-4.808
lor-4.750
ocol-4.670
rol-4.623
arians-4.594
Token not
Token ID407
Feature activation-1.00e+00

Pos logit contributions
 and+2.292
,+2.261
.+2.112
 the+2.038
 a+1.884
Neg logit contributions
umm-6.299
lor-6.292
ocol-6.236
 former-6.195
arians-6.153
Token say
Token ID910
Feature activation-1.00e+00

Pos logit contributions
 and+3.942
,+3.925
.+3.770
 the+3.703
 a+3.549
Neg logit contributions
umm-4.772
ocol-4.675
lor-4.636
 former-4.568
arians-4.563
Token anything
Token ID1997
Feature activation-1.00e+00

Pos logit contributions
 and+4.169
,+4.077
.+3.952
 the+3.853
 a+3.711
Neg logit contributions
umm-4.531
lor-4.527
ocol-4.460
arians-4.342
rol-4.336
Token either
Token ID2035
Feature activation-1.00e+00

Pos logit contributions
 and+1.866
,+1.842
.+1.675
 the+1.644
 a+1.525
Neg logit contributions
umm-6.743
lor-6.689
ocol-6.635
arians-6.584
rol-6.567
Token.
Token ID13
Feature activation-1.00e+00

Pos logit contributions
 and+0.867
,+0.832
.+0.671
 the+0.670
 a+0.547
Neg logit contributions
umm-7.744
lor-7.683
ocol-7.626
rol-7.553
arians-7.534
Token Tom
Token ID4186
Feature activation-1.00e+00

Pos logit contributions
 and+2.988
,+2.978
.+2.812
 the+2.709
 a+2.569
Neg logit contributions
umm-5.733
lor-5.694
ocol-5.615
 former-5.554
rol-5.549
Token was
Token ID373
Feature activation-1.00e+00

Pos logit contributions
,+3.043
 and+3.020
.+2.924
 the+2.838
 a+2.698
Neg logit contributions
umm-5.559
lor-5.535
arians-5.435
ocol-5.418
rol-5.404
Token scared
Token ID12008
Feature activation-1.00e+00

Pos logit contributions
,+3.713
 and+3.708
.+3.552
 the+3.422
 a+3.240
Neg logit contributions
umm-4.908
lor-4.853
ocol-4.853
 former-4.815
rol-4.811
Token.
Token ID13
Feature activation-1.00e+00

Pos logit contributions
 and+0.471
,+0.461
 the+0.298
.+0.297
 a+0.196
Neg logit contributions
umm-8.137
ocol-8.007
lor-7.950
rol-7.899
arians-7.868
Token?"
Token ID1701
Feature activation-1.00e+00

Pos logit contributions
 and+2.159
,+2.143
.+2.003
 the+1.989
 a+1.835
Neg logit contributions
umm-6.482
lor-6.399
ocol-6.325
arians-6.261
 former-6.256
Token The
Token ID383
Feature activation-1.00e+00

Pos logit contributions
 and+3.191
,+3.162
.+3.021
 the+2.893
 a+2.782
Neg logit contributions
umm-5.452
lor-5.395
ocol-5.387
 former-5.297
arians-5.282
Token table
Token ID3084
Feature activation-1.00e+00

Pos logit contributions
 and+4.314
,+4.307
.+4.148
 the+4.051
 a+3.901
Neg logit contributions
umm-4.433
ocol-4.322
lor-4.282
 former-4.225
arians-4.215
Token did
Token ID750
Feature activation-1.00e+00

Pos logit contributions
 and+3.842
,+3.829
.+3.696
 the+3.638
 a+3.490
Neg logit contributions
umm-4.808
lor-4.750
ocol-4.670
rol-4.623
arians-4.594
Token not
Token ID407
Feature activation-1.00e+00

Pos logit contributions
 and+2.292
,+2.261
.+2.112
 the+2.038
 a+1.884
Neg logit contributions
umm-6.299
lor-6.292
ocol-6.236
 former-6.195
arians-6.153
Token say
Token ID910
Feature activation-1.00e+00

Pos logit contributions
 and+3.942
,+3.925
.+3.770
 the+3.703
 a+3.549
Neg logit contributions
umm-4.772
ocol-4.675
lor-4.636
 former-4.568
arians-4.563
Token anything
Token ID1997
Feature activation-1.00e+00

Pos logit contributions
 and+4.169
,+4.077
.+3.952
 the+3.853
 a+3.711
Neg logit contributions
umm-4.531
lor-4.527
ocol-4.460
arians-4.342
rol-4.336
Token either
Token ID2035
Feature activation-1.00e+00

Pos logit contributions
 and+1.866
,+1.842
.+1.675
 the+1.644
 a+1.525
Neg logit contributions
umm-6.743
lor-6.689
ocol-6.635
arians-6.584
rol-6.567
Token.
Token ID13
Feature activation-1.00e+00

Pos logit contributions
 and+0.867
,+0.832
.+0.671
 the+0.670
 a+0.547
Neg logit contributions
umm-7.744
lor-7.683
ocol-7.626
rol-7.553
arians-7.534
Token Tom
Token ID4186
Feature activation-1.00e+00

Pos logit contributions
 and+2.988
,+2.978
.+2.812
 the+2.709
 a+2.569
Neg logit contributions
umm-5.733
lor-5.694
ocol-5.615
 former-5.554
rol-5.549
Token was
Token ID373
Feature activation-1.00e+00

Pos logit contributions
,+3.043
 and+3.020
.+2.924
 the+2.838
 a+2.698
Neg logit contributions
umm-5.559
lor-5.535
arians-5.435
ocol-5.418
rol-5.404
Token The
Token ID383
Feature activation-1.00e+00

Pos logit contributions
 and+3.191
,+3.162
.+3.021
 the+2.893
 a+2.782
Neg logit contributions
umm-5.452
lor-5.395
ocol-5.387
 former-5.297
arians-5.282
Token table
Token ID3084
Feature activation-1.00e+00

Pos logit contributions
 and+4.314
,+4.307
.+4.148
 the+4.051
 a+3.901
Neg logit contributions
umm-4.433
ocol-4.322
lor-4.282
 former-4.225
arians-4.215
Token did
Token ID750
Feature activation-1.00e+00

Pos logit contributions
 and+3.842
,+3.829
.+3.696
 the+3.638
 a+3.490
Neg logit contributions
umm-4.808
lor-4.750
ocol-4.670
rol-4.623
arians-4.594
Token not
Token ID407
Feature activation-1.00e+00

Pos logit contributions
 and+2.292
,+2.261
.+2.112
 the+2.038
 a+1.884
Neg logit contributions
umm-6.299
lor-6.292
ocol-6.236
 former-6.195
arians-6.153
Token say
Token ID910
Feature activation-1.00e+00

Pos logit contributions
 and+3.942
,+3.925
.+3.770
 the+3.703
 a+3.549
Neg logit contributions
umm-4.772
ocol-4.675
lor-4.636
 former-4.568
arians-4.563
Token anything
Token ID1997
Feature activation-1.00e+00

Pos logit contributions
 and+4.169
,+4.077
.+3.952
 the+3.853
 a+3.711
Neg logit contributions
umm-4.531
lor-4.527
ocol-4.460
arians-4.342
rol-4.336
Token either
Token ID2035
Feature activation-1.00e+00

Pos logit contributions
 and+1.866
,+1.842
.+1.675
 the+1.644
 a+1.525
Neg logit contributions
umm-6.743
lor-6.689
ocol-6.635
arians-6.584
rol-6.567
Token.
Token ID13
Feature activation-1.00e+00

Pos logit contributions
 and+0.867
,+0.832
.+0.671
 the+0.670
 a+0.547
Neg logit contributions
umm-7.744
lor-7.683
ocol-7.626
rol-7.553
arians-7.534
Token Tom
Token ID4186
Feature activation-1.00e+00

Pos logit contributions
 and+2.988
,+2.978
.+2.812
 the+2.709
 a+2.569
Neg logit contributions
umm-5.733
lor-5.694
ocol-5.615
 former-5.554
rol-5.549
Token was
Token ID373
Feature activation-1.00e+00

Pos logit contributions
,+3.043
 and+3.020
.+2.924
 the+2.838
 a+2.698
Neg logit contributions
umm-5.559
lor-5.535
arians-5.435
ocol-5.418
rol-5.404
Token scared
Token ID12008
Feature activation-1.00e+00

Pos logit contributions
,+3.713
 and+3.708
.+3.552
 the+3.422
 a+3.240
Neg logit contributions
umm-4.908
lor-4.853
ocol-4.853
 former-4.815
rol-4.811