
Table 7. Three selected examples of rules generated by HIall and HIseq, where # rules is the total number of rules found, # pc is the number of positive examples covered in the training data, # pnc is the number of positive examples not covered in the training data, % CovP is the percentage coverage of the positive examples in the training data, % CovN is the percentage coverage of the negative examples in the training data, # uc is the number of uncertain examples covered, and # unc is the number of uncertain examples not covered.

From: Homology Induction: the use of machine learning to improve sequence similarity searches

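HIall

| PDB | # rules | # pc | # pnc | % CovP | % CovN | # uc | # unc |
| --- | --- | --- | --- | --- | --- | --- | --- |
| 1CPC | 2 | 120 | 0 | 100.00 | 1.00 | 1 | 2 |
| 1MPP | 1 | 91 | 1 | 98.91 | 0.00 | 2 | 13 |
| 1MLA | 1 | 17 | 5 | 77.27 | 0.00 | 4 | 13 |

HIseq

| PDB | # rules | # pc | # pnc | % CovP | % CovN | # uc | # unc |
| --- | --- | --- | --- | --- | --- | --- | --- |
| 1CPC | 2 | 89 | 31 | 74.17 | 1.20 | 1 | 2 |
| 1MPP | 3 | 62 | 30 | 67.39 | 1.20 | 3 | 12 |
| 1MLA | 1 | 16 | 6 | 72.73 | 0.20 | 5 | 12 |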
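
The % CovP column appears to follow directly from the # pc and # pnc columns. The short Python sketch below recomputes it, assuming % CovP = 100 × # pc / (# pc + # pnc), rounded to two decimals. That formula is an assumption inferred from the caption's definitions, not one stated in the source, and the function name `coverage_percent` is hypothetical.

```python
def coverage_percent(covered: int, not_covered: int) -> float:
    """Percentage of examples covered, rounded to two decimals.

    Assumed formula: 100 * covered / (covered + not_covered).
    """
    total = covered + not_covered
    if total == 0:
        raise ValueError("no examples to compute coverage over")
    return round(100.0 * covered / total, 2)


# (# pc, # pnc, reported % CovP) copied from the table rows above.
rows = {
    ("HIall", "1CPC"): (120, 0, 100.00),
    ("HIall", "1MPP"): (91, 1, 98.91),
    ("HIall", "1MLA"): (17, 5, 77.27),
    ("HIseq", "1CPC"): (89, 31, 74.17),
    ("HIseq", "1MPP"): (62, 30, 67.39),
    ("HIseq", "1MLA"): (16, 6, 72.73),
}

for (method, pdb), (pc, pnc, reported) in rows.items():
    computed = coverage_percent(pc, pnc)
    # Flag any row where the recomputed value differs from the reported one.
    status = "ok" if abs(computed - reported) < 0.005 else "MISMATCH"
    print(f"{method} {pdb}: computed {computed:.2f}, reported {reported:.2f} ({status})")
```

Under this assumption all six rows reproduce the reported % CovP. The same check cannot be run for % CovN, because the table does not give the negative example counts.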