BMC Bioinformatics

Table 7 Three selected examples of rules generated by HI^all and HI^seq. Here, # rules is the total number of rules found, # pc is the number of positive examples covered in the training data, # pnc is the number of positive examples not covered in the training data, % CovP is the percentage of positive examples covered in the training data, % CovN is the percentage of negative examples covered in the training data, # uc is the number of uncertain examples covered, and # unc is the number of uncertain examples not covered.

From: Homology Induction: the use of machine learning to improve sequence similarity searches

HI^all
PDB	# rules	# pc	# pnc	% CovP	% CovN	# uc	# unc
1CPC	2	120	0	100.00	1.00	1	2
1MPP	1	91	1	98.91	0.00	2	13
1MLA	1	17	5	77.27	0.00	4	13
HI^seq
PDB	# rules	# pc	# pnc	% CovP	% CovN	# uc	# unc
1CPC	2	89	31	74.17	1.20	1	2
1MPP	3	62	30	67.39	1.20	3	12
1MLA	1	16	6	72.73	0.20	5	12
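
The % CovP values in the table are consistent with computing coverage as # pc / (# pc + # pnc), expressed as a percentage. The following is a minimal sketch that checks this against the rows above. It assumes that definition, which is inferred from the caption and the numbers rather than stated explicitly in the source; the function name `cov_p` and the `rows` dictionary are hypothetical names introduced for illustration.

```python
def cov_p(pc: int, pnc: int) -> float:
    """Percentage of positive training examples covered, rounded to 2 decimals.

    Assumed definition: 100 * pc / (pc + pnc).
    """
    total = pc + pnc
    if total == 0:
        raise ValueError("no positive examples")
    return round(100.0 * pc / total, 2)


# (method, PDB) -> (# pc, # pnc, reported % CovP), copied from Table 7
rows = {
    ("HI^all", "1CPC"): (120, 0, 100.00),
    ("HI^all", "1MPP"): (91, 1, 98.91),
    ("HI^all", "1MLA"): (17, 5, 77.27),
    ("HI^seq", "1CPC"): (89, 31, 74.17),
    ("HI^seq", "1MPP"): (62, 30, 67.39),
    ("HI^seq", "1MLA"): (16, 6, 72.73),
}

for (method, pdb), (pc, pnc, reported) in rows.items():
    # Print the recomputed value next to the value reported in the table
    print(method, pdb, cov_p(pc, pnc), reported)
```

The same check cannot be made for % CovN, because the table does not report the number of negative examples in the training data.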


ISSN: 1471-2105
