Skip to main content

Table 7 Number of proteins remaining in various locations, after removing redundant proteins, at cut-off 40%, 60% and 90% using program CD-HIT

From: Support Vector Machine-based method for predicting subcellular localization of mycobacterial proteins using evolutionary information and motifs

 

Sequences remaining after removal of similar sequences

CD-HIT cut-off (% identity)

Cytoplasmic (340*)

Integral-membrane (402)

Secretory (50)

Membrane-attached (60)

90

223

262

34

38

60

118

195

20

29

40

117

182

17

27

  1. * Number in bracket is total number of proteins in a location