Figure 3From: Clustering protein environments for function prediction: finding PROSITE motifs in 3DDistribution of cluster sizes. The number of residues in each cluster ranges from as few as 2 to as many as 6,731. The mean and median sizes are 437.2 and 232, respectively, and the standard deviation is 589.8. As discussed in the text, the long tail may represent internal hydrophobic environments.Back to article page