Fig. 4From: A statistical approach to identify, monitor, and manage incomplete curated data setsActual vs. predicted expression experiment count per gene. a) Model predicted expression experiment count for the test data set was plotted against the actual expression experiment count per gene. A strong linear correlation was observed (R2 = 0.95), indicating that the model was accurate at predicting the number of expression experiments per gene. b) A histogram of expression experiment count residuals (actual number – predicted number) showed a single mode centered close to 0. Green and red bars are counts of genes inside or outside the 95% confidence interval respectivelyBack to article page