Skip to main content
Fig. 5 | BMC Bioinformatics

Fig. 5

From: A statistical approach to identify, monitor, and manage incomplete curated data sets

Fig. 5

Quantification of model results. A confusion matrix of results from manual evaluation of model predictions for genes having negative (a) or positive (c) residuals. Columns show model predictions, rows show actual data status after manual validation. The actual expression experiment count was plotted against the predicted number of missing expression experiments per gene for each of the manually validated genes around the lower (b) and upper (d) 95% CIs. The horizontal line indicates the 95% confidence interval set at two times the root mean square error of the model. Genes above that line are predicted to be missing gene expression annotation, while genes below the line are predicted to not be missing gene expression annotations. Red dots are genes that were confirmed to be missing gene expression annotations. Green dots are genes that were confirmed to not be missing gene expression annotations

Back to article page