Fig. 2From: A statistical approach to identify, monitor, and manage incomplete curated data setsDescriptive statistics for the data used in model training and testing. Descriptive statistics are shown for the three data files used as input for training and testing the model: GenePublication.txt (top), ConstructComponents.txt (middle), and MachineLearningReport.txt (bottom)Back to article page