Skip to main content
Figure 2 | BMC Bioinformatics

Figure 2

From: Automating document classification for the Immune Epitope Database

Figure 2

Comparison of Naive Bayes Classifier performance in cross validation. AUCs of Naïve Bayes classifier incorporating various dimensionality reduction techniques were compared in each round of the 10-fold cross-validation side by side. Abstract: AUC of classifier trained on the raw words of abstracts. PubMed: AUC of classifier trained on raw words in abstract, MeSH heading, title, author etc. PubMed+FS: AUC of classifier trained on subset of raw words selected from abstract, MESH heading, title, author etc using combined cutoff of IG >2.00e-05 and DF >3. PubMed+FS+FE: AUC of classifier trained on a subset of feature generated from raw words in abstract, MeSH heading, title, author etc by first applying feature extraction followed by feature selection. Using combined cutoff of IG >2.00e-05 and DF >3.

Back to article page