Skip to main content
Fig. 2 | BMC Bioinformatics

Fig. 2

From: PlasForest: a homology-based random forest classifier for plasmid detection in genomic datasets

Fig. 2

Chosen features and their importances in the classification process. A Schematic representation of the features extracted from contigs, including homology-based features (number of hits, maximum overlap, average overlap, median overlap, variance of overlaps, contig size) and sequence-based feature (G + C content). B Impurity-based feature importance computed with scikit-learn library for the seven features kept in the classifier

Back to article page