Skip to main content

Table 6 Performance of machine learning models on E. coli sequencing dataset (threshold: total aligned bases = 80 bp)

From: MAC-ErrorReads: machine learning-assisted classifier for filtering erroneous NGS reads

Metrics

Accuracy

Precision

Recall

F1-score

k-mer size

7

9

11

13

15

7

9

11

13

15

7

9

11

13

15

7

9

11

13

15

SVM

0.81

0.67

0.88

0.99

0.99

0.81

0.67

0.89

1

1

0.99

0.99

0.99

0.99

0.99

0.89

0.80

0.93

0.99

0.99

RF

0.88

0.83

0.49

0.26

0.18

0.9

0.84

0.48

0.25

0.17

0.99

0.99

0.99

0.99

0.99

0.94

0.91

0.65

0.4

0.29

LR

0.74

0.66

0.77

 × 

 × 

0.75

0.66

0.78

 × 

 × 

0.99

0.99

0.99

 × 

 × 

0.85

0.79

0.87

 × 

 × 

NB

0.88

0.9

0.98

0.99

0.99

0.89

0.91

1

1

1

0.99

0.99

0.99

0.99

0.99

0.93

0.94

0.99

0.99

0.99

XGBoost

0.36

0.21

0.15

 × 

 × 

0.36

0.20

0.14

 × 

 × 

0.99

0.99

0.99

 × 

 × 

0.52

0.34

0.25

 × 

 ×