Table 3 Discrimination success rates and performance using various method combinations for the dataset containing all sequences shown in Table 1.

From: PTIGS-IdIt, a system for species identification by DNA sequences of the psbA-trnH intergenic spacer region

 

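| Method | Included: Correct | Included: Wrong | Included: Ratio | Included: Time (s) | Excluded: Correct | Excluded: Wrong | Excluded: Ratio | Excluded: Time (s) |
|---|---|---|---|---|---|---|---|---|
| B | 6291 | 4846 | 0.5649 | 0.4213 | 5323 | 5814 | 0.4780 | 0.5653 |
| B+P | 7744 | 3393 | 0.6953 | 5.0552 | 6496 | 4641 | 0.5833 | 6.4200 |
| B+E | 8650 | 2487 | 0.7767 | 36.7524 | 7034 | 4103 | 0.6316 | 52.3093 |
| D | 8477 | 2660 | 0.7612 | 0.2496 | 6669 | 4468 | 0.5988 | 0.5347 |
| D+P | 8477 | 2660 | 0.7612 | 2.3828 | 6670 | 4467 | 0.5989 | 2.4413 |
| D+E | 8687 | 2450 | 0.7800 | 21.5453 | 7363 | 3774 | 0.6611 | 15.6762 |
| B+P+E | 8651 | 2486 | 0.7768 | 12.9270 | 7096 | 4041 | 0.6372 | 11.6186 |
| D+P+E | 8686 | 2451 | 0.7799 | 9.8835 | 7401 | 3736 | 0.6645 | 9.7989 |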

  1. Ratio is the number of correctly identified queries divided by the total number of tests. Time (the performance measure) is the average time, in seconds, taken to complete a query. The base methods are B: BLAST; P: P Distance; E: Edit Distance; D: DNFP. “Included” means that the query sequences are present in the reference database, while “excluded” means that the query sequences are not in the database when the analyses are performed.
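
The Ratio definition in the footnote can be checked against the counts with a minimal Python sketch. It assumes that the total number of tests equals Correct + Wrong, which the source does not state explicitly, although both counts sum to 11137 in every row. The function name `success_ratio` and the dictionary `included` are hypothetical, introduced only for illustration; the counts are copied from the query-included columns of the table.

```python
# (correct, wrong) counts copied from the query-included columns of Table 3.
included = {
    "B": (6291, 4846),
    "B+P": (7744, 3393),
    "D": (8477, 2660),
    "D+E": (8687, 2450),
}


def success_ratio(correct: int, wrong: int) -> float:
    """Return correct identifications divided by the total number of tests.

    Assumption: total tests = correct + wrong (not stated in the source).
    """
    total = correct + wrong
    if total == 0:
        raise ValueError("no tests recorded")
    return correct / total


if __name__ == "__main__":
    for method, (correct, wrong) in included.items():
        # Four decimal places, matching the precision of the Ratio column.
        print(f"{method}: {success_ratio(correct, wrong):.4f}")
```

Under that assumption, the recomputed values agree with the reported Ratio column to four decimal places for the rows shown.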