
Table 1 SeqTrim accuracy evaluated with artificial sequences

From: SeqTrim: a high-throughput pipeline for pre-processing any type of sequence read

 

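| | Vu+I: Expected | Vu+I: Obs (enz) | Vu+I: Obs (-enz) | Vu+I+Vd: Expected | Vu+I+Vd: Obs (enz) | Vu+I+Vd: Obs (-enz) |
|---|---|---|---|---|---|---|
| Sequence size | 362 | | | 412 | | |
| Insert length | 312 | 311.71 | 311.71 | 312 | 310.7 | 311.51 |
| Insert-start point | 51 | 51.18 | 51.2 | 51 | 51.2 | 51.2 |
| Insert-end point | 362 | 361.9 | 361.91 | 362 | 360.9 | 361.71 |
| Rejected | | 3 | 0 | | 1 | 0 |
| Mistakenly processed | | 9 | 0 | | 5 | 0 |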

 

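| | Vu+I+pA: Expected | Vu+I+pA: Obs (enz) | Vu+I+pA: Obs (-enz) | Vu+I+pA+Vd: Expected | Vu+I+pA+Vd: Obs (enz) | Vu+I+pA+Vd: Obs (-enz) |
|---|---|---|---|---|---|---|
| Sequence size | 392 | | | 442 | | |
| Insert length | 312 | 311.32 | 311.33 | 312 | 310.72 | 311.33 |
| Insert-start point | 51 | 51.2 | 51.2 | 51 | 51.2 | 51.2 |
| Insert-end point | 362 | 361.52 | 361.53 | 362 | 360.92 | 361.53 |
| Rejected | | 1 | 0 | | 1 | 0 |
| Mistakenly processed | | 3 | 0 | | 5 | 0 |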

 

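| | Vu+Au+I: Expected | Vu+Au+I: Obs (enz) | Vu+Au+I: Obs (-enz) | Vu+Au+I+Ad+Vd: Expected | Vu+Au+I+Ad+Vd: Obs (enz) | Vu+Au+I+Ad+Vd: Obs (-enz) |
|---|---|---|---|---|---|---|
| Sequence size | 375 | | | 438 | | |
| Insert length | 312 | 311.9 | 311.91 | 312 | 311.9 | 311.91 |
| Insert-start point | 64 | 64 | 64 | 64 | 64 | 64 |
| Insert-end point | 375 | 374.9 | 374.91 | 375 | 374.9 | 374.91 |
| Rejected | | 2 | 0 | | 0 | 0 |
| Mistakenly processed | | 6 | 0 | | 0 | 0 |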

 

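| | Vu+Au+I+pA: Expected | Vu+Au+I+pA: Obs (enz) | Vu+Au+I+pA: Obs (-enz) | Vu+Au+I+pA+Ad+Vd: Expected | Vu+Au+I+pA+Ad+Vd: Obs (enz) | Vu+Au+I+pA+Ad+Vd: Obs (-enz) |
|---|---|---|---|---|---|---|
| Sequence size | 405 | | | 468 | | |
| Insert length | 312 | 311.53 | 311.53 | 312 | 311.53 | 311.53 |
| Insert-start point | 64 | 64 | 64 | 64 | 64 | 64 |
| Insert-end point | 375 | 374.53 | 374.53 | 375 | 374.53 | 374.53 |
| Rejected | | 0 | 0 | | 0 | 0 |
| Mistakenly processed | | 0 | 0 | | 0 | 0 |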

Vu, 50 nucleotides preceding the BamHI restriction site of pBlueScript-FL. Vd, 50 nucleotides following the HindIII restriction site of pBlueScript-FL. I, a fragment of 312 nucleotides from Pinus pinaster genomic DNA. pA, poly-A tail of 30 A's. Au, upstream 5'-adaptor, containing the sequence GATCCGTTGCTGTCGTCG. Ad, downstream 3'-adaptor, containing the sequence CGGCCGCGTCGACAAGCT. 'Expected' corresponds to theoretical mean values for each set of artificial sequences. 'Obs (enz)' are the mean values obtained using SeqTrim with the cloning restriction sites specified. 'Obs (-enz)' are the mean values obtained using SeqTrim with no cloning restriction site specified.
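
The 'Expected' values can be reproduced from the component lengths given in the footnote. The following Python sketch is illustrative only and is not part of SeqTrim. The function name `expected_values` and the `LENGTHS` table are hypothetical. The lengths of Vu, Vd, I and pA are taken from the footnote; the net contribution of 13 nucleotides per adaptor is an assumption inferred from the tabulated sequence sizes (375 - 362 = 13), since the footnote gives only the adaptor sequences. Positions are assumed to be 1-based, so the insert ends at start + length - 1.

```python
# Component lengths in nucleotides.
# Vu, Vd, I and pA come from the table footnote.
# Au and Ad (13 each) are ASSUMED net contributions, inferred from the
# tabulated sequence sizes (375 - 362 = 13); they are not stated by the authors.
LENGTHS = {"Vu": 50, "Vd": 50, "I": 312, "pA": 30, "Au": 13, "Ad": 13}


def expected_values(construct: str) -> dict:
    """Return the theoretical values for a construct such as 'Vu+Au+I+pA'."""
    parts = construct.split("+")
    # Everything to the left of the insert shifts its start position.
    upstream = sum(LENGTHS[p] for p in parts[: parts.index("I")])
    start = upstream + 1                  # 1-based position of first insert base
    end = start + LENGTHS["I"] - 1        # 1-based position of last insert base
    return {
        "sequence_size": sum(LENGTHS[p] for p in parts),
        "insert_length": LENGTHS["I"],
        "insert_start": start,
        "insert_end": end,
    }


if __name__ == "__main__":
    for name in ("Vu+I", "Vu+I+pA+Vd", "Vu+Au+I+pA+Ad+Vd"):
        print(name, expected_values(name))
```

Under these assumptions the sketch yields the tabulated sequence sizes and insert coordinates for all eight constructs, and the same start + length - 1 relation holds approximately for the observed mean values in each column.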