Table 3 Datasets used for benchmarking

From: FastqPuri: high-performance preprocessing of RNA-seq data

| Dataset | Data origin | Species | Number of reads | Read length (bp) |
|---|---|---|---|---|
| Dataset 1 | this study | Homo sapiens | 51 559 773 | 100 |
| Dataset 2 | this study | Arabidopsis thaliana | 18 858 554 | 2 x 100 |
| Dataset 3 | RNA-QC-Chain [20] | Nannochloropsis oceanica | 7 045 705 | 2 x 100 |
| Dataset 4 | SRR1216135 (SRA run) | Homo sapiens | 10 908 030 | 2 x 100 |
| Dataset 5 | simulated, this study | Homo sapiens + Mus musculus | 6 034 700 | 100 |