Skip to main content

Table 1 Description of the training and test series considered with number of samples/outliers

From: Integration of RNA-Seq data with heterogeneous microarray data for breast cancer profiling

TRAINING SERIES

Series

Platform

Technology

Quality samples

Excluded outliers

Samples origin

GSE52712

Affymetrix

Microarray

19

1

Manchester (UK)

GSE40987

Affymetrix

Microarray

10

0

Boston (USA)

GSE52262

Affymetrix

Microarray

16

0

Houston (USA)

GSE12790

Affymetrix

Microarray

20

1

San Francisco (USA)

GSE46834

Illumina

Microarray

8

0

New York (USA)

GSE68651

Illumina

Microarray

35

1

Southampton (UK)

GSE74251

Illumina

RNA-Seq

12

0

Philadelphia (USA)

GSE74377

Illumina

RNA-Seq

12

0

Iowa (USA)

TOTAL

Integrated

 

132

3

 

TEST SERIES

Series

Platform

Technology

Quality samples

Excluded outliers

Samples origin

GSE78011

Illumina

RNA-Seq

3

0

Louisville (USA)

GSE81593

Illumina

RNA-Seq

3

0

New York (USA)

GSE75292

Illumina

Microarray

6

1

Goyang (South Korea)

GSE29327

Affymetrix

Microarray

6

0

South San Francisco (USA)

GSE30931

Illumina

Microarray

12

0

Goettingen (Germany)

GSE48398

Illumina

Microarray

36

0

Texas (USA)

GSE35928

Affymetrix

Microarray

6

0

Piscataway (USA)

GSE57339

Illumina

Microarray

12

0

New Haven (USA)

GSE45715

Illumina

Microarray

42

0

Miami (USA)

TOTAL

Integrated

 

126

1