Table 1 Number of sites in the training and independent testing sets.

From: Characterization and identification of protein O-GlcNAcylation sites with substrate specificity

| Dataset | Data resource | Residue | O-GlcNAcylated sites (Positive data) | Non-O-GlcNAcylated sites (Negative data) |
|---|---|---|---|---|
| Training set | dbOGAP | Serine | 240 | 16740 |
| | | Threonine | 135 | 10079 |
| | | Ser and Thr | 375 | 26819 |
| Independent testing set | UniProtKB | Serine | 57 | 4488 |
| | | Threonine | 51 | 2978 |
| | | Ser and Thr | 108 | 7466 |
| | OGlycBase | Serine | 24 | 1013 |
| | | Threonine | 24 | 694 |
| | | Ser and Thr | 48 | 1707 |
| | PhosphoSitePlus | Serine | 779 | 58082 |
| | | Threonine | 582 | 34217 |
| | | Ser and Thr | 1361 | 92299 |
| | Non-redundant dataset | Serine | 578 | 41075 |
| | | Threonine | 470 | 23920 |
| | | Ser and Thr | 1048 | 64995 |
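
The "Ser and Thr" rows are the sums of the Serine and Threonine rows, and the negative sites heavily outnumber the positive sites in every data resource. The short Python sketch below recomputes those subtotals and the negative-to-positive ratio from the counts in Table 1. The names `TABLE_1`, `totals`, and `negatives_per_positive` are illustrative assumptions and do not come from the article; the ratio is only a descriptive summary of class imbalance, not a step of the authors' method.

```python
# Counts transcribed from Table 1 as (positive sites, negative sites).
# The dictionary layout and all names here are hypothetical, for illustration only.
TABLE_1 = {
    ("Training set", "dbOGAP"): {
        "Serine": (240, 16740),
        "Threonine": (135, 10079),
    },
    ("Independent testing set", "UniProtKB"): {
        "Serine": (57, 4488),
        "Threonine": (51, 2978),
    },
    ("Independent testing set", "OGlycBase"): {
        "Serine": (24, 1013),
        "Threonine": (24, 694),
    },
    ("Independent testing set", "PhosphoSitePlus"): {
        "Serine": (779, 58082),
        "Threonine": (582, 34217),
    },
    ("Independent testing set", "Non-redundant dataset"): {
        "Serine": (578, 41075),
        "Threonine": (470, 23920),
    },
}


def totals(by_residue):
    """Return the combined (positive, negative) counts over all residues."""
    positives = sum(pos for pos, _ in by_residue.values())
    negatives = sum(neg for _, neg in by_residue.values())
    return positives, negatives


def negatives_per_positive(positives, negatives):
    """Return how many negative sites there are for each positive site."""
    return negatives / positives


def summarize(table):
    """Print the 'Ser and Thr' subtotal and imbalance ratio for each resource."""
    for (dataset, resource), by_residue in table.items():
        pos, neg = totals(by_residue)
        ratio = negatives_per_positive(pos, neg)
        print(f"{dataset} / {resource}: {pos} positive, {neg} negative, "
              f"about {ratio:.1f} negatives per positive")
```

Calling `summarize(TABLE_1)` prints one line per data resource, which makes it easy to check the subtotals against the "Ser and Thr" rows of the table.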