Skip to main content

Table 4 Additional P. falciparum dPUC predictions lead to refined protein annotations

From: Using context to improve protein domain identification

Protein ID

Standard Pfam domains

Additional dPUC domains

Suggested reannotation (this study)

PFE1240w

Radical_SAM, Wyosine_form

Flavodoxin_1

Wybutosine synthesis protein (TYW1 ortholog), putative

PFF1490w

THF_DHG_CYH_C

THF_DHG_CYH

Tetrahydrofolate dehydrogenase/cyclohydrolase (MTD1 ortholog, MIS1/ADE3 homolog without FTHFS domain), putative

MAL8P1.139

DDA1*

WD40

Regulator of (H+)-ATPase in Vacuolar membrane (RAV1 ortholog), putative

PF08_0124

CactinC_cactus

Cactin_mid

CACTIN homolog, putative

PF10_0152

 

NTP_transf_2, PAP_assoc

Non-canonical cytoplasmic specific poly(A) RNA polymerase protein (CID13 ortholog), putative

MAL13P1.170

NTP_transf_2

PAP_assoc

Non-canonical poly(A) RNA polymerase protein (PAP2/TRF5 ortholog), putative

PFI1560c

DUF21

CBS, cNMP_binding

Required for mitochondrial morphology (MAM3 ortholog), putative

PF10_0126

 

WD40

Phosphoinositide binding protein (HSV2/ATG18 ortholog), putative

PFI0510c

BRCT

IMS

DNA repair protein (REV1 ortholog), putative

MAL13P1.54

WD40

LisH

Alternative splicing regulator (SMU-1 ortholog), putative

PF14_0052

cobW

CobW_C

COBW domain-containing protein 1 (CBWD1 ortholog), putative

PF08_0012

SET, Pre-SET

YDG_SRA

Histone lysine N-methyltransferase, putative

PFE1445c

 

FG-GAP

T-cell immunomodulatory protein (human TIP homolog), putative

PFL0975w

IQ

RCC1

Unconventional myosin fused to IQ and RCC1 domains, putative

PF11_0276

Abhydro_lipase

Abhydrolase_1

Steryl ester hydrolase (TGL1/YEH1/YEH2 ortholog), putative

PF13_0190

Aha1_N

TPR_2, TPR_1

Chaperone binding protein, putative

PF11_0287

CRAL_TRIO

CRAL_TRIO_N

CRAL/TRIO protein, putative

PF11_0197

Ank

ACBP

Acyl-CoA-binding protein, putative

PF14_0647

TLD

TBC

Rab GTPase activator, putative

PFL0575w

Amino_oxidase, Thi4*

PHD

PHD finger and flavin containing amine oxidoreductase, putative

MAL13P1.246

E1-E2_ATPase

Cation_ATPase_C

E1-E2 ATPase, putative

PF11_0116

 

Nol1_Nop2_Fmu

Nol1/Nop2/Fmu-like protein, putative

MAL7P1.127

 

Pkinase

Rab GTPase activator and protein kinase, putative

PFC0425w

 

zf-C3HC4, PHD

PHD finger protein, putative

PFI0975c

 

RCC1

Regulator of chromosome condensation, putative

PFD0900w

 

RCC1

Regulator of chromosome condensation, putative

MAL7P1.132

 

Pkinase

Protein kinase, putative

PFF0810c

 

Ras

Ras GTPase, putative

PFL1990c

 

zf-CCHC, RRM_1

RNA binding protein, putative

PF07_0066

 

RRM_1

RNA binding protein, putative

PF13_0147

 

RRM_1

RNA binding protein, putative

PFF1120c

 

EGF

EGF-like membrane protein, putative

PF14_0262

WD40

TPR_1

WD40 and TPR repeats protein, putative

PFI0275w

 

WD40

WD40 repeat and EF hand protein, putative

PF10_0285

 

WD40

WD40 repeat protein, putative

PF11_0195

 

WD40

WD40 repeat protein, putative

PF14_0640

 

WD40

WD40 repeat protein, putative

MAL13P1.308

 

Arm

ARM repeat protein, putative

  1. These dPUC predictions were novel compared to the Standard Pfam, and were consistent with existing domain predictions from SMART or Superfamily (and often present in orthologs too). The number of repeats per family is not shown.
  2. dPUC predictions always contained the Standard Pfam domains, so only the additional domains are listed, except when marked with an asterisk (*; MAL8P1.139 has DDA1 in Standard Pfam but not in dPUC Pfam; PFL0575w has a Thi4 in Standard Pfam but it is replaced by another Amino_oxidase domain [belonging to the same Pfam clan] in dPUC Pfam).
  3. All proteins have the current PlasmoDB 6.0 annotation of "conserved Plasmodium protein, unknown function" except: MAL8P1.139, PFI1560c, MAL13P1.246, MAL7P1.127 "conserved Plasmodium membrane protein, unknown function"; PFE1240w, PF11_0287 "conserved protein, unknown function"; MAL13P1.170 "nucleotidyltransferase, putative"; PF08_0012 "SET domain protein, putative"; PFF1120c "conserved Apicomplexan protein, unknown function"; PF14_0262 "probable protein, unknown function".