
Table 1 Performance of different models for the prediction of quaternary states (qs), reported as balanced accuracy and F1 scores

From: Protein language models can capture protein quaternary state

 

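|  |  | Annotation transfer, based on sequence | Annotation transfer, based on pLM | pLM MLP model, based on ESM-2 embeddings (QUEEN) | pLM MLP model, based on Protbert embeddings |
|---|---|---|---|---|---|
| No information about sequence homologs available¹ | BA² | 0.15 | 0.23 | 0.36 | 0.19 |
| No information about sequence homologs available¹ | F1³ | 0.43 | 0.54 | 0.52 | 0.41 |
| Full homology information available | BA | 0.6 | 0.67 | – | – |
| Full homology information available | F1 | 0.79 | 0.85 |  |  |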

  
¹ No sequence with > 30% sequence identity available to transfer from; ² BA: balanced accuracy; ³ F1: F1 score. Precision and Recall values are provided in Additional file 3: Table S3, and Precision-Recall and ROC curves are provided in Additional file 3: Figure S3. See Methods for definitions.
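
For readers unfamiliar with the two metrics reported in Table 1, the sketch below shows one common way to compute them for a multi-class problem such as quaternary state prediction. It is an illustration only: the paper's Methods give the authoritative definitions, and the choice of a support-weighted F1 average, the function names, and the toy labels (1, 2 and 4 standing for monomer, dimer and tetramer) are all assumptions made here, not taken from the source.

```python
from collections import Counter


def per_class_counts(y_true, y_pred):
    """Return {label: (tp, fp, fn)} for every label seen in either list."""
    labels = sorted(set(y_true) | set(y_pred))
    tp, fp, fn = Counter(), Counter(), Counter()
    for t, p in zip(y_true, y_pred):
        if t == p:
            tp[t] += 1
        else:
            fp[p] += 1  # predicted class p, but it was wrong
            fn[t] += 1  # true class t was missed
    return {c: (tp[c], fp[c], fn[c]) for c in labels}


def balanced_accuracy(y_true, y_pred):
    """Mean per-class recall over the classes that occur in y_true."""
    counts = per_class_counts(y_true, y_pred)
    recalls = [tp / (tp + fn) for tp, _, fn in counts.values() if tp + fn > 0]
    return sum(recalls) / len(recalls)


def f1_per_class(y_true, y_pred):
    """Per-class F1 = 2*TP / (2*TP + FP + FN); 0.0 if the denominator is 0."""
    counts = per_class_counts(y_true, y_pred)
    return {
        c: (2 * tp / (2 * tp + fp + fn) if (2 * tp + fp + fn) else 0.0)
        for c, (tp, fp, fn) in counts.items()
    }


def weighted_f1(y_true, y_pred):
    """Average of per-class F1, weighted by class frequency in y_true.

    ASSUMPTION: support weighting is used here for illustration; the
    source does not state in this table which averaging it applies.
    """
    support = Counter(y_true)
    scores = f1_per_class(y_true, y_pred)
    return sum(scores[c] * n for c, n in support.items()) / len(y_true)


# Hypothetical toy labels: 1 = monomer, 2 = dimer, 4 = tetramer.
y_true = [1, 1, 1, 1, 2, 2, 4, 4]
y_pred = [1, 1, 1, 2, 2, 1, 4, 2]

ba = balanced_accuracy(y_true, y_pred)
f1 = weighted_f1(y_true, y_pred)
print(f"balanced accuracy = {ba:.3f}, weighted F1 = {f1:.3f}")
```

Because balanced accuracy averages recall per class rather than per sample, a model that favors the most frequent quaternary state is penalized more heavily than it would be by plain accuracy, which may be one reason BA and F1 differ so much within a column.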