Skip to main content

Table 1 Distribution of one protein in the PPI to multiple nodes in the pathway.

From: New challenges for text mining: mapping between text and manually curated pathways

Distribution

Frequency

Ratio

1

12,718

0.0737

2

16,180

0.0937

3

31,769

0.1841

4

49,408

0.2863

5

3,403

0.0197

6

18,205

0.1055

7

3,454

0.0200

8

6,435

0.0373

9

4,797

0.0278

10

2,082

0.0121

11

2,125

0.0123

12

35

0.0002

13

262

0.0015

18

46

0.0003

>20

21,655

0.1255

  1. A single node in the PPI network tends to correspond to multiple nodes in the manually constructed pathway according to its state transitions. The distribution describes the number of nodes in the pathway that proteins in the pairs extracted from MEDLINE correspond to. The frequency, on the other hand, means how frequent each distribution is. That is, the frequency 16,180 of the distribution 2 means that protein names mapped to two nodes in the pathway occur 16,180 times in pairs extracted from MEDLINE. In the manually curated pathway a single protein in extracted pairs has 7.81 nodes on average which can be associated with it.