Table 1 Statistics of data sets

From: A simplified approach to disulfide connectivity prediction from protein sequences

Data set     # chains   All     None    Mix
PDBselect    1,589      488     1,051   50
SPX-         2,547      1,650   757     140

  1. Statistics for the PDBselect and SPX- data sets. The three types of chains are defined as follows. All: every cysteine is an intra-chain bonded half-cystine. None: every cysteine is free, metal-bound, or inter-chain bonded. Mix: both kinds of cysteine are present in the chain.
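
The chain types defined above can be read as a simple classification rule over the bonding state of each cysteine in a chain. The following Python sketch illustrates that rule; it is not taken from the paper. The names `CysState` and `classify_chain` are hypothetical, and raising an error for a chain without cysteines is an assumption, since the source does not say how such chains are handled.

```python
from enum import Enum


class CysState(Enum):
    """Bonding state of one cysteine (hypothetical labels for illustration)."""
    INTRA_BONDED = "intra-chain bonded"
    FREE = "free"
    METAL_BOUND = "metal bound"
    INTER_BONDED = "inter-chain bonded"


def classify_chain(states):
    """Return 'All', 'None', or 'Mix' following the table footnote."""
    states = list(states)
    if not states:
        # Assumption: chains without cysteines are outside the table's scope.
        raise ValueError("chain has no cysteines")
    intra = [s is CysState.INTRA_BONDED for s in states]
    if all(intra):
        return "All"   # every cysteine is an intra-chain bonded half-cystine
    if not any(intra):
        return "None"  # every cysteine is free, metal bound, or inter-chain bonded
    return "Mix"       # both kinds occur in the same chain
```

For example, `classify_chain([CysState.INTRA_BONDED, CysState.FREE])` falls into the Mix category, because the chain contains one cysteine of each kind.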