Table 4 Data on the simulated datasets

From: MaxAlign: maximizing usable data in an alignment

                                            Tree 1  Tree 2  Tree 3  Pfam
Average sequence identity                   19%     30%     42%     -
Alignment length                            1080    629     597     404
Sequence length                             173     177     169     171
Original number of sequences                32      33      46      -
Average number of sequences after MaxAlign  14.1    22.6    28.8    -
Average number of indels per sequence       66.6    54.3    48.5    32
Average length of indels                    13.6    8.3     8.8     7
  1. Description of the simulated alignments used for testing the accuracy of phylogenetic inference with MaxAlign and removal of gapped columns, as well as the Pfam estimates used to tune the simulation parameters.