
Alexandros Stamatakis, Alexander F Auch, Jan Meier-Kolthoff, and Markus Göker (2007)

AxPcoords & parallel AxParafit: statistical co-phylogenetic analyses on thousands of taxa.

BMC Bioinformatics, 8:405.

BACKGROUND: Current tools for co-phylogenetic analyses are not able to cope with the continuous accumulation of phylogenetic data. The sophisticated statistical test for host-parasite co-phylogenetic analyses implemented in Parafit does not allow it to handle large datasets in reasonable times. The Parafit and DistPCoA programs are by far the most compute-intensive components of the Parafit analysis pipeline. We present AxParafit and AxPcoords (Ax stands for Accelerated), which are highly optimized versions of Parafit and DistPCoA, respectively.

RESULTS: Both programs have been entirely rewritten in C. Through optimization of the algorithm and the C code, as well as integration of highly tuned BLAS and LAPACK methods, AxParafit runs 5-61 times faster than Parafit with a lower memory footprint (up to 35% reduction), and the performance benefit increases with growing dataset size. The MPI-based parallel implementation of AxParafit shows good scalability on up to 128 processors, even on medium-sized datasets. The parallel analysis with AxParafit on 128 CPUs for a medium-sized dataset with a 512 by 512 association matrix is more than 1,200/128 times faster per processor than the sequential Parafit run. AxPcoords is 8-26 times faster than DistPCoA and numerically stable on large datasets. We outline the substantial benefits of using parallel AxParafit by example of a large-scale empirical study on smut fungi and their host plants. To the best of our knowledge, this study represents the largest co-phylogenetic analysis to date.
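
The per-processor figure quoted in the results can be read as the overall speedup over sequential Parafit divided by the number of processors. The snippet below is a minimal sketch of that arithmetic only, assuming 1,200 as the lower bound for the overall speedup; the function name is hypothetical and is not part of AxParafit. Because the baseline is the original Parafit rather than sequential AxParafit, the ratio combines the gain from code optimization with the gain from parallelization, so it can exceed 1.

```python
def per_processor_speedup(total_speedup: float, processors: int) -> float:
    """Overall speedup over the sequential baseline, divided by the processor count."""
    if processors <= 0:
        raise ValueError("processors must be positive")
    return total_speedup / processors


# Figures taken from the abstract: more than 1,200-fold overall on 128 CPUs.
ratio = per_processor_speedup(1200, 128)
print(f"{ratio:.3f}")  # 9.375
```

Under these assumptions, each of the 128 processors does more than nine times the work rate of the single processor running the original Parafit.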

CONCLUSIONS: The highly efficient AxPcoords and AxParafit programs allow for large-scale co-phylogenetic analyses on several thousand taxa for the first time. In addition, AxParafit and AxPcoords have been integrated into the easy-to-use CopyCat tool.

 
