Heinz Stockinger, Alexander F Auch, Markus Göker, Jan Meier-Kolthoff, and Alexandros Stamatakis (2009)
Large-Scale Co-Phylogenetic Analysis on the Grid
International Journal of Grid and High Performance Computing, 1(1):39-54.
Phylogenetic
data analysis represents an extremely compute-intensive area of
Bioinformatics and thus requires high-performance technologies. Another
compute- and memory-intensive problem is that of hostparasite
co-phylogenetic analysis: given two phylogenetic trees, one for the
hosts (e.g., mammals) and one for their respective parasites (e.g.,
lice) the question arises whether host and parasite trees are more
similar to each other than expected by chance alone. CopyCat is an
easy-to-use tool that allows biologists to conduct such co-phylogenetic
studies within an elaborate statistical framework based on the highly
optimized sequential and parallel A xParafit program. We have developed
enhanced versions of these tools that efficiently exploit a Grid
environment and therefore facilitate large-scale data analyses.
Furthermore, we developed a freely accessible client tool that provides
co-phylogenetic analysis capabilities. Since the computational bulk of
the problem is embarrassingly parallel, it fits well to a computational
Grid and reduces the response time of large scale analyses.