HyDe: Hybridization Detection Using Phylogenetic Invariants¶
HyDe is a software package that detects hybridization in phylogenomic
data sets using phylogenetic invariants. The primary interface for HyDe is a Python
module called phyde (Pythonic Hybridization Detection).
phyde provides a suite of tools for performing hypothesis tests on triples of taxa
to detect hybridization. To ensure that the necessary
dependencies for phyde are available, we suggest using a Python distribution such
as Miniconda.
To facilitate analyses using the Python module, three scripts are provided to conduct hybridization detection analyses directly from the command line:
run_hyde.py: runs a standard hybridization detection analysis on all triples in all directions. Results are also filtered based on whether there is significant evidence for hybridization.individual_hyde.py: tests each individual within a putative hybrid population using a list of specified triples specified.bootstrap_hyde.py: conducts bootstrap resampling of the individuals within the putative hybrid lineages for each specified triple.
These last two scripts need to be given a three column table of triples (P1, Hybrid, P2) that you wish to test:
sp1 sp2 sp3
sp1 sp3 sp4
sp3 sp4 sp5
.
.
.
You can also use a results file from a previous analysis as a triples file.
For example, you can use the filtered results from the run_hyde.py script so that
you only run analyses on triples that have significant levels of hybridization.
If you only have a few hypotheses that you want to test, then you can also pass
a triples file to run_hyde.py and it will only test those hypotheses rather than
testing everything.
Multithreaded versions of these scripts are also available (run_hyde_mp.py,
individual_hyde_mp.py, and bootstrap_hyde_mp.py).
Make sure you have the multiprocess module installed before you use them:
pip install multiprocess.
Getting Help¶
If you have questions about running HyDe, please feel free to use the gitter chatroom to get help:
If you have a problem while running HyDe and you think it may be a bug, please consider filing an issue on GitHub:
Features¶
Conduct hypothesis tests using multiple individuals per population.
Test each individual within a putative hybrid lineage to assess if hybridization is uniform.
Test all possible triples of taxa and process results from within Python.
Bootstrap individuals within taxa to assess patterns of hybrid speciation vs. introgression.
Visualize the distributions of various quantities (Test Statistic, Hybridization Parameter, D-Statistic) using bootstrap replicates.
Calculate the D-Statistic (ABBA-BABA) using site pattern counts returned during a hypothesis test.