Using nearly full-genome HIV sequence data improves phylogeny reconstruction in a simulated epidemic. Scientific Reports, 6: 39489. doi: 10.1038/srep39489 (2016).

Publication Latest Publications

Title: Using nearly full-genome HIV sequence data improves phylogeny reconstruction in a simulated epidemic
Authors: Yebra G, Hodcroft EB1, Ragonnet-Cronin ML1, Pillay D2, Brown AJ1; PANGEA_HIV Consortium de Oliveira T; ICONIC Project..
Journal: Scientific Reports,6:39489. doi: 10.1038/srep39489 (2016)

Journal Impact Factor (I.F.): 5.578
Number of citations (Google Scholar): 3

Abstract

HIV molecular epidemiology studies analyse viral pol gene sequences due to their availability, but whole genome sequencing allows to use other genes. We aimed to determine what gene(s) provide(s) the best approximation to the real phylogeny by analysing a simulated epidemic (created as part of the PANGEA_HIV project) with a known transmission tree. We sub-sampled a simulated dataset of 4662 sequences into different combinations of genes (gag-pol-env, gag-pol, gag, pol, env and partial pol) and sampling depths (100%, 60%, 20% and 5%), generating 100 replicates for each case. We built maximum-likelihood trees for each combination using RAxML (GTR?+??), and compared their topologies to the corresponding true tree's using CompareTree. The accuracy of the trees was significantly proportional to the length of the sequences used, with the gag-pol-env datasets showing the best performance and gag and partial pol sequences showing the worst. The lowest sampling depths (20% and 5%) greatly reduced the accuracy of tree reconstruction and showed high variability among replicates, especially when using the shortest gene datasets. In conclusion, using longer sequences derived from nearly whole genomes will improve the reliability of phylogenetic reconstruction. With low sample coverage, results can be highly variable, particularly when based on short sequences.

Download: Full text paper

Citation: Yebra G, Hodcroft EB1, Ragonnet-Cronin ML1, Pillay D2, Brown AJ1; PANGEA_HIV Consortium de Oliveira T; ICONIC Project.. Using nearly full-genome HIV sequence data improves phylogeny reconstruction in a simulated epidemic Scientific Reports,6:39489. doi: 10.1038/srep39489 (2016).


Sexual partnership age pairings and risk of HIV acquisition in rural South Africa
Journal: AIDS (2017)

Incidence rate estimation, periodic testing and the limitations of the mid-point imputation approach
Journal: International Journal of Epidemiology (2017)

Mutational Correlates of Virological Failure in Individuals Receiving a WHO-Recommended Tenofovir-Containing First-Line Regimen: An International Collaboration
Journal: EBioMedicine (2017)
All publications...


KwaZulu-Natal Research Innovation and Sequencing Platform (KRISP), K-RITH Tower Building, Nelson R Mandela School of Medicine, UKZN

Contact: Prof. Tulio de Oliveira, Tel: +27 31 260 4898, Email: tuliodna@gmail.com & deoliveira@ukzn.ac.za

Page design updated 2013. Many of the pages were previously hosted at bioafrica.net.