• Login
    View Item 
    •   SMARTech Home
    • Georgia Tech Theses and Dissertations
    • Georgia Tech Theses and Dissertations
    • View Item
    •   SMARTech Home
    • Georgia Tech Theses and Dissertations
    • Georgia Tech Theses and Dissertations
    • View Item
    JavaScript is disabled for your browser. Some features of this site may not work without it.

    Genetic epidemiology algorithms for tracking drug resistance variants and genomic clustering of plasmodium species

    Thumbnail
    View/Open
    RAVISHANKAR-DISSERTATION-2019.pdf (2.665Mb)
    Date
    2019-09-03
    Author
    Ravishankar, Shashidhar
    Metadata
    Show full item record
    Abstract
    The goal of this thesis is to develop algorithms for the analysis of P. falciparum, P. brasilianum, and P. malariae. Malaria is endemic in many parts of the world, including regions of central Africa, South America, and South East Asia. There are five known species that cause malaria in humans: P. falciparum, P. vivax, P. malariae, P. ovale, and P. knowlesi. P. knowlesi is a zoonotic parasite restricted to mostly South East Asia.According to a World Health Organization (WHO) report from 2018, these five species were responsible for nearly 219 million infections, resulting in an estimated 435,000 deaths related to malaria in 2017. In this work, we highlight algorithms that can identify the similarity between Plasmodium species and detect drug-resistant P. falciparum parasites. The two specific aims in this work, describe two novel algorithms for genomic clustering and molecular surveillance from Next-generation Sequencing (NGS) data. First, we describe a consensus-based variant identification framework molecular surveillance of drug resistance in infectious disease. We highlight its utility by identifying mutations associated with drug resistance in malaria isolates. The scalability of the framework is highlighted by analyzing 8351 M. tuberculosis isolates for the genotypic prediction of drug resistance. In the second aim, we describe a k-mer based alignment-free algorithm for the estimation of similarity between isolates from raw NGS data. Using a weighted Jaccard distance, we describe an exact method for estimation of the distance between isolates from k-mer count data. The memory efficiency, scalability, and accuracy of the algorithm was demonstrated using in-silico datasets generated from genomes of 12 Plasmodium species, as well as real-world isolates from an outbreak of C. auris in Colombia. The improved accuracy and scalability offered by the methods described in this work can facilitate the use of NGS in public health.
    URI
    http://hdl.handle.net/1853/62281
    Collections
    • Georgia Tech Theses and Dissertations [23403]
    • School of Biology Theses and Dissertations [437]

    Browse

    All of SMARTechCommunities & CollectionsDatesAuthorsTitlesSubjectsTypesThis CollectionDatesAuthorsTitlesSubjectsTypes

    My SMARTech

    Login

    Statistics

    View Usage StatisticsView Google Analytics Statistics
    facebook instagram twitter youtube
    • My Account
    • Contact us
    • Directory
    • Campus Map
    • Support/Give
    • Library Accessibility
      • About SMARTech
      • SMARTech Terms of Use
    Georgia Tech Library266 4th Street NW, Atlanta, GA 30332
    404.894.4500
    • Emergency Information
    • Legal and Privacy Information
    • Human Trafficking Notice
    • Accessibility
    • Accountability
    • Accreditation
    • Employment
    © 2020 Georgia Institute of Technology