    Towards Automatic Analysis of Audio Recordings from Children with Autism Spectrum Disorder

    View/Open
    CAULLEY-DISSERTATION-2022.pdf (5.273Mb)
    dissertation_presentation_final.pptx (16.63Mb)
    dissertation_presentation_final.pdf (7.572Mb)
    Desmond_Caulley_defense.mp4 (323.3Mb)
    Date
    2022-05-03
    Author
    Caulley, Desmond
    Abstract
    Autism spectrum disorder (ASD) is a neurodevelopmental disorder that can negatively impact learning, behavior, and social communication and interaction. According to the CDC’s 2014 report, 1 in 59 eight-year-old children in the United States had been diagnosed with ASD. Manual analysis of recordings of children with ASD is expensive, time-consuming, and does not scale well. This dissertation addresses general approaches for automatic analysis of audio recordings of children with ASD. First, we demonstrate that representing environmental features in the i-vector space can improve the diarization of these recordings. Next, we address the diarization of audio recordings of infants and toddlers: we design a fine-tuning mechanism, applied to a time-delay neural network (TDNN), that improves classification accuracy on recordings from infants and toddlers. One metric of interest to clinicians is the child’s response rate to questions from parents. We build an interrogative utterance detector that consists of a stack of convolutional neural network (CNN) layers with a self-attention mechanism. With this architecture, we identify question segments from parents and then analyze the child’s response rate to those questions. The other vocalization metrics evaluated here are conversational turns, child utterance frequency and duration, and adult question rates.
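The vocalization metrics named in the abstract can be illustrated with a minimal sketch. It is not the dissertation's implementation. It assumes the diarizer and the question detector have already produced labeled segments. The `Segment` type, the "adult"/"child" labels, and the 3-second response window (`max_gap`) are all hypothetical choices made for illustration.

```python
from dataclasses import dataclass
from typing import List, Tuple


@dataclass
class Segment:
    """One diarized utterance (hypothetical structure, not from the dissertation)."""
    speaker: str              # assumed labels: "adult" or "child"
    start: float              # start time in seconds
    end: float                # end time in seconds
    is_question: bool = False  # assumed output of a question detector


def response_rate(segments: List[Segment], max_gap: float = 3.0) -> float:
    """Fraction of adult questions followed by a child utterance that
    starts within `max_gap` seconds after the question ends.
    The window length is an assumption, not a value from the source."""
    questions = [s for s in segments if s.speaker == "adult" and s.is_question]
    if not questions:
        return 0.0
    answered = 0
    for q in questions:
        if any(
            s.speaker == "child" and q.end <= s.start <= q.end + max_gap
            for s in segments
        ):
            answered += 1
    return answered / len(questions)


def conversational_turns(segments: List[Segment]) -> int:
    """Count speaker changes between consecutive segments in time order."""
    ordered = sorted(segments, key=lambda s: s.start)
    return sum(1 for a, b in zip(ordered, ordered[1:]) if a.speaker != b.speaker)


def child_utterance_stats(segments: List[Segment]) -> Tuple[int, float]:
    """Return (number of child utterances, mean duration in seconds)."""
    durations = [s.end - s.start for s in segments if s.speaker == "child"]
    if not durations:
        return 0, 0.0
    return len(durations), sum(durations) / len(durations)


def adult_question_rate(segments: List[Segment], total_seconds: float) -> float:
    """Adult questions per minute of recording."""
    n = sum(1 for s in segments if s.speaker == "adult" and s.is_question)
    return n / (total_seconds / 60.0) if total_seconds > 0 else 0.0
```

In practice, the response window and the definition of a conversational turn would follow whatever the clinicians specify. The fixed values here only make the sketch concrete.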
    URI
    http://hdl.handle.net/1853/66397
    Collections
    • Georgia Tech Theses and Dissertations [23877]
    • School of Electrical and Computer Engineering Theses and Dissertations [3381]