
    Visualization of textual content from social media and online communities

    View/Open
    HU-DISSERTATION-2018.pdf (4.385Mb)
    Date
    2018-01-22
    Author
    Hu, Mengdie
    Abstract
    In this thesis, I explore design principles for interactive visualizations that facilitate analysis of large quantities of text documents from social media and online communities. I summarize characteristics of such text documents, including their huge volume, short and informal expressions, high density of repeated language patterns, high noise-to-information ratio, and the prevalence of conflicting opinions. All of these characteristics pose challenges for analyzing the data, in addition to the difficulties of processing natural language. I focus on two domains of text, consumer reviews and social media posts, and show that analytical tasks in both domains share three common steps: 1) gaining an overall impression of the dataset by learning the major topics, 2) finding interesting facets of the dataset that are worth exploring, and 3) reading the original documents to gain insights. I introduce two visualization systems that address these tasks for the two domains I study. OpinionBlocks presents a novel visualization interface for reading consumer reviews and enables crowd-correction of text analysis errors. SentenTree is a new visualization technique uniquely suited to social media text analysis: it provides the key benefits of both word-based (a variation of word cloud) and sentence-based (as represented by Word Tree) visual metaphors while overcoming some of the limitations of each.
    URI
    http://hdl.handle.net/1853/59828
    Collections
    • College of Computing Theses and Dissertations [1191]
    • Georgia Tech Theses and Dissertations [23877]
    • School of Interactive Computing Theses and Dissertations [144]
