• Login
    View Item 
    •   SMARTech Home
    • Georgia Tech Theses and Dissertations
    • Georgia Tech Theses and Dissertations
    • View Item
    •   SMARTech Home
    • Georgia Tech Theses and Dissertations
    • Georgia Tech Theses and Dissertations
    • View Item
    JavaScript is disabled for your browser. Some features of this site may not work without it.

    New insights on the power of active learning

    Thumbnail
    View/Open
    BERLIND-DISSERTATION-2015.pdf (1.211Mb)
    Date
    2015-07-22
    Author
    Berlind, Christopher
    Metadata
    Show full item record
    Abstract
    Traditional supervised machine learning algorithms are expected to have access to a large corpus of labeled examples, but the massive amount of data available in the modern world has made unlabeled data much easier to acquire than accompanying labels. Active learning is an extension of the classical paradigm intended to lessen the expense of the labeling process by allowing the learning algorithm to intelligently choose which examples should be labeled. In this dissertation, we demonstrate that the power to make adaptive label queries has benefits beyond reducing labeling effort over passive learning. We develop and explore several novel methods for active learning that exemplify these new capabilities. Some of these methods use active learning for a non-standard purpose, such as computational speedup, structure discovery, and domain adaptation. Others successfully apply active learning in situations where prior results have given evidence of its ineffectiveness. Specifically, we first give an active algorithm for learning disjunctions that is able to overcome a computational intractability present in the semi-supervised version of the same problem. This is the first known example of the computational advantages of active learning. Next, we investigate using active learning to determine structural properties (margins) of the data-generating distribution that can further improve learning rates. This is in contrast to most active learning algorithms which either assume or ignore structure rather than seeking to identify and exploit it. We then give an active nearest neighbors algorithm for domain adaptation, the task of learning a predictor for some target domain using mostly examples from a different source domain. This is the first formal analysis of the generalization and query behavior of an active domain adaptation algorithm. Finally, we show a situation where active learning can outperform passive learning on very noisy data, circumventing prior results that active learning cannot have a significant advantage over passive learning in high-noise regimes.
    URI
    http://hdl.handle.net/1853/53948
    Collections
    • College of Computing Theses and Dissertations [1191]
    • Georgia Tech Theses and Dissertations [23877]

    Related items

    Showing items related by title, author, creator and subject.

    • USING A HANDS-ON ROBOTICS PROJECT TO AFFECT SKILL DEVELOPMENT IN A CONTROL ANALYSIS COURSE 

      Inghilleri, Niccolo (Georgia Institute of Technology, 2021-05-05)
      This study aims to assess the impact on skill development of a hands-on experimentation and learning device within the undergraduate aerospace control analysis curriculum at Georgia Institute of Technology. The Transportable ...
    • On sparse representations and new meta-learning paradigms for representation learning 

      Mehta, Nishant A. (Georgia Institute of Technology, 2013-05-15)
      Given the "right" representation, learning is easy. This thesis studies representation learning and meta-learning, with a special focus on sparse representations. Meta-learning is fundamental to machine learning, and it ...
    • Group Intelligence Audio Track 

      Parsons, Christopher J.; Salaita, Meisa K.; Hughes, Catherine H.; Lynn, David G.; Fristoe, Adam; Fristoe, Ariel; Grover, Martha A. (2017-04)
      The Group Intelligence Audio Track as supplemental to the article: "Group Intelligence: An Active Learning Exploration of Diversity in Evolution" in the Journal of Chemical Education (DOI: 10.1021/acs.jchemed.6b00518)

    Browse

    All of SMARTechCommunities & CollectionsDatesAuthorsTitlesSubjectsTypesThis CollectionDatesAuthorsTitlesSubjectsTypes

    My SMARTech

    Login

    Statistics

    View Usage StatisticsView Google Analytics Statistics
    facebook instagram twitter youtube
    • My Account
    • Contact us
    • Directory
    • Campus Map
    • Support/Give
    • Library Accessibility
      • About SMARTech
      • SMARTech Terms of Use
    Georgia Tech Library266 4th Street NW, Atlanta, GA 30332
    404.894.4500
    • Emergency Information
    • Legal and Privacy Information
    • Human Trafficking Notice
    • Accessibility
    • Accountability
    • Accreditation
    • Employment
    © 2020 Georgia Institute of Technology