• Login
    View Item 
    •   SMARTech Home
    • Georgia Tech Theses and Dissertations
    • Georgia Tech Theses and Dissertations
    • View Item
    •   SMARTech Home
    • Georgia Tech Theses and Dissertations
    • Georgia Tech Theses and Dissertations
    • View Item
    JavaScript is disabled for your browser. Some features of this site may not work without it.

    Image retrieval and geolocalization with deep learning

    Thumbnail
    View/Open
    VO-DISSERTATION-2019.pdf (34.02Mb)
    Date
    2019-01-09
    Author
    Vo, Nam Ngoc
    Metadata
    Show full item record
    Abstract
    This work studies image localization task and explores image ranking/retrieval approach. Deep Learning has advanced many computer vision task including image retrieval; in addition, location tagged image data has become increasingly abundant. The first contribution is a study of image geolocalization at planet scale (Im2GPS: predicting GPS coordinate from image data) comparing 2 deep learning approaches: image classification and image retrieval. We analyze the trade off between localization accuracy at different granularity levels. Image retrieval approach has great advantage when it comes to geolocalization at fine levels (street, city) and still competitive at coarse levels (country, continent). Next, we investigate different architectures for matching and retrieving crossview images. The application is to do localization using image retrieval approach where the query images are normal streetview images, but reference images in the database are overhead viewpoint (satellite images). The third contribution is exploring state of the art Deep Metric Learning (DML) techniques in image retrieval. We first look at it in the context of fine grained image retrieval, which is much well studied in the literature, and analyze generalization performance when switching embedding layer. Lastly, we apply DML techniques to training deep networks for image retrieval and Im2GPS geolocalization task. Our experiment shows that DML trained systems outperform a classification trained system as feature extractors, result in better image retrieval and geolocalization performance.
    URI
    http://hdl.handle.net/1853/61194
    Collections
    • College of Computing Theses and Dissertations [1156]
    • Georgia Tech Theses and Dissertations [23406]
    • School of Interactive Computing Theses and Dissertations [130]

    Browse

    All of SMARTechCommunities & CollectionsDatesAuthorsTitlesSubjectsTypesThis CollectionDatesAuthorsTitlesSubjectsTypes

    My SMARTech

    Login

    Statistics

    View Usage StatisticsView Google Analytics Statistics
    facebook instagram twitter youtube
    • My Account
    • Contact us
    • Directory
    • Campus Map
    • Support/Give
    • Library Accessibility
      • About SMARTech
      • SMARTech Terms of Use
    Georgia Tech Library266 4th Street NW, Atlanta, GA 30332
    404.894.4500
    • Emergency Information
    • Legal and Privacy Information
    • Human Trafficking Notice
    • Accessibility
    • Accountability
    • Accreditation
    • Employment
    © 2020 Georgia Institute of Technology