• Login
    View Item 
    •   SMARTech Home
    • Georgia Tech Library
    • Georgia Tech Library Sponsored Conferences
    • 4th International Conference on Open Repositories
    • View Item
    •   SMARTech Home
    • Georgia Tech Library
    • Georgia Tech Library Sponsored Conferences
    • 4th International Conference on Open Repositories
    • View Item
    JavaScript is disabled for your browser. Some features of this site may not work without it.

    High-Throughput Workflow for Computer-Assisted Human Parsing of Biological Specimen Label Data

    Thumbnail
    View/Open
    176-669-1-PB.docx (15.61Kb)
    176-670-1-PB.pdf (29.94Kb)
    Date
    2009-05
    Author
    Amin, Aliasgar
    Arsiwala, Zainab
    Best, Jason
    Huang, Jane Q.
    McCotter, Melody
    Moen, William E.
    Neill, Amanda
    Metadata
    Show full item record
    Abstract
    Hundreds of thousands of specimens in herbaria and natural history museums worldwide are potential candidates for digitization, making them more accessible to researchers. An herbarium contains collections of preserved plant specimens created for scientific use. Herbarium specimens are ideal natural history objects for digitization, as the plants are pressed flat and dried, and mounted on individual sheets of paper, creating a nearly two-dimensional object. Building digital repositories of herbarium specimens can increase use and exposure of the collections while simultaneously reducing physical handling. As important as the digitized specimens are, the data contained on the associated specimen labels provide critical information about each specimen (e.g., scientific name, geographic location of specimen, etc.). The volume and heterogeneity of these printed label data present challenges in transforming them into meaningful digital form to support research. The Apiary Project is addressing these challenges by exploring and developing transformation processes in a systematic workflow that yields high-quality machine-processable label data in a cost- and time-efficient manner. The University of North Texas's Texas Center for Digital Knowledge (TxCDK) and the Botanical Research Institute of Texas (BRIT), with funding from an Institute of Museum and Library Services National Leadership Grant, are conducting fundamental research with the goal of identifying how human intelligence can be combined with machine processes for effective and efficient transformation of specimen label information. The results of this research will yield a new workflow model for effective and efficient label data transformation, correction, and enhancement.
    URI
    http://hdl.handle.net/1853/28412
    Collections
    • 4th International Conference on Open Repositories [135]
    • 4th International Conference on Open Repositories (4th - Atlanta - 2009) [132]

    Browse

    All of SMARTechCommunities & CollectionsDatesAuthorsTitlesSubjectsTypesThis CollectionDatesAuthorsTitlesSubjectsTypes

    My SMARTech

    Login

    Statistics

    View Usage StatisticsView Google Analytics Statistics
    facebook instagram twitter youtube
    • My Account
    • Contact us
    • Directory
    • Campus Map
    • Support/Give
    • Library Accessibility
      • About SMARTech
      • SMARTech Terms of Use
    Georgia Tech Library266 4th Street NW, Atlanta, GA 30332
    404.894.4500
    • Emergency Information
    • Legal and Privacy Information
    • Human Trafficking Notice
    • Accessibility
    • Accountability
    • Accreditation
    • Employment
    © 2020 Georgia Institute of Technology