Home
    • Login
    View Item 
    •   TDL DSpace Home
    • Texas Conference on Digital Libraries Proceedings
    • 2010 Texas Conference on Digital Libraries
    • View Item
    •   TDL DSpace Home
    • Texas Conference on Digital Libraries Proceedings
    • 2010 Texas Conference on Digital Libraries
    • View Item
    JavaScript is disabled for your browser. Some features of this site may not work without it.

    Scanning to PDFA: Buildling a Digital Collection fo rAccess AND Preservation

    Thumbnail
    View/Open
    tcdl2010_Clement_Scanning_PDFA.pdf (1.686Mb)
    Date
    2010-05-17
    Author
    Clement, Gail
    Halling, Derek
    Burford, Nancy
    Carrigan, Esther
    Moberly, Heather
    Metadata
    Show full item record
    Abstract
    The Texas A&M University Medical Sciences Library partnered with Oklahoma State University Libraries to digitize the Index-Catalogue of Medical and Veterinary Zoology, a multilingual periodical published by the US Government Printing Office. This series is a key resource, a historical compendium of the parasitological literature of importance to researchers in re-emerging diseases and global animal health. The compilation of content began in 1892, and resulted in over 100 separate publications comprising over 20,000 pages.With generous grant support from the National Library of Medicine, the Library has digitized 67 publications as of March 10, 2010. This undertaking is intended as a demonstration project to encourage the digitization and preservation of veterinary grey literature.Conversion methods involved high resolution scanning of bound volumes and creation of archival master files in uncompressed TIFF format. Derivative versions of page image files were processed via optical character recognition (OCR) using multiple dictionaries to capture text in English, Spanish, French, German, Dutch, Greek and Russian languages. Each volume was recompiled as a single PDF file with text behind page image, and saved using the PDF/A-1b profile for archiving. Achieving PDF/A compliance was a challenge given the multiplicity of fonts required to represent the typefaces and character sets comprising this body of content. Specific solutions used to address the challenge of PDF/A compliance will be demonstrated.
    URI
    http://hdl.handle.net/123456789/66966
    Collections
    • 2010 Texas Conference on Digital Libraries

    DSpace software copyright © 2002-2016  DuraSpace
    Contact Us | Send Feedback
    TDL
    Theme by @mire NV
     

     

    Browse

    All of DSpaceCommunities & CollectionsBy Issue DateAuthorsTitlesSubjectsThis CollectionBy Issue DateAuthorsTitlesSubjects

    My Account

    Login

    DSpace software copyright © 2002-2016  DuraSpace
    Contact Us | Send Feedback
    TDL
    Theme by @mire NV