Introducing MAGPIE (Metadata Assignment GUI Providing Ingest and Export)

dc.contributor.affiliationTexas A&M University
dc.contributor.authorWelling, William
dc.contributor.authorElmquist, Stephanie
dc.contributor.authorCreel, James
dc.contributor.authorHuff, Jeremy
dc.contributor.authorSavell, Jason
dc.contributor.authorMathew, Rincy
dc.contributor.authorHahn, Doug
dc.contributor.authorBolton, Michael
dc.date.accessioned2016-06-16T19:55:41Z
dc.date.available2016-06-16T19:55:41Z
dc.date.issued2015-05-26
dc.descriptionPresentation for the 2016 Texas Conference on Digital Libraries (TCDL).en_US
dc.description.abstractThe Libraries at Texas A&M University have curated immense output from graduate programs for many decades. With the advent of the Vireo ETD (Electronic Thesis and Dissertation) submittal system, dissertations have been submitted in digital format and made available for download from TAMU’s OAKTrust institutional repository. However, many older dissertations are only discoverable through TAMU’s Voyager based online card catalog and are publicly available to visiting researchers in print format. A current digitization effort will make available these dissertations online at OAKTrust. The tool being developed for this purpose is designated MAGPIE (Metadata Assignment GUI Providing Ingest and Export). For the dissertation use case, librarians specified that the tool should display scanned PDF files and OCR (optical character recognition) text output from a file system. The tool then presents these data to annotators (typically, student workers) to augment and amend metadata. The presentation interface reads metadata, in this case MARC records, from TAMU’s Voyager card catalog database, thereby pre-populating important fields, such as the title and author name. However, a number of other fields, such as the abstract and names of committee members, do not exist in the card catalog but are available in the document itself. The annotator can simply copy and paste these character strings from the source document into a metadata input form specifically configured for the legacy dissertation digitization and preservation project. The MAGPIE workflow allows a manager to amend, reject, or approve these metadata entries, and to push approved documents into the OAKTrust repository with a single click. The MAGPIE tool has been developed using the Weaver framework, an open source web-development front-end and web service code-base from TAMU Libraries. The web service is built on top of Spring-boot, which is a popular framework with a large and growing community with documentation and support. The front-end of the web-stack consists of AngularJS and Bootstrap. The Weaver framework offers certain advantages, such as automatic updates of document status in the browser window without a page reload. The MAGPIE tool has also been developed with future projects in mind – the importation of content is modular and customizable, as is the metadata import service, the metadata form, and the export/push functionality. We anticipate that the MAGPIE tool will find use for metadata enhancement and automatic repository deposit of newspapers, images, and other institutional collections with or without existing metadata. In this talk, we will examine the initial use case of scanned legacy dissertations, provide some background on the MAGPIE software and its development, demonstrate the functionality of the tool, and conclude with an overview of future ambitions.en_US
dc.identifier.urihttp://hdl.handle.net/2249.1/76260
dc.languageen_US
dc.sourceTexas Conference on Digital Libraries (TCDL), 2016, Austin, Texas, United States
dc.subjectmetadataen_US
dc.subjectDSpaceen_US
dc.subjectingesten_US
dc.subjectexporten_US
dc.titleIntroducing MAGPIE (Metadata Assignment GUI Providing Ingest and Export)en_US
dc.typePresentationen_US

Files

Original bundle

Now showing 1 - 2 of 2
No Thumbnail Available
Name:
MAGPIE.pptx
Size:
1.54 MB
Format:
Microsoft Powerpoint XML
Description:
Loading...
Thumbnail Image
Name:
MAGPIE.pdf
Size:
590.52 KB
Format:
Adobe Portable Document Format
Description:

License bundle

Now showing 1 - 1 of 1
No Thumbnail Available
Name:
license.txt
Size:
1.71 KB
Format:
Item-specific license agreed upon to submission
Description: