User-Configurable Discovery Across Collections

Bolton, Michael; Creel, James; Day, Kevin; Hahn, Doug; Huff, Jeremy; Laddusaw, Ryan; Savell, Jason; Welling, William

User-Configurable Discovery Across Collections

dc.contributor.author	Bolton, Michael
dc.contributor.author	Creel, James
dc.contributor.author	Day, Kevin
dc.contributor.author	Hahn, Doug
dc.contributor.author	Huff, Jeremy
dc.contributor.author	Laddusaw, Ryan
dc.contributor.author	Savell, Jason
dc.contributor.author	Welling, William
dc.date.accessioned	2019-07-10T22:19:03Z
dc.date.available	2019-07-10T22:19:03Z
dc.date.issued	2019-05-22
dc.description	Presented by Texas A&M University, 2C \| Technology & Tools, at TCDL 2019.	en_US
dc.description.abstract	A persistent challenge facing the digital library community is how to provide a single discovery interface for a large set of heterogeneous digital collections. To address this challenge, the development team at Texas A&M University Libraries has been actively developing a new open-source application called SAGE (Search Aggregation Engine) available at https://github.com/TAMULib/sage. SAGE functions to combine any number of Solr indices, crosswalk the fields, and generate one (or more) aggregated indices. SAGE is a Java web service with a few abstractions to accomplish the aggregation task. The Java model performs aggregation by way of “Jobs”, which themselves consist of “Readers” and “Writers”. Each of these Java entities is configurable through a browser-based user interface (UI). Each UI-configured Reader brings in a Solr core, a customizable Solr query to filter the core for desired documents, and a configurable mapping from the core's schema to SAGE's internal metadata representation. In the complementary role, Writers map from SAGE's internal metadata representation to a destination Solr schema. A Job can consist of any number of Readers and Writers. Jobs can be triggered through the UI, an API call, or periodic scheduling. When a Job runs, it reads from all of its associated Readers, combines the results by mapping to its internal metadata representation, then writes the result set using each Writer to convert the internal representation to the proper schema for that Writer's associated Solr core. The result is one or more Solr cores, each containing the filtered, crosswalked contents of the originating Solr cores. In addition to these aggregation features, preliminary work has yielded excellent prototyping for dynamic creation of “Discovery View” landing pages. The “Discovery View” feature set can be utilized via the UI or API, and enables dynamic creation of a custom UI for any given Solr core. The administrator may select what fields will be exposed to users as searchable, facetable, and displayable in result metadata. We are unaware of any other existing solution providing UI-based creation of discovery interfaces to arbitrary Solr indices. The plug-and-play styled arrangement of these features combined with SAGE's interface driven architecture provides flexibility and opens the door to future enhancements. One possibility could be drop-in processors for performing transform operations on the aggregated results before writing. Also, the application invites the enticing possibility of Reading/Writing from/to non-Solr sources, such as MARC. SAGE’s ability to combine indices and expose these indices through the UI as Discovery Views entails a significant advancement on existing discovery solutions."	en_US
dc.identifier.uri	https://hdl.handle.net/2249.1/156404
dc.language.iso	en_US	en_US
dc.publisher	Texas Digital Library	en_US
dc.title	User-Configurable Discovery Across Collections	en_US
dc.type	Presentation	en_US

Files

Original bundle

Now showing 1 - 1 of 1

Name:: tcdl-2019-SAGE.pptx
Size:: 1.63 MB
Format:: Microsoft Powerpoint XML
Description:: User-Configurable, Texas A&M University ppt

Download

License bundle

Now showing 1 - 1 of 1

Name:: license.txt
Size:: 1.71 KB
Format:: Item-specific license agreed upon to submission
Description:

Download

Collections

2019 Texas Conference on Digital Libraries