An Architecture for Efficient Resource Discovery with Metadata Harvesting in a Multidisciplinary Distributed Repository

To add a paper, Login.

The profusion of non-relevant information for a given query on the Web explains the pressing need for formulating ebullient strategies for pertinent Web resource discovery and retrieval. One of the major requirements for effective document retrieval is its diligently encoded metadata. At the same time metadata standards to be followed for annotating documents from large collections are pretty complex. This is because the standardized global metadata cannot represent all the elusive forms of document metadata for improved retrieval ranking. In this context, we propose an approach to facilitate document retrieval from multidisciplinary domains where each belonging to discrete domains would be indexed in a segregated instance of a repository. This would facilitate document metadata customization for each specific discipline by adding specific metadata themes. Since the approach retains the standard metadata schema in addition to the customized metadata schema, it would result in enhanced resource discovery. The metadata retrieval process will be supported by an extended protocol for metadata harvesting (X-PMH) [1] and will be implemented in each repository. The extended metadata harvesting approach has been used to tie together the metadata customization components made at various repository instances. The proposed framework could be integrated into Open Digital Libraries (ODLs) [2] and shall serve as an intrinsic model that adds value in the context of multidisciplinary metadata simplicity, maintenance, and descriptive metadata availability in the event of repository instance failures. Our approach is to implement this cost-effective architecture using the PKP-OAI (Public Knowledge Project – Open Archive Initiative) [3,4] harvester on DSpace [5], an open source digital repository platform that supports metadata harvesting in its innate form. Once this is fully achieved, a federated search build upon such repository instances using open source technologies [6] would yield promising results in the context of information retrieval.

Keywords: Metadata Harvesting, Federated Searching, Information Retrieval, Multidisciplinary Distributed Repository, Public Knowledge Project, Open Archives Initiative, Extended Protocol for Metadata Harvesting (XPMH), Open Digital Libraries
Stream: Technologies
Presentation Type: Virtual Presentation in English
Paper: , Architecture for Efficient Resource Discovery with Metadata Harvesting in a Multidisciplinary Distributed Repository, An

Jayan C Kurian

Research Staff, Wee Kim Wee School of Communication & Information, Nanyang Technological University
Singapore, Singapore, Singapore

Dr. M.G. Sreekumar

Visiting Professor, Department of Information Science
Faculty of Computer Science and Information Technology, University of Malaya

Kuala Lumpur, Kuala Lumpur, Malaysia

Dr. Dion Hoe-Lian Goh

Associate Professor, Wee Kim Wee School of Communication & Information, Nanyang Technological University
Singapore, Singapore, Singapore

Dr. Diljit Singh

Associate Professor, Faculty of Computer Science and Information Technology, University of Malaya
Kulala Lumpur, Kulala Lumpur, Malaysia

Dr. Abrizah Abdullah

Associate Professor, Faculty of Computer Science and Information Technology, University of Malaya, Malaysia
Kuala Lumpur, Kuala Lumpur, Malaysia

Joy Lynn Wheeler

Librarian, Library Technology & Systems Division, Nanyang Technological University

Joy Wheeler has worked at the NTU Library in the Library Technology Systems Division for the past 2 years. She splits her time as a subject librarian for the Art Design & Media Library.

Ref: M08P0429