Tool for Identification of Duplicate Records Downloaded from Multiple CD-ROMS. A Case Study with SPIRS Based Databases

Authors

  • National Centre for Science Information (NCSI), Indian Institute of Science (IISc), Bangalore 560012
  • National Centre for Science Information (NCSI), Indian Institute of Science (IISc), Bangalore 560012

DOI:

https://doi.org/10.17821/srels/2001/v38i4/49018

Keywords:

Searching Multiple CD-ROM Databases, Duplicate Records, SPIRS (Silver Platter Information Retrieval System) Databases.

Abstract

As research becomes more and more interdisciplinary, literature search from CD-ROM databases is often carried out on more than one CD-ROM database. This results in retrieving duplicate records due to same literature being covered (indexed) in more than one database. The retrieval software does not identify such duplicate records. Three different programs have been written to accomplish the task of identifying the duplicate records. These programs are executed from a shell script to minimize manual intervention. The various fields that have been used (extracted) to identify the duplicate records include the article title, year, volume number, issue number and pagination. The shell script when executed prompts for input file that may contain duplicate records. The programs identify the duplicate records and write them to a new file.

Downloads

Download data is not yet available.

Metrics

Metrics Loading ...

Published

2001-12-02

How to Cite

Kavi, P. P., & Jayakant, F. (2001). Tool for Identification of Duplicate Records Downloaded from Multiple CD-ROMS. A Case Study with SPIRS Based Databases. Journal of Information and Knowledge, 38(4), 279–286. https://doi.org/10.17821/srels/2001/v38i4/49018
Received 2014-05-20
Accepted 2014-05-20
Published 2001-12-02