Introduction
Designed to support management of federal depository library collections, DOCUMENTS DATA MINER© is a search engine combining files from the latest version of the List of Classes of United States Government Publications available for Selection by Depository Libraries, the Item Lister’s Current Item Number Selection Profiles for Depository Libraries, and the Federal Depositories Library Directory. Available information on inactive and discontinued items has been added to the database.
Query features of the DOCUMENTS DATA MINER 2© include:
- Field-searchable List of Classes and Inactive/Discontinued items.
- Title searching by keyword or truncation, in addition to full title.
- Reports on titles by format or by status (active, inactive/discontinued, or all).
- Profiles of all Federal Depository Libraries, with multiple design features.
- Union List functions.
- Summary of selection totals by Agency for each depository.
- List of all Sudoc Stems attached to a requested Item Number.
- List of shipping lists attached to a requested Item Number.
- Searchable Shipping Lists.
- Shipping Lists attached to a requested Item Number.
- Shipping Lists and Shelf Lists filtered by depository number.
- Shelf Lists attached to a requested Item Number.
- MARC record searches and downloads.
- URL searches.
- Electronic Depository Directory.
- E-mail function to Depository Libraries.
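The keyword and truncation title searching listed above can be pictured with a short sketch. This is an illustration only, not the site's actual query logic: the function name `title_matches` and the use of a trailing `*` as the truncation symbol are assumptions made for the example.

```python
import re


def title_matches(title: str, query: str) -> bool:
    """Return True if `query` matches `title` (hypothetical matching rules).

    - A query ending in '*' is treated as a truncated term and matches any
      word in the title that starts with the stem (assumed syntax).
    - Otherwise the query matches if it equals the full title or equals one
      whole word of the title (keyword match).
    """
    q = query.strip().lower()
    words = re.findall(r"[a-z0-9]+", title.lower())

    if q.endswith("*"):
        stem = q[:-1]
        # An empty stem would match everything, so it is rejected here.
        return bool(stem) and any(w.startswith(stem) for w in words)

    if q == title.strip().lower():
        return True  # full-title match
    return q in words  # keyword match on a whole word


# Example calls with an illustrative title
print(title_matches("Census of Agriculture", "agri*"))
print(title_matches("Census of Agriculture", "census"))
```

Under these assumed rules a bare stem such as `cens` does not match unless it is written as the truncated term `cens*`.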
Use the Union List Profile feature to establish a session that profiles a group of depositories by state, by region, or by distance from any depository.
In addition, DOCUMENTS DATA MINER 2© provides the latest version of the Federal Bulletin Board edition of the List of Classes and the Item Lister’s Current Item Number Selection Profiles for Depository Libraries from the FDLP Administration site in scrubbed form for easy integration into local application programs. See the Support function for additional information.
This site is regularly refreshed with updated data from the Government Printing Office’s Federal Bulletin Board Online via GPO Access and the FDLP Administration site. The date of modification is noted in the site’s footer.
Design and Development
DOCUMENTS DATA MINER 2© has been developed through a partnership between University Libraries at Wichita State University and the University Computing and Telecommunications Services. Overall project conception and management were supplied by Nan Myers, Government Documents Librarian, and John Williams, Manager of Acquisitions.
DOCUMENTS DATA MINER 2 database schemas, query algorithms, and Web applications were developed by John Ellis of the University Computing and Telecommunications Services. Graduate projects that contributed to this effort were conducted by Baban, Kumar and Madan.
The original DOCUMENTS DATA MINER© was developed with cooperation and funding from the National Institute for Aviation Research and was in turn an outgrowth of a project undertaken to develop an in-house relational database for government documents at Ablah Library. The prototype was called GPRD, Government documents Processing Relational Database (pronounced "jeopardy"). The preliminary design of the GPRD database was accomplished through a partnership between Library faculty and staff and the faculty and graduate students of the University's Departments of Electrical Engineering, Decision Sciences and Computer Science. Development occurred between Fall 1995 and Spring 1997.
The initial prototype for the data mining function was written by Dr. Xumin Nie, Oracle Corporation.
Combining the original prototype and the model for data mining, the DOCUMENTS DATA MINER© provides an Internet-accessible relational database for the use of the documents community. An overview of the GPRD database development was presented at the 6th Annual Federal Depository Library Conference (April 14-17, 1997) in the paper titled "Managing the Depository Database."
Additional credits: The Sudoc parsing program was supplied by Margaret Mooney, University of California at Riverside. Additionally, certain title information was initially supplied by Mooney and by Paul Arrigo, Washburn University.
Design and development of DOCUMENTS DATA MINER 2© occurred from the Fall of 2000 through the Spring of 2001.
Inactive/Discontinued Information
The publication Inactive or Discontinued Items from the 1950 Revision of the Classified List (rev. 1996) is not yet available for downloading from the FDLP Administration page. Data available in this section was originally mined in March 1997 from the BDLD site maintained by Thomas G. Tyler. Data captured from the BDLD's section on "Inactive or Discontinued Items" is based on a 9-track tape of the 1989 edition and was manually revised through the 1996 edition. The annotations originated from Shipping Lists, the List of Classes, Surveys, the BDLD title List of Classes - Additions & Changes, and other Depository Library Program sources.
The DOCUMENTS DATA MINER© project began loading the List of Classes from the FDLP site in October 1997. Any item that no longer appears on the List of Classes is automatically considered non-active by the DATA MINER, and information related to that item number moves to the non-active category.
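The rule described above amounts to comparing the item numbers already in the database with those on the newly loaded List of Classes. The following is a minimal sketch of that comparison, not the DATA MINER's actual code: the function name `classify_items`, the status labels, and the sample item numbers are hypothetical, and treating every item on the current list as active is an assumption.

```python
from typing import Dict, Iterable


def classify_items(
    known_item_numbers: Iterable[str],
    current_list_of_classes: Iterable[str],
) -> Dict[str, str]:
    """Assign a status to every item number seen so far.

    An item present on the current List of Classes is labeled "active"
    (assumption); an item known to the database but absent from the
    current list is labeled "non-active".
    """
    current = set(current_list_of_classes)
    every_item = set(known_item_numbers) | current
    return {
        item: ("active" if item in current else "non-active")
        for item in every_item
    }


# Hypothetical item numbers, for illustration only
status = classify_items(
    known_item_numbers=["0001", "0002", "0003"],
    current_list_of_classes=["0001", "0003", "0004"],
)
for item in sorted(status):
    print(item, status[item])
```

In this sketch item `0002` moves to the non-active category because it is missing from the current list, while the newly listed `0004` is added as active.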
Architecture
The Web site is hosted on a Microsoft Windows 2000 Server (IIS 5.0). All servers are located in the WSU Computing Center. All relational databases are Microsoft SQL Server 2000. All Web publications use Microsoft's Active Server Pages (ASP) technology. The Data Miner utilities are written in Microsoft Visual Basic version 6.0, JavaScript, VBScript, and Transact-SQL.