The Role of Electronic Document Delivery Services in the Creation of Digital LibrariesOlga V. Barysheva
|
Name of the element |
Identifier |
Definition |
Commentaries |
Title |
Title |
Name assigned to the resource |
The title is usually a name under which the resource is known |
Creator |
Creator |
Person(s) primarily responsible for the creation and contents of the resource |
Examples of creator include the person, organisation or service. The name of the creator usually should be used for indication of the object under description |
Subject and key words |
Subject |
Subject area determining the contents of the resource |
Subject is usually expressed with the help of key words or phrases and classification codes describing the subject coverage of the resource |
Description |
Description |
Note on the contents of the resource |
Description can include (but is not limited by): abstract, table of contents, references to graphic contents presentation or simple textual description |
Publisher |
Publisher |
Person(s) responsible for the publication of the resource |
Publisher can include person, organisation or service. The name of publisher should be used for the identification of the object under description |
Contributor |
Contributor |
Person(s) assisting in the creation of resource contents |
Contributor can include person, organisation or service. The name of contributor should be used for the identification of the object under description |
Date |
Date |
Date connected with event in life cycle of resource |
Date is usually associated with creation or availability of resource. Meaning of date, recommended for practical use at the coding, is defined in ISO 8601 and supports GGGG-MM-DD format |
Type of resource |
Type |
Characteristic or genre of the contents of the resource |
Type includes such terms as general categories, functions, genres or combined levels of the contents. For practical use it is recommended to choose value from a dictionary (e.g. DCT [6]). Element format is used for the description of physical or digital representation of the resource |
Format |
Format |
Physical or digital presentation of resource |
Format usually includes copy type (media-type) or resource size. The format can be used for the definition of technical support and software or other equipment necessary for the display or management of the resource. For practical use it is recommended to choose value from a dictionary (e.g. MIME [7]) |
Identifier of resource |
Identifier |
A unique reference to the resource within a given context |
For practical use it is recommended to identify the resource by means of a line or number corresponding to a formal identification system (URI [8], URL [9], DOI [10], ISBN [11]) |
Source |
Source |
The reference to the original source from which the resource was taken |
The resource can be taken from the original source wholly or partially. For practical use it is recommended to identify the resource by means of a line or number corresponding to a formal identification system. |
Language |
Language |
Language of the resource contents |
For practical purposes it is recommended to use the value of the element language, determined by RFC 1766, including two-lettered language codes (from the ISO 639) optionally followed by two letter country codes (taken from the ISO 3166 [12]). For example, "en " – for English, "fr" – for French, "en-uk" – for British English |
Relation |
Relation |
Reference to related resources |
For practical use it is recommended to identify the resource by means of a line or number corresponding to a formal identification system. |
Coverage |
Coverage |
Extent and limits of the resource content |
Coverage usually includes spatial location (name of area or geographical co-ordinates), time interval (time mark, date or range of dates) or jurisdiction (such as administrative division). It is recommended to choose value from a dictionary (for example, Thesaurus of the geographical names), i.e. it is more convenient to use the names of areas and periods of time instead of digital identifiers (such as systems of co-ordinates or ranges of dates) |
Legal issues |
Rights |
Rights of access limitation and resource protection |
Element rights usually contains a statement about the law governing the operation of the resource, or reference to a service providing this information. The legal information usually includes the data on intellectual property rights for, copyright and other property rights. The absence of the right element can not be the reason for any assumptions concerning the legal status of the resource. |
Each element is determined with the help of a set from 10 attributes, according to the ISO/IEC11179 [13] standard for the description of elements of the data.
It is hard to tell which metadata set is better (they are hardly comparable), which one will be in more use, and which one will be the most efficient for search. At present the Dublin Core seems to have the best future prospects, for it is applicable practically to all kinds of electronic documents and can be interpreted both by machines and humans. In addition, it is international. Moreover, the possibility and necessity of the creation of metadata profiles (Dublin Core can be easily used in this capacity) is stated in the specification of HTML language, version 4.01, recommended by W3 Consortium on December 24, 1999 [14].
In any case, electronic documents cannot be used without descriptions as they cannot then be found. Documents for delivery are supplied with descriptions, they can thus easily be included in a digital library. The main thing is that the metadata scheme within the same digital library should be identical regardless of the acquisition sources, format and place of storage.
The use of electronic documents
We will not deal with the main problems of digital library operation: economic and copyright issues. We will speak only about the interaction of documents.
The main problem is to combine the traditional and electronic documents, especially when it concerns libraries. The creation of links between documents on the basis of mutual quotation, which is already used by the Institute of Scientific Information in Philadelphia (IS) [15], enables the provision of references to the description of the traditional (paper) documents, if these exist in a machine-readable format. For the moment only the Institute of Scientific Information of Social Sciences tries to create a similar mechanism in this country. Although it could be useful to apply this concept in the creation of digital libraries, including networked electronic documents, electronic copies created by publishers, delivery services or services of scanning, description of collections of traditional libraries. Then the issue of whether traditional libraries should acquire publications and documents or information and contents will lose its urgency.
We think that libraries and document delivery services should preserve digitised documents. They only need metadata according to one of the existing schemes. On the other hand, it will not be necessary for each traditional library to create its own digital library - by uniting smaller libraries with the use of the same metadata the libraries of Russia can fully contribute to digital library collection development.
References
- British library facts and figures http://www.bl.uk/;
- FGDC standards ;
- ISO catalogue http://www.iso.ch/infoe/catinfo.html;
- Compulib http://www.citycat.ru/compulib/#Kluch;
- Dublin Core Metadata for Resource discovery http://www.ietf.org/rfc/rfc2413.txt;
- DCT - List of Resource Types: Dublin Core Draft Working Group Report. http://purl.org/DC/documents/wd-typelist.htm;
- MIME - Internet Media Types. ;
- URI, URL - Naming and Addressing: URIs, URLs, ... http://www.w3.org/Addressing/;
- URI - Uniform Resource Identifiers: Generic Syntax, Internet Draft Standard http://www.ics.uci.edu/pub/ietf/uri/rfc2396.txt;
- URL - Uniform Resource Locator Specification http://www.w3.org/Addressing/URL/Overview.html;
- DOI – The Digital Object Identifier http://www.doi.org/;
- ISBN – International Standard Book Numbering http://www.reedref.com/standards/;
- ISO 3166 2-letter country codes http://www.w3.org/International/O-misc-iso3166.html;
- ISO 11179 - Specification and Standardization of Data Elements, Parts 1-6. ftp://sdct-sunsrv1.ncsl.nist.gov/x3l8/11179/;
- HTML 4.01 Specification http://www.w3.org/TR/html4/cover.html#minitoc;
- ISI Web of Science http://www.isinet.com/products/citation/wos.html;
About the Author
Barysheva Olga Vladimirovna, candidate of philological sciences on a speciality "computer science", leading engineer of a Department of Automation of the Russian National Library.
© Olga V. Barysheva, 2000