General Metadata Resources
Sample records provided by metadata standards or guidelines
Note: Metadata standards usually provide the best practice guides in their specifications. The examples provided focus on the contents rather than the formats. It ensures that metadata elements, structure, and value spaces specified by a standard are correctly understood and controlled when applied to real situations. It is not their intention to provide format guidelines as to how a record should be encoded or displayed.
- Dublin Core
http://dublincore.org/documents/dc-citation-guidelines/- See examples provided in this Guidelines for Encoding Bibliographic Citation Information in Dublin Core Metadata.
- CDWA (Categories for the Description of Works of Art), Getty
http://www.getty.edu/research/conducting_research/standards/cdwa/examples.htm- See over 100 examples under “Catalog Examples”.
- VRA Core 3.0 (Visual Resource Association (VRA) Core Categories)
http://www.vraweb.org/projects/vracore3/examples.html
- The last section of the standard included several examples.
Pay attention to the data sets in Example 3 that describe a chair documented by a photograph. The photograph was later copied to a slide format and scanned to create a digital image.
- The last section of the standard included several examples.
- VRA Core 4.0 Beta (Visual Resource Association (VRA) Core Categories)
http://gort.ucsd.edu/escowles/vracore4/- Metadata records for 13 works are provided. For each work, there is a html display record, a full VRA record, and a record encoded with XML.
- Cataloguing Culture Objects (CCO) (website: http://www.vraweb.org/ccoweb/cco/index.html)
- Book Appendix A: Examples. 178 examples are available
- Cataloguing examples: http://www.vraweb.org/ccoweb/cco/examplesindex.html
- DLESE ADN (Digital Library for Earth System Education)
- ADL (ADEPT/DLESE/NASA) metadata framework v0.6.50 (XML records)
http://www.dlese.org/Metadata/adn-item/0.6.50/samples/index.htm
- ADL (ADEPT/DLESE/NASA) metadata framework v0.6.50 (XML records)
Metadata types and functions
NISO’s definitions
There are three main types of metadata:
• Descriptive metadata describes a resource for purposes such as discovery and identification. It can include elements such as title, abstract, author, and keywords.
• Structural metadata indicates how compound objects are put together, for example, how pages are ordered to form chapters.
• Administrative metadata provides information to help manage a resource, such as when and how it was created, file type and other technical information, and who can access it. There are several subsets of administrative data; two that are sometimes listed as separate metadata types are:
? Rights management metadata, which deals with intellectual property rights,
and
? Preservation metadata, which contains information needed to archive and preserve a resource.
Source:
NISO. (2004) Understanding Metadata.
Bethesda, MD: NISO Press, p.1
Metadata functions
- Resource discovery
- Allowing resources to be found by relevant criteria;
- Identifying resources;
- Bringing similar resources together;
- Distinguishing dissimilar resources;
- Giving location information.
- Organizing e-resources
- Organizing links to resources based on audience or topic.
- Building these pages dynamically from metadata stored in databases.
- Facilitating interoperability
- Using defined metadata schemes, shared transfer protocols, and crosswalks between schemes, resources across the network can be searched more seamlessly.
- Cross-system search, e.g., using Z39.50 protocol;
- Metadata harvesting, e.g., OAI protocol.
- Using defined metadata schemes, shared transfer protocols, and crosswalks between schemes, resources across the network can be searched more seamlessly.
- Digital identification
- Elements for standard numbers, e.g., ISBN
- The location of a digital object may also be given using:
- a file name
- a URL
- some persistent identifiers, e.g., PURL (Persistent URL); DOI (Digital Object Identifier)
- Combined metadata to act as a set of identifying data, differentiating one object from another for validation purposes.
- Archiving and preservation
- Challenges:
- Digital information is fragile and can be corrupted or altered;
- It may become unusable as storage technologies change.
- Metadata is key to ensuring that resources will survive and continue to be accessible into the future. Archiving and preservation require special elements:
- to track the lineage of a digital object,
- to detail its physical characteristics, and
- to document its behavior in order to emulate it in future technologies.
- Challenges:
Source:
NISO. (2004) Understanding Metadata.
Bethesda, MD: NISO Press, pp.1-2.
Metadata standards — a selected list
- Metadata schemas (also called schemes) generally
specify names of elements and their semantics. - Optionally, they may specify:
- rules for how content must be formulated (for example, how to identify the main title),
- representation rules for content (for example, capitalization rules), and
- allowable content values (for example, terms must be used from a specified controlled vocabulary).
- Many metadata schemas are being developed in a variety of user environments and disciplines.
- Some of the most common ones are listed on this page.
- An expanded list of schemes and element sets is provided on the following page.
Information Resource Description
- DUBLIN CORE
http://dublincore.org/- Dublin Core Metadata Element Set: http://dublincore.org/documents/dces/
- DCMI Metadata Terms:
http://dublincore.org/documents/dcmi-terms/- Including:
- The Dublin Core Metadata Element Set
- Other Elements and Element Refinements http://dublincore.org/documents/dcmi-terms/#H3
- Encoding Schemes
- The DCMI Type Vocabulary
The Dublin Core Metadata Element Set is a standard for cross-domain information resource description. It is now a U.S. national and international standard.
- MARC (MAchine-Readable Cataloging)
http://lcweb.loc.gov/marc/
MARC provides the mechanism by which computers exchange, use, and interpret bibliographic information, and its data elements make up the foundation of most library catalogs used today. MARC became USMARC in the 1980s and MARC 21 in the late 1990s.
MODS (Metadata Object Description Schema)
http://www.loc.gov/standards/mods/
MODS includes a subset of MARC fields and uses language-based tags rather than numeric ones, in some cases regrouping elements from the MARC 21 bibliographic format. MODS is expressed using the XML schema language of the World Wide Web Consortium.Cultural Objects and Visual Resources
- Categories for the Description of Works of Art (CDWA)
http://www.getty.edu/research/conducting_research/standards/cdwa/index.html A standard for describing works of art, architecture, groups of objects, and visual and textual surrogates. - VRA Core Categories version 3.0
http://www.vraweb.org/projects/vracore3/
For creating records to describe works of visual culture as well as the images that document them. - VRA Core Categories version 4.0
http://www.vraweb.org/projects/vracore4/index.html
Educational-purpose
- Learning Object Metadata (LOM)
http://ltsc.ieee.org/wg12/index.html
Schema: IEEE 1484.12.1-2002, 15 July 2002.
http://ltsc.ieee.org/wg12/files/
LOM_1484_12_1_v1_Final_Draft.pdf (PDF file)
Focuses on the minimal set of attributes needed to allow learning objects to be managed, located, and evaluated. Learning Objects are defined here as any entity, digital or non-digital, which can be used, re-used or referenced during technology supported learning.
Archives and Preservation
- EAD (Encoded Archival Description) DTD
http://lcweb.loc.gov/ead/
A standard for encoding archival finding aids using Extensible Markup Language (XML). - PREMIS (Preservation Metadata: Implementation Strategies)
http://www.oclc.org/research/projects/pmwg/
Data Dictionary for Preservation Metadata
http://www.oclc.org/research/projects/pmwg/premis-final.pdf
Defines “core” set of preservation metadata elements, with supporting data dictionary, applicable to a broad range of digital preservation activities
E-Commerce
- The INDECS project
http://www.indecs.org/
The <indecs> metadata framework: Principles, model and data dictionary
http://www.indecs.org/pdf/framework.pdf
Created to address the need, in the digital environment, to put different creation identifiers and their supporting metadata into a framework where they could operate side by side, especially to support the management of intellectual property rights. The main focus of <indecs> is the use of what is commonly (if imprecisely) called content or intellectual property. - ONIX (Online Information Exchange)
http://www.editeur.org/onix.html
Built on the <indecs> Framework, developed and maintained by EDItEUR jointly with book industries.
The ONIX for Books Product Information Message is the international standard for representing and communicating book industry product information in electronic form. It has elements to record a wide range of evaluative and promotional information as well as basic bibliographic and trade data.
Agents
- The Friend of a Friend (FOAF)
http://xmlns.com/foaf/0.1/- the foaf project homepage: http://www.foaf-project.org/
The FOAF project is based around the use of machine readable Web homepages for people, groups, companies, etc. The “FOAF vocabulary” provides a collection of basic terms that can be used in these Web pages.
- the foaf project homepage: http://www.foaf-project.org/
- RDF-vCARD
Representing vCard Objects in RDF/XML
http://www.w3.org/TR/vcard-rdf- This note specifies a Resource Description Framework (RDF) expression that corresponds to the vCard electronic business card profile defined by RFC 2426 [VCARD].
Geospatial Data
- Content Standards for Digital Geospatial Metadata (CSDGM)
http://www.fgdc.gov/metadata/csdgm/index_html
A standard for supporting the collection and processing of geospatial metadata. It is intended to be useable by all levels of government and the private sector.
Media-Specific
- MPEG-4 and MPEG-7 for Audio and Video (ISO/IEC standards developed by MPEG (Moving Picture Experts Group))
- MPEG-4
A standard for multimedia of the fixed and mobile web.
Overview: http://www.chiariglione.org/mpeg/standards/mpeg-4/mpeg-4.htm - MPEG-7
A standard for description and search of audio and visual content.
= Multimedia Content Description Interface
Overview: http://www.chiariglione.org/mpeg/standards/mpeg-7/mpeg-7.htm
Metadata record creation and tools
Review of metadata records
Using templates and other tools
- Dublin Core
- DC-Dot’s Dublin Core metadata editor http://www.ukoln.ac.uk/metadata/dcdot/
(Note: Submit any webpage’s URL and get a suggested metadata record in XHTML format, then use the template to edit the record. Different output formats are available.)
Suggestion: Use the following sources to create metadata records using the above two tools.
- 1. A single-file journal article:
Gregory Crane. Georeferencing in Historical Collections. D-Lib Magazine, May 2004, Volume 10 Number 5. URL:http://www.dlib.org/dlib/may04/crane/05crane.html - 2. A multi-file online tutorial:
Moving Theory into Practice: Digital Imaging Tutorial. Prepared by Cornell University Library/ Research Department
http://www.library.cornell.edu/preservation/
tutorial/contents.html - 3. A dynamic content website:
Explore Mars Now
http://www.exploremarsnow.org - 4. A dynamic content website that provides dynamic information on the screen, based on the search or browse queries:
National Geographic–Expeditions–Lessons http://www.nationalgeographic.com/xpeditions/lessons/01/gk2/marsmobiles.html - 5. A webpage that contains multiple contributors:
The Math Forum: Ask Dr. Math.
http://mathforum.org/library/drmath/view/56672.html
- DC-Dot’s Dublin Core metadata editor http://www.ukoln.ac.uk/metadata/dcdot/
Other tools for metadata creation
- Dublin Core tools
http://dublincore.org/tools
- Utilities
- Creating Metadata (Templates)
- Tools for the Creation/Change of Templates
- Automatic Extraction/Gathering of Metadata
- Automatic Production of Metadata
- Conversion Between Metadata Formats
- Integrated (Tool) Environments
- Commercially Available Software
- MPEG-4
- Learning Object Metadata Editors
http://www.cancore.ca/editors.html
Use this tool online: http://demo.licef.teluq.uquebec.ca/eRIB/ - FGDC Metadata Tools
http://www.fgdc.gov/metadata/geospatial-metadata-tools
A list of metadata creation tools and metadata processing software. Each tool makes use of the Federal Geographic Data Committee’s (FGDC) Content Standards for Digital Geospatial Metadata and may support the Biological Data Profile. - OAI-Specific Tools
http://www.openarchives.org/tools/tools.html
A list of links to the tools implemented by members of the Open Archives Initiative community. - RDF Editors and Tools
http://www.ilrt.bris.ac.uk/discovery/rdf/
resources/#sec-tools
An annotated list with links to the tools. - TEI Software
http://www.tei-c.org/Software/index.html
A list of links to software for creating, managing, and processing TEI documents in SGML or XML. - Customized Templates for EAD-Encoded Finding Aids
http://www.cdlib.org/inside/projects/oac/toolkit/templates/
Templates for OAC (The Online Archive of California) Project participants to generate EAD version 1.0 markup. - Metadata Software Tools
http://www2.sub.uni-goettingen.de/cgi-bin/ssgfi/anzeige.pl?db=meta&sc=T
A registry of 10+ non-format-specific metadata software tools.
- Examples from VRA Core 3.0:
- CULTURE
Qualifiers: None
Description: The name of the culture, people (ethnonym), or adjectival form of a country name from which a Work or Image originates or with which the Work or Image has been associated.
Data Values :Â recommend AAT, LCSH - CREATOR
Qualifiers
Creator.Role
Creator.Attribution
Creator.Personal name
Creator.Corporate name
Description: The names, appellations, or other identifiers assigned to an individual, group, corporate body, or other entity that has contributed to the design, creation, production, manufacture, or alteration of the work or image.
Data Values (controlled): recommend ULAN and AAAF (LC authority files).
- CULTURE
The following controlled vocabularies are usually recommended by the metadata standards or best practice guide.
Standardized vocabularies
DCMI Type Vocabulary
http://dublincore.org/documents/dcmi-type-vocabulary/The DCMI Type Vocabulary provides a general, cross-domain list of approved terms that may be used as values for the Resource Type element to identify the genre of a resource.
[MIME] Internet Media Types
http://www.iana.org/assignments/media-types/
May be used as values for the Format element.ISO 639 – Codes for the representation of names of languages.
http://www.oasis-open.org/cover/iso639a.html
May be used as values for the Language element.ISO 3166 – Codes for the representation of names of countries.
http://www.oasis-open.org/cover/country3166.htmlThesauri and classification schemes
Note: Only a small number of thesauri and classification schemes are listed below. They are frequently mentioned in metadata standards. A more completed list is available online.
Subject Headings
LC Subject Headings (LCSH)
Website about Web access to LCSH http://www.loc.gov/cds/classweb.htmlFAST (Faceted Application of Subject Terminology) Authority File
http://www.oclc.org/research/projects/fast/
FAST adapted the LCSH with a simplified syntax. It retains the very rich vocabulary of LCSH while making the schema easier to understand, control, apply, and use. The headings have been built into FAST authority records.
Current search interface: http://fast.oclc.org/Medical Subject Headings (MESH)
http://www.nlm.nih.gov/mesh/meshhome.html
MeSH consists of sets of terms naming descriptors in a hierarchical structure that permits searching at various levels of specificity. There are 22,568 descriptors in MeSH. In addition to these headings, there are more than 139,000 headings called Supplementary Concept Records (formerly Supplementary Chemical Records) within a separate thesaurus.Thesauri
Art and Architecture Thesaurus (AAT)
http://www.getty.edu/research/tools/
vocabulary/aat/index.html
The AAT is a structured vocabulary of more than 133,000 terms, descriptions, bibliographic citations, and other information relating to fine art, architecture, decorative arts, archival materials, and material culture.Library of Congress Thesauri
http://www.loc.gov/pmei/lexico/tgm1/brsearch.htmlThesaurus for the Global Legal information Network (GLIN)
http://www.loc.gov/lexico/servlet/lexico?usr=pub-375:0&op=frames&db=GLIN
Now used for The Global Legal Information Network’s multi-national database of legislation, this thesaurus has been under continuous development since 1950.Legislative Indexing Vocabulary (LIV)
http://www.loc.gov/lexico/servlet/lexico?usr=pub-375:0&op=frames&db=LIV
The thesaurus was developed by the Congressional Research Service for use with legislative and public policy material.Thesaurus for Graphic Materials I: Subject Terms (TGM I)
http://lcweb2.loc.gov/pp/tgmiquery.html
A thesaurus consisting of thousands of terms and numerous cross references for the purpose of indexing visual materials.Thesaurus for Graphic Materials II: Genre and Physical Characteristic Terms (TGM II)
http://lcweb2.loc.gov/pp/tgmiquery.html
A thesaurus of more than 600 terms, developed by the Library of Congress Prints and Photographs Division, with input from other archival image repositories.Classification schemes Â
Dewey Decimal Classification (DDC)
Website about DDC http://www.oclc.org/dewey/default.htmThe ACM Computing Classification System [1998 Version], Valid in 2003, Association for Computing Machinery
http://www.acm.org/class/1998/Name authority lists
The Union List of Artist Names (ULAN)
http://www.getty.edu/research/conducting_research/vocabularies/ulan
The ULAN is a structured vocabulary containing more than 225,000 names and biographical and bibliographic information about artists and architects, including a wealth of variant names, pseudonyms, and language variants.The Getty Thesaurus of Geographic Names (TGN)
http://www.getty.edu/research/conducting_research/vocabularies/tgn/
The TGN is a structured, world-coverage vocabulary of 1.3 million names, including vernacular and historical names, coordinates, place types, and descriptive notes, focusing on places important for the study of art and architecture.
LC Name Authority file = Anglo-American Authority File (AAAF)
http://authorities.loc.gov/
Includes several millions of name authority records for personal, corporate, meeting, and geographic names.Best practice guidelines for data content
The best practice guides prepared by various communities and projects usually provide detailed guidelines regarding how to assign values when creating metadata records. The following are examples of standards for data content to be followed in particular communities.
Cataloguing Culture Objects (CCO), A Guide to Describing Cultural Works and Their Images
http://www.vraweb.org/ccoweb/cco/selections.html
Provides guidelines for selecting, ordering, and formatting data used to populate elements in a catalogue record, in order to to advance the increasing move toward shared cataloguing and contribute to improved documentation and access to cultural heritage information.
Guidelines for Encoding Bibliographic Citation Information in Dublin Core Metadata
http://dublincore.org/documents/dc-citation-guidelines/
It deals primarily with bibliographic citations for a resource within its own metadata, but some guidelines for describing references to other resources are also indicated.EAD Cookbook: http://www.archivists.org/saagroups/ead/ead2002cookbookhelp.html
” … provides assistance in using three applications for creating encoded finding aids- XMetaL, <oXygen/>, and Note Tab. This includes instructions for installing and modifying the applications, and auxiliary files such as templates that make them easier to use”DLESE Best Practices
http://www.dlese.org/Metadata/collections/metadata-best-practices.htm
Lists the metadata field definitions, cataloging best practices, and vocabulary explanations for the metadata fields in the DLESE Cataloging System.Best Practices for Shareable Metadata
(DRAFT)
http://comm.nsdl.org/download.php/653/ShareableMetadataBestPractices.doc Online version: http://oai-best.comm.nsdl.org/cgi-bin/wiki.pl?PublicTOC
Part of the Best Practices for OAI Data Provider Implementations and Shareable Metadata, A joint initiative between the Digital Library Federation and the National Science Digital LibraryMetadata standards usually include the best practice guides in the specifications, see section 4 for the list of standards.
- Including:
Leave a Reply