The purpose of minimum metadata is to supply the basic information needed to place these PDFs online, plus standard information common to all items in this collection. Basic information unique to each Program is the title (transcribed directly from the program), the performance date, and technical information such as digital ID, page count and digitization specs. This approach follows the general principles described in More Product, Less Process for Digital Collections (1).
Long-term plans include adding more descriptive information to enhance discovery. This may include additional dates of performance, roles and names of performers and composers, etc., as well as parsing titles into sub-elements. This work will be managed by Music librarians and catalogers.
See the procedure for creating minimum metadata.
Application Profile
| Element | Definition |
| Data unique to each program. (1) | |
| dc.title | Initial title transcribed directly from Program. |
| dc.date.issued | Date of musical performance. Use first date that appears on Program. |
| dc.identifier.digital | Unique identifier based on filename (e.g. ssmYYYY-MM-DDX). See file naming schema for more details. |
| dc.format.extent | Number of pages in PDF file. E.g. 12 pp |
| dc.identifier.citation | System generated. See IR citations. |
| Boilerplate data, common to all records | |
| dc.digitization.specifications | Access copies of concert programs are presented as OCR'd PDFs. |
| dc.rights | Copyrighted material. All rights reserved. |
| dc.type.dcmi | Text |
| dc.type.genre | pamphlets |
| dc.publisher | Shepherd School of Music, Rice University |
| dc.date.digital | Date of creation of digital resource (YYYY) |
| dc.description | (Optional) If information is provided on the program, add "Presented by" information. |
| dc.subject | Assign performance type (e.g. Graduate recital, Undergraduate recital, Faculty recital, Guest artist recital, Shepherd School ensemble). Detail List |
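The profile above could be assembled into a single record along the following lines. This is only a sketch: the function name `build_record`, its parameters, the ISO date format for dc.date.issued, and the assumption that the trailing "X" in the identifier is an optional lowercase letter are all illustrative and not taken from the file naming schema. dc.identifier.citation is left out because it is system generated.

```python
import re
from datetime import date

# Boilerplate values copied from the application profile table.
BOILERPLATE = {
    "dc.digitization.specifications":
        "Access copies of concert programs are presented as OCR'd PDFs.",
    "dc.rights": "Copyrighted material. All rights reserved.",
    "dc.type.dcmi": "Text",
    "dc.type.genre": "pamphlets",
    "dc.publisher": "Shepherd School of Music, Rice University",
}

# Assumption: identifiers look like "ssm2014-10-01a", where the final
# letter is optional. The real schema may differ.
ID_PATTERN = re.compile(r"^ssm\d{4}-\d{2}-\d{2}[a-z]?$")


def build_record(digital_id, title, performance_date, page_count,
                 digitized_year, performance_type, description=None):
    """Combine per-program values with the shared boilerplate."""
    if ID_PATTERN.match(digital_id) is None:
        raise ValueError(f"unexpected identifier format: {digital_id!r}")

    record = dict(BOILERPLATE)  # copy so the shared dict is never mutated
    record.update({
        "dc.title": title,
        "dc.date.issued": performance_date.isoformat(),  # assumed format
        "dc.identifier.digital": digital_id,
        "dc.format.extent": f"{page_count} pp",
        "dc.date.digital": str(digitized_year),
        "dc.subject": performance_type,
    })
    if description:  # optional "Presented by" information
        record["dc.description"] = description
    return record


example = build_record(
    "ssm2014-10-01a", "Faculty Recital", date(2014, 10, 1),
    12, 2014, "Faculty recital",
)
```

Keeping the boilerplate in one dictionary means a change to, say, the rights statement only has to be made in one place.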
Additional Notes
- Data specific to each item may be extracted from the digital files directly using tools such as exiftool (command line) or Omni Page Pro.
- Little to no normalization of titles is performed at this stage; only minor whitespace correction is applied. As a result, title segments (e.g. date, time and performance location) may vary in order, following the order found on the original document.
- For OCR testing findings, see the OCR Trials slides.
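As a rough illustration of the first two notes, the sketch below reads a page count through exiftool and applies whitespace-only cleanup to a transcribed title. It assumes exiftool is installed and on the PATH, and that it reports a `PageCount` tag for the PDF. The function names are hypothetical, and the exact whitespace rules used in practice are not documented here, so collapsing runs of whitespace into single spaces is an assumption.

```python
import json
import re
import subprocess


def normalize_title(raw_title):
    """Collapse runs of whitespace; segment order is left untouched."""
    return re.sub(r"\s+", " ", raw_title).strip()


def parse_page_count(exiftool_json):
    """Extract PageCount from the JSON array that exiftool prints."""
    entries = json.loads(exiftool_json)
    return int(entries[0]["PageCount"])


def read_page_count(pdf_path):
    """Run exiftool on one PDF and return its page count."""
    result = subprocess.run(
        ["exiftool", "-json", "-PageCount", pdf_path],
        capture_output=True, text=True, check=True,
    )
    return parse_page_count(result.stdout)
```

Separating the parsing step from the subprocess call makes it possible to check the parsing logic against saved exiftool output without the tool being present.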