Effective: release 5.0 forward (implemented in 2015)
DSpace indexes most metadata fields as well as the full text from the bitstream (or ingested file). If the file does not have embedded text (e.g. a PDF that is not OCR'd), then there is no text to index for searching. The collection description is also indexed.
Fields excluded from indexing are dc.description.provenance (a non-public field generated by the system) and bitstream descriptions.
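The inclusion and exclusion rules above can be illustrated with a small sketch. This is not DSpace code: the function name, the dictionary layout, and the sample metadata values are all hypothetical, chosen only to show which text would and would not reach the index.

```python
# Illustrative only: names and sample values are assumptions, not DSpace internals.
EXCLUDED_FIELDS = {"dc.description.provenance"}  # non-public, system-generated


def indexable_text(metadata, full_text=None):
    """Collect the strings that would be handed to the search index.

    metadata  -- dict mapping a field name to a list of values
    full_text -- text extracted from the bitstream, or None when the file
                 has no embedded text (e.g. a PDF that is not OCR'd)
    Bitstream descriptions are never passed in, so they are never indexed.
    """
    parts = []
    for field, values in metadata.items():
        if field in EXCLUDED_FIELDS:
            continue  # skip provenance
        parts.extend(values)
    if full_text:
        parts.append(full_text)
    return parts


# Made-up item for demonstration.
item = {
    "dc.title": ["Campanile, 1916"],
    "dc.description.provenance": ["Example provenance note"],
}
print(indexable_text(item))
```

With no extracted text, only the title value is returned; the provenance note is dropped.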
Browse vs. Advanced Filter Searching
Narrow search results by selecting the “Show Advanced Filters” link and refining your search by the following categories:
Label | Element
Title | dc.title; dc.title.subtitle; dc.title.alternative
Author (Primary Roles) | Note: Primary Roles are available via Browse. dc.contributor.author; dc.creator; dc.contributor.architect; dc.contributor.performer; dc.contributor.illustrator; dc.contributor.photographer
Contributor (Secondary Roles) | Note: Secondary Roles are NOT available via Browse. These include: dc.contributor; dc.contributor.accompanist; dc.contributor.agency; dc.contributor.arranger; dc.contributor.org; dc.contributor.company; dc.contributor.conductor; dc.contributor.donor; dc.contributor.editor; dc.contributor.funder; dc.contributor.interviewer; dc.contributor.lecturer; dc.contributor.publisher; dc.contributor.translator. The following secondary roles can also be browsed by Discover filters (via custom themes), for example: dc.contributor.advisor; dc.contributor.composer; dc.contributor.committeeMember
Subject | All qualifiers
Date Issued | dc.date.issued
Description (*) | dc.description; dc.description.translation; dc.description.center
Abstract (*) | dc.description.abstract
Type | All qualifiers. May use standard vocabulary terms such as “Sound”, “Image”, “photograph”, “Book chapter”, “maps”, etc.
Full Text (*) | Extracted Text Bundle, if present
identifier (*) |
(*) These labels do not appear in the advanced filter options, but the fields are indexed, so their content is searchable via the keyword search box.
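For readers who want to reason about which filter a given metadata field falls under, the table can be expressed as a lookup. This is a minimal sketch, not DSpace configuration: the `FILTER_FIELDS` name and the `filters_for` helper are hypothetical, and "All qualifiers" is assumed here to mean the bare element plus any qualifier (for example `dc.subject` and `dc.subject.*`).

```python
from fnmatch import fnmatchcase

# Hypothetical lookup built from the table above.
# Assumption: "All qualifiers" = the bare element plus "<element>.*".
FILTER_FIELDS = {
    "Title": ["dc.title", "dc.title.subtitle", "dc.title.alternative"],
    "Author": [
        "dc.contributor.author", "dc.creator", "dc.contributor.architect",
        "dc.contributor.performer", "dc.contributor.illustrator",
        "dc.contributor.photographer",
    ],
    "Contributor": [
        "dc.contributor", "dc.contributor.accompanist", "dc.contributor.agency",
        "dc.contributor.arranger", "dc.contributor.org", "dc.contributor.company",
        "dc.contributor.conductor", "dc.contributor.donor", "dc.contributor.editor",
        "dc.contributor.funder", "dc.contributor.interviewer",
        "dc.contributor.lecturer", "dc.contributor.publisher",
        "dc.contributor.translator",
    ],
    "Subject": ["dc.subject", "dc.subject.*"],
    "Date Issued": ["dc.date.issued"],
    "Type": ["dc.type", "dc.type.*"],
}


def filters_for(field):
    """Return the advanced-filter labels whose patterns match the field."""
    return [
        label
        for label, patterns in FILTER_FIELDS.items()
        if any(fnmatchcase(field, pattern) for pattern in patterns)
    ]


print(filters_for("dc.title.subtitle"))
print(filters_for("dc.description.abstract"))
```

The second call returns an empty list because Abstract is one of the (*) labels: it is indexed for keyword search but has no advanced filter of its own.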
Basic Ranking Model
In general, any term found in the DSpace system "browse" options (Title, Date, Author, Subject) is indexed. Any field designated for discovery filtering (i.e. custom theme) is also indexed (e.g. thesis.degree.*). For specific elements, see the list above.
In addition, the more often a term appears in the metadata elements above, the higher the item ranks in search results.
Examples:
Frequency of the term "Campanile" in the item metadata.
1) top ranking image: in this Rice Historical image (https://scholarship.rice.edu/handle/1911/63579) the word "Campanile" appears in the title, description and subject.
2) top ranking text: in this Shepherd musical program (https://scholarship.rice.edu/handle/1911/43689), the word "Campanile" appears in the title, contributor and full text.
3) lower ranking yearbook: in the metadata for the yearbooks, the term "Campanile" appears only once, in the title.
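The frequency effect in these examples can be mimicked with a toy counter. This is only a sketch of the "more occurrences, higher rank" idea: the actual relevance scoring is done by the search engine and is more involved than a raw count. The function name and the sample metadata below are invented for illustration and are not copied from the linked items.

```python
def term_frequency_score(term, fields):
    """Count case-insensitive occurrences of term across all field values."""
    needle = term.lower()
    return sum(
        value.lower().count(needle)
        for values in fields.values()
        for value in values
    )


# Invented metadata, loosely shaped like the examples above.
items = {
    "image": {
        "title": ["Campanile at night"],
        "description": ["View of the Campanile"],
        "subject": ["Campanile"],
    },
    "yearbook": {
        "title": ["Campanile 1950"],
        "subject": ["Yearbooks"],
    },
}

# Rank item names from most to fewest occurrences of the term.
ranked = sorted(
    items,
    key=lambda name: term_frequency_score("campanile", items[name]),
    reverse=True,
)
print(ranked)
```

In this made-up data the image record mentions the term in three fields and the yearbook in one, so the image sorts first.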
For more tips on searching the archive, please see IR FAQ: How do I find content in the archive?