• If you are citizen of an European Union member nation, you may not use this service unless you are at least 16 years old.

  • You already know Dokkio is an AI-powered assistant to organize & manage your digital files & messages. Very soon, Dokkio will support Outlook as well as One Drive. Check it out today!

View
 

DSpace Search Index Parameters

Page history last edited by Monica 3 years, 10 months ago

Effective: release 5.0 forward (implemented in 2015)

 

DSpace indexes most metadata fields as well as the full text from the bitstream (or ingested file). If the file does not have embedded text (e.g. a PDF that is not OCR'd), then there is no text to index for searching.  The collection description is also indexed.

 

Fields excluded from indexing are dc.description.provenance (which is non-public field generated by the system) and bitstream descriptions.

 

Browse vs Advance Filter searching

Narrow search results by selecting the “Show Advanced Filters” link and refining your search by the following categories:

 

Label

Element

Title

dc.title

dc.title.subtitle

dc.title.alternative

Author

Primary Roles

Note: Primary Roles are available via Browse

dc.contributor.author

dc.creator

dc.contributor.architect

dc.contributor.performer

dc.contributor.illustrator

dc.contributor.photographer

Contributor

Secondary Roles

Note: Secondary Roles are NOT available via Browse. These include:

dc.contributor

dc.contributor.accompanist

dc.contributor.agency

dc.contributor.arranger

dc.contributor.org

dc.contributor.company

dc.contributor.conductor

dc.contributor.donor

dc.contributor.editor

dc.contributor.funder

dc.contributor.interviewer

dc.contributor.lecturer

dc.contributor.publisher

dc.contributor.translator

 

following secondary roles by also be browsed by Discover filters (via custom themes) For example:

dc.contributor.advisor

dc.contributor.composer

dc.contributor.committeeMember

Subject

All qualifiers

Date Issued

dc.date.issued

Description (*)

dc.description

dc.description.translation

dc.description.center

Abstract (*)

dc.description.abstract 

Type

All qualifiers.

May use standard vocabulary terms such as “Sound”, “Image”, “photograph”, “Book chapter”, “maps”, etc.

Full Text (*) Extracted Text Bundle, if present
identifier (*)  

 

(*) Label not found in advanced filter options but are indexed and therefore content is searchable via Key Word searching box.

 

Basic Ranking Model

In general, any term found in the DSpace system "browse" options (Title, Date, Author, Subject) is indexed. Any field designated for discovery filtering (i.e. custom theme) is also indexed (e.g. thesis.degree.*). For specific elements please see above list.

 

Also the more often a term appears in the above metadata elements the higher the ranking in search results.

 

Examples:

 

Frequency of the term "Campanile" in the item metadata.

1) top ranking image: in this Rice Historical image (https://scholarship.rice.edu/handle/1911/63579) the word ""Campanile" appears in the title, description and subject.

2) top ranking text: in this Shepherd musical program (https://scholarship.rice.edu/handle/1911/43689), the word "Campanile" appears in the title, contributor and full text.

3) lower ranking yearbook: While the metadata for the yearbooks, the term "Campanile" only appears once in the title.

 

 

For more tips on searching the archive, please see IR FAQ: How do I find content in the archive?

 

Comments (0)

You don't have permission to comment on this page.