2.11 Solr scheme
Last updated
Last updated
The Solr schema used by the Goobi viewer Indexer and Goobi viewer Core is located in the Git repository of the Goobi viewer Indexer:
There are different types of fields in the schema itself. These allow different types of configuration or use.
Some fields in the Solr search index are statically referenced in the Goobi viewer core to provide certain functions. These are listed in the following matrix:
COinS are output on record pages and search hit pages. HighWire Press and DublinCore metatags are output on record pages.
Field type
Example
Use and Configuration
static (fix)
DOCSTRCT, NUMPAGES
These fields cannot be configured. They are written automatically by the Goobi viewer Indexer.
static (configurable)
PI, CURRENTNO
These fields have a static name. However, the content can be configured via XPATH using the configuration file of the Goobi viewer indexer.
MD_
MD_TITLE
All fields beginning with MD_ can be freely configured in the Goobi viewer Indexer configuration file. All types of values can be saved. Depending on the field configuration, sorting fields, fields for faceting, Untokenized fields, etc. are also written.
MD2_
Some fields in the Solr search index are statically referenced in the Goobi viewer core to provide certain functions. These are listed in the following matrix: Solr field name stored information COinS HighWire Press DublinCore RIS MD_TITLE Titel des Werkes rft.title citation_title DC.title TI MD_ALTERNATETITLE Alternativtitel -/- -/- -/- J2 MD_CREATOR Name des Autors rft.au citation_author DC.creator AU MD_EDITOR Name des Editors -/- -/- -/- ED MD_CORPORATION Körperschaft rft.aucorp -/- -/- -/- MD_ISBN ISBN-Nummer rft.isbn citation_isbn -/- SN MD_ISSN ISSN-Nummer rft.issn citation_issn -/- SN MD_YEARPUBLISH Erscheinungs-jahr rft.date citation_publication _date DC.date PY MD_PUBLISHER Verlag rft.pub -/- DC.publisher PB MD_PLACEPUBLISH Erscheinungsort rft.place -/- -/- PP MD_EDITION Edition rtf.edition -/- -/- ET MD_SERIES Serie rft.series -/- -/- -/- MD_SUBJECT Schlagwort rft.subject -/- -/- -/- MD_LANGUAGE Sprache rft.language citation_language DC.language LA CURRENTNO Bandnummer -/- citation_volume -/- VL MD_INFORMATION Abstract zum Werk -/- description DCTERMS.abstract AB MD_NOTE Generische Notiz -/- -/- -/- N1 MD_GEOKEYWORD Geographisches Schlagwort -/- -/- -/- KW MD_PERSONKEYWORD Personen-schlagwort -/- -/- -/- KW MD_WORKKEYWORD Werkschlagwort -/- -/- -/- KW URN URN des Werkes -/- -/- DC.identifier -/- ACCESSCONDITION Zugriffslizenz -/- -/- DC.rights -/- NUMPAGES Anzahl der Seiten -/- -/- -/- SP NUMVOLUMES Anzahl der Bände in einem Anchor -/- -/- -/- NV PI_TOPSTRUCT Identifier des Hauptwerkes -/- -/- -/- CN COinS are output on record pages and search hit pages. HighWire Press and DublinCore metatags are output on record pages.MD2_DESCRIPTION
Works the same as MD_. The purpose of this alternative pattern is to exclude certain fields from a blanket delivery if required (for example, if all MD_* fields are delivered via an interface).
MDNUM_
MDNUM_PUBLICRELEASEYEAR
Works like MD_, but the data type is not a string, but a long to ensure numerical sorting.
BOOL_
BOOL_DIRECTION_RTL
All fields beginning with BOOL_ can be freely configured in the Goobi viewer Indexer configuration file. Only Booleans (true|false) can be saved. These can be directly adopted if they are in the source document. Alternatively, a boolean can be ensured in the XPATH expression via boolean(string(...)).
SORT_
SORT_TITLE
Fields with SORT_ can be configured directly in the configuration file. This is useful, for example, if the sort title is already explicitly displayed in the metadata. Otherwise, the addSortField switch can be set to write a sort field for a metadata. Sort fields may only contain one value. The fields are then available for sorting search hits, for example.
FACET_
FACET_PLACEPUBLISH
Special form of metadata fields used for more efficient faceting. Fields CENTURY, YEAR, DC, DOCSTRUCT, DOCSTRUCT_*, MD_ and MD_* are automatically created as a FACET_* copy. The original field name is configured for faceting, and the FACET_* variant is automatically used.
_UNTOKENIZED
MD_TITLE_UNTOKENIZED
Fields with the suffix _UNTOKENIZED must not be configured directly in the configuration file. The fields are required, for example, in the Browse function. Whether a _UNTOKENIZED field is written is controlled by the switch addUntokenizedVersion.
_LANG_XX
MD_TITLE_LANG_DE
Fields with LANG_XX can be freely defined as suffixes to MD_ fields. XX must be replaced by a valid language abbreviation, for example DE, EN, FR or ES. If there are values in fields with _LANG_XX suffix in the Solr Index, the Goobi viewer core evaluates them and supports not only multilingualism in the interface, but also for the metadata values.
NORM_
NORM_ALTNAME
Searchable medial fields imported from authority records.
GROUPID_ / GROUPORDER_
GROUPID_SERIES, GROUPORDER_SERIES
Fields that describe group membership (e.g. to a band series). GROUPID_ contains the identifier of the group, GROUPORDER_ the sort sequence of the relevant record in this group. The value after the underscore must be
DATE_
DATE_PUBLICRELEASE
Fields with the Solr data type TrieDateField store time stamp (YYYY-MM-DDThh:mm:ssZ). For future application scenarios.
Solr field name
stored information
COinS
HighWire Press
DublinCore
RIS
MD_TITLE
Titel des Werkes
rft.title
citation_title
DC.title
TI
MD_ALTERNATETITLE
Alternativtitel
-/-
-/-
-/-
J2
MD_CREATOR
Name des Autors
rft.au
citation_author
DC.creator
AU
MD_EDITOR
Name des Editors
-/-
-/-
-/-
ED
MD_CORPORATION
Körperschaft
rft.aucorp
-/-
-/-
-/-
MD_ISBN
ISBN-Nummer
rft.isbn
citation_isbn
-/-
SN
MD_ISSN
ISSN-Nummer
rft.issn
citation_issn
-/-
SN
MD_YEARPUBLISH
Erscheinungs-jahr
rft.date
citation_publication _date
DC.date
PY
MD_PUBLISHER
Verlag
rft.pub
-/-
DC.publisher
PB
MD_PLACEPUBLISH
Erscheinungsort
rft.place
-/-
-/-
PP
MD_EDITION
Edition
rtf.edition
-/-
-/-
ET
MD_SERIES
Serie
rft.series
-/-
-/-
-/-
MD_SUBJECT
Schlagwort
rft.subject
-/-
-/-
-/-
MD_LANGUAGE
Sprache
rft.language
citation_language
DC.language
LA
CURRENTNO
Bandnummer
-/-
citation_volume
-/-
VL
MD_INFORMATION
Abstract zum Werk
-/-
description
DCTERMS.abstract
AB
MD_NOTE
Generische Notiz
-/-
-/-
-/-
N1
MD_GEOKEYWORD
Geographisches Schlagwort
-/-
-/-
-/-
KW
MD_PERSONKEYWORD
Personen-schlagwort
-/-
-/-
-/-
KW
MD_WORKKEYWORD
Werkschlagwort
-/-
-/-
-/-
KW
URN
URN des Werkes
-/-
-/-
DC.identifier
-/-
ACCESSCONDITION
Zugriffslizenz
-/-
-/-
DC.rights
-/-
NUMPAGES
Anzahl der Seiten
-/-
-/-
-/-
SP
NUMVOLUMES
Anzahl der Bände in einem Anchor
-/-
-/-
-/-
NV
PI_TOPSTRUCT
Identifier des Hauptwerkes
-/-
-/-
-/-
CN