2.11 Solr schema
General
The Solr schema used by the Goobi viewer Indexer and the Goobi viewer Core is maintained in the Git repository of the Goobi viewer Indexer.
Field types
The schema defines several types of fields, which differ in how they can be configured and used.
Field type | Example | Use and configuration |
static (fixed) | DOCSTRCT, NUMPAGES | These fields cannot be configured. They are written automatically by the Goobi viewer Indexer. |
static (configurable) | PI, CURRENTNO | These fields have a static name. However, their content can be configured via XPath in the configuration file of the Goobi viewer Indexer. |
MD_ | MD_TITLE | All fields beginning with MD_ can be freely configured in the Goobi viewer Indexer configuration file. All types of values can be saved. Depending on the field configuration, sorting fields, faceting fields, untokenized fields, etc. are also written. |
MD2_ | MD2_DESCRIPTION | Works the same as MD_. The purpose of this alternative prefix is to exclude certain fields from a blanket delivery if required (for example, if all MD_* fields are delivered via an interface). |
MDNUM_ | MDNUM_PUBLICRELEASEYEAR | Works like MD_, but the data type is long rather than string, which ensures numerical sorting. |
BOOL_ | BOOL_DIRECTION_RTL | All fields beginning with BOOL_ can be freely configured in the Goobi viewer Indexer configuration file. Only Boolean values (true or false) can be saved. These can be adopted directly if they are present in the source document. Alternatively, a Boolean result can be enforced in the XPath expression via boolean(string(...)). |
SORT_ | SORT_TITLE | Fields beginning with SORT_ can be configured directly in the configuration file. This is useful, for example, if the sort title is already explicitly present in the metadata. Otherwise, the addSortField switch can be set to write a sort field for a metadata field. Sort fields may contain only one value. The fields are then available, for example, for sorting search hits. |
FACET_ | FACET_PLACEPUBLISH | Special form of metadata fields used for more efficient faceting. The fields CENTURY, YEAR, DC, DOCSTRUCT, DOCSTRUCT_*, MD_ and MD_* are automatically copied to a FACET_* field. The original field name is configured for faceting, and the FACET_* variant is used automatically. |
_UNTOKENIZED | MD_TITLE_UNTOKENIZED | Fields with the suffix _UNTOKENIZED must not be configured directly in the configuration file. These fields are required, for example, by the Browse function. Whether an _UNTOKENIZED field is written is controlled by the addUntokenizedVersion switch. |
_LANG_XX | MD_TITLE_LANG_DE | The suffix _LANG_XX can be freely appended to MD_ fields. XX must be replaced by a valid language abbreviation, for example DE, EN, FR or ES. If the Solr index contains values in fields with a _LANG_XX suffix, the Goobi viewer Core evaluates them, so that multilingualism is supported not only in the interface but also for the metadata values. |
NORM_ | NORM_ALTNAME | Searchable metadata fields imported from authority records. |
GROUPID_ / GROUPORDER_ | GROUPID_SERIES, GROUPORDER_SERIES | Fields that describe group membership (e.g. in a series of volumes). GROUPID_ contains the identifier of the group, GROUPORDER_ the position of the record within that group. The value after the underscore must be
DATE_ | DATE_PUBLICRELEASE | Fields with the Solr data type TrieDateField store timestamps (YYYY-MM-DDThh:mm:ssZ). Intended for future application scenarios. |
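The naming conventions in the table above can be sketched as a small classifier. This is a minimal illustration in Python and not code from the Goobi viewer: the function name `classify_field` and the returned labels are hypothetical, the two-letter form of the language code is an assumption based on the examples DE, EN, FR and ES, and the static fields are reported as "other" because they share no common prefix.

```python
import re

# Prefixes taken from the table above; the labels are hypothetical.
PREFIXES = [
    ("MDNUM_", "numeric metadata"),
    ("MD2_", "alternative metadata"),
    ("MD_", "metadata"),
    ("BOOL_", "boolean"),
    ("SORT_", "sort"),
    ("FACET_", "facet"),
    ("NORM_", "authority data"),
    ("GROUPID_", "group identifier"),
    ("GROUPORDER_", "group order"),
    ("DATE_", "date"),
]

# Assumption: language abbreviations consist of two uppercase letters.
LANG_SUFFIX = re.compile(r"_LANG_([A-Z]{2})$")


def classify_field(name):
    """Return (kind, language, untokenized) for a Solr field name."""
    # Strip the _UNTOKENIZED suffix first, remembering that it was present.
    untokenized = name.endswith("_UNTOKENIZED")
    if untokenized:
        name = name[: -len("_UNTOKENIZED")]

    # Then look for a trailing language suffix such as _LANG_DE.
    language = None
    match = LANG_SUFFIX.search(name)
    if match:
        language = match.group(1)
        name = name[: match.start()]

    # Finally map the remaining name to a field type by its prefix.
    kind = "other"
    for prefix, label in PREFIXES:
        if name.startswith(prefix):
            kind = label
            break
    return kind, language, untokenized
```

For example, `classify_field("MD_TITLE_LANG_DE")` yields `("metadata", "DE", False)`, while a static field such as `DOCSTRCT` falls through to `"other"`.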
Special fields
Some fields in the Solr search index are statically referenced in the Goobi viewer core to provide certain functions. These are listed in the following matrix:
Solr field name | Stored information | COinS | HighWire Press | DublinCore | RIS |
MD_TITLE | Title of the work | rft.title | citation_title | DC.title | TI |
MD_ALTERNATETITLE | Alternative title | -/- | -/- | -/- | J2 |
MD_CREATOR | Name of the author | rft.au | citation_author | DC.creator | AU |
MD_EDITOR | Name of the editor | -/- | -/- | -/- | ED |
MD_CORPORATION | Corporate body | rft.aucorp | -/- | -/- | -/- |
MD_ISBN | ISBN | rft.isbn | citation_isbn | -/- | SN |
MD_ISSN | ISSN | rft.issn | citation_issn | -/- | SN |
MD_YEARPUBLISH | Year of publication | rft.date | citation_publication_date | DC.date | PY |
MD_PUBLISHER | Publisher | rft.pub | -/- | DC.publisher | PB |
MD_PLACEPUBLISH | Place of publication | rft.place | -/- | -/- | PP |
MD_EDITION | Edition | rft.edition | -/- | -/- | ET |
MD_SERIES | Series | rft.series | -/- | -/- | -/- |
MD_SUBJECT | Subject keyword | rft.subject | -/- | -/- | -/- |
MD_LANGUAGE | Language | rft.language | citation_language | DC.language | LA |
CURRENTNO | Volume number | -/- | citation_volume | -/- | VL |
MD_INFORMATION | Abstract of the work | -/- | description | DCTERMS.abstract | AB |
MD_NOTE | Generic note | -/- | -/- | -/- | N1 |
MD_GEOKEYWORD | Geographic keyword | -/- | -/- | -/- | KW |
MD_PERSONKEYWORD | Person keyword | -/- | -/- | -/- | KW |
MD_WORKKEYWORD | Work keyword | -/- | -/- | -/- | KW |
URN | URN of the work | -/- | -/- | DC.identifier | -/- |
ACCESSCONDITION | Access license | -/- | -/- | DC.rights | -/- |
NUMPAGES | Number of pages | -/- | -/- | -/- | SP |
NUMVOLUMES | Number of volumes in an anchor | -/- | -/- | -/- | NV |
PI_TOPSTRUCT | Identifier of the main work | -/- | -/- | -/- | CN |
COinS are output on record pages and search hit pages. HighWire Press and DublinCore metatags are output on record pages.
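To illustrate how such a static mapping can be applied, the following Python sketch renders HighWire Press meta tags from a Solr document. It is not the Goobi viewer Core implementation: the function name `highwire_meta_tags` and the dictionary-based document are hypothetical, the tag name `citation_publication_date` is assumed to be written without a space, and only a subset of the matrix is included.

```python
from html import escape

# Subset of the matrix: Solr field name -> HighWire Press meta tag name.
HIGHWIRE = {
    "MD_TITLE": "citation_title",
    "MD_CREATOR": "citation_author",
    "MD_ISBN": "citation_isbn",
    "MD_ISSN": "citation_issn",
    "MD_YEARPUBLISH": "citation_publication_date",
    "MD_LANGUAGE": "citation_language",
    "CURRENTNO": "citation_volume",
}


def highwire_meta_tags(doc):
    """Render one <meta> tag per value for every mapped field in doc.

    doc is a plain dict standing in for a Solr document (hypothetical).
    Fields that have no HighWire Press equivalent are ignored.
    """
    tags = []
    for field, tag_name in HIGHWIRE.items():
        values = doc.get(field, [])
        if not isinstance(values, list):
            values = [values]  # treat single-valued fields like lists
        for value in values:
            content = escape(str(value), quote=True)
            tags.append('<meta name="%s" content="%s">' % (tag_name, content))
    return tags
```

Called with `{"MD_TITLE": ["Faust"], "CURRENTNO": 2}`, the function returns one `citation_title` tag and one `citation_volume` tag; a field such as `MD_NOTE`, which has no HighWire Press column entry, produces no output.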