2.11 Solr schema

General

The Solr schema used by the Goobi viewer Indexer and the Goobi viewer Core is maintained in the Git repository of the Goobi viewer Indexer.

Field types

The schema defines several types of fields, which differ in how they can be configured and used.

| Field type | Example | Use and configuration |
| --- | --- | --- |
| static (fixed) | DOCSTRCT, NUMPAGES | These fields cannot be configured. They are written automatically by the Goobi viewer Indexer. |
| static (configurable) | PI, CURRENTNO | These fields have a fixed name, but their content can be configured via XPath in the Goobi viewer Indexer configuration file. |
| MD_ | MD_TITLE | All fields beginning with MD_ can be freely configured in the Goobi viewer Indexer configuration file. Any type of value can be stored. Depending on the field configuration, sort fields, facet fields, untokenized fields, etc. are also written. |
| MD2_ | MD2_DESCRIPTION | Works the same as MD_. The purpose of this alternative pattern is to exclude certain fields from a blanket delivery if required (for example, if all MD_* fields are delivered via an interface). |
| MDNUM_ | MDNUM_PUBLICRELEASEYEAR | Works like MD_, but the data type is long rather than string, which ensures numerical sorting. |
| BOOL_ | BOOL_DIRECTION_RTL | All fields beginning with BOOL_ can be freely configured in the Goobi viewer Indexer configuration file. Only Boolean values (true or false) can be stored. They can be adopted directly if they are present in the source document. Alternatively, a Boolean can be enforced in the XPath expression via boolean(string(...)). |
| SORT_ | SORT_TITLE | Fields beginning with SORT_ can be configured directly in the configuration file. This is useful, for example, if the sort title is already explicitly present in the metadata. Otherwise, the addSortField switch can be set to write a sort field for a metadata field. Sort fields may contain only one value. The fields are then available, for example, for sorting search hits. |
| FACET_ | FACET_PLACEPUBLISH | Special form of metadata fields used for more efficient faceting. The fields CENTURY, YEAR, DC, DOCSTRCT, DOCSTRCT_*, MD_ and MD_* are automatically copied to a FACET_* field. The original field name is configured for faceting, and the FACET_* variant is used automatically. |
| _UNTOKENIZED | MD_TITLE_UNTOKENIZED | Fields with the suffix _UNTOKENIZED must not be configured directly in the configuration file. Whether an _UNTOKENIZED field is written is controlled by the addUntokenizedVersion switch. These fields are required, for example, by the Browse function. |
| _LANG_XX | MD_TITLE_LANG_DE | The suffix _LANG_XX can be freely appended to MD_ fields. XX must be replaced by a valid language abbreviation, for example DE, EN, FR or ES. If the Solr index contains values in fields with a _LANG_XX suffix, the Goobi viewer Core evaluates them, so multilingualism is supported not only in the interface but also for the metadata values. |
| NORM_ | NORM_ALTNAME | Searchable metadata fields imported from authority records. |
| GROUPID_ / GROUPORDER_ | GROUPID_SERIES, GROUPORDER_SERIES | Fields that describe group membership (e.g. in a series of volumes). GROUPID_ contains the identifier of the group, GROUPORDER_ the position of the relevant record within this group. The value after the underscore must be |
| DATE_ | DATE_PUBLICRELEASE | Fields with the Solr data type TrieDateField store a timestamp (YYYY-MM-DDThh:mm:ssZ). Intended for future application scenarios. |
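The naming conventions listed above can be checked mechanically. The following Python sketch is only an illustration and is not part of the Goobi viewer: the function name classify_field and the returned labels are hypothetical, and the pattern for _LANG_XX assumes two uppercase letters, as in the examples DE, EN, FR and ES.

```python
import re

# Prefixes taken from the field type table above.
PREFIXES = (
    "MD2_", "MDNUM_", "MD_", "BOOL_", "SORT_", "FACET_",
    "NORM_", "GROUPID_", "GROUPORDER_", "DATE_",
)

# Assumption: the language abbreviation consists of two uppercase letters.
LANG_SUFFIX = re.compile(r"_LANG_[A-Z]{2}$")


def classify_field(name: str) -> str:
    """Return the field type label that matches a Solr field name."""
    # Suffixes are checked first, because suffixed fields also carry a prefix.
    if name.endswith("_UNTOKENIZED"):
        return "_UNTOKENIZED"
    if LANG_SUFFIX.search(name):
        return "_LANG_XX"
    for prefix in PREFIXES:
        if name.startswith(prefix):
            return prefix
    # Everything else has a static name (fixed or configurable).
    return "static"


if __name__ == "__main__":
    for field in ("MD_TITLE", "MD_TITLE_LANG_DE", "SORT_TITLE", "DOCSTRCT"):
        print(field, "->", classify_field(field))
```

The name alone cannot distinguish fixed static fields from configurable ones, so the sketch reports both as "static".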

Special fields

Some fields in the Solr search index are statically referenced in the Goobi viewer Core to provide certain functions. These are listed in the following matrix:

| Solr field name | Stored information | COinS | HighWire Press | DublinCore | RIS |
| --- | --- | --- | --- | --- | --- |
| MD_TITLE | Title of the work | rft.title | citation_title | DC.title | TI |
| MD_ALTERNATETITLE | Alternative title | -/- | -/- | -/- | J2 |
| MD_CREATOR | Name of the author | rft.au | citation_author | DC.creator | AU |
| MD_EDITOR | Name of the editor | -/- | -/- | -/- | ED |
| MD_CORPORATION | Corporate body | rft.aucorp | -/- | -/- | -/- |
| MD_ISBN | ISBN | rft.isbn | citation_isbn | -/- | SN |
| MD_ISSN | ISSN | rft.issn | citation_issn | -/- | SN |
| MD_YEARPUBLISH | Year of publication | rft.date | citation_publication_date | DC.date | PY |
| MD_PUBLISHER | Publisher | rft.pub | -/- | DC.publisher | PB |
| MD_PLACEPUBLISH | Place of publication | rft.place | -/- | -/- | PP |
| MD_EDITION | Edition | rft.edition | -/- | -/- | ET |
| MD_SERIES | Series | rft.series | -/- | -/- | -/- |
| MD_SUBJECT | Subject keyword | rft.subject | -/- | -/- | -/- |
| MD_LANGUAGE | Language | rft.language | citation_language | DC.language | LA |
| CURRENTNO | Volume number | -/- | citation_volume | -/- | VL |
| MD_INFORMATION | Abstract of the work | -/- | description | DCTERMS.abstract | AB |
| MD_NOTE | Generic note | -/- | -/- | -/- | N1 |
| MD_GEOKEYWORD | Geographic keyword | -/- | -/- | -/- | KW |
| MD_PERSONKEYWORD | Person keyword | -/- | -/- | -/- | KW |
| MD_WORKKEYWORD | Work keyword | -/- | -/- | -/- | KW |
| URN | URN of the work | -/- | -/- | DC.identifier | -/- |
| ACCESSCONDITION | Access licence | -/- | -/- | DC.rights | -/- |
| NUMPAGES | Number of pages | -/- | -/- | -/- | SP |
| NUMVOLUMES | Number of volumes in an anchor | -/- | -/- | -/- | NV |
| PI_TOPSTRUCT | Identifier of the main work | -/- | -/- | -/- | CN |
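To illustrate how the RIS column of the matrix can be read, the following Python sketch turns a Solr document into a RIS record. It is a minimal illustration under stated assumptions, not the Goobi viewer Core implementation: the function to_ris, the representation of the Solr document as a plain dictionary, and the default record type BOOK are hypothetical.

```python
# Solr field -> RIS tag, copied from the RIS column of the matrix above.
RIS_TAGS = {
    "MD_TITLE": "TI",
    "MD_ALTERNATETITLE": "J2",
    "MD_CREATOR": "AU",
    "MD_EDITOR": "ED",
    "MD_ISBN": "SN",
    "MD_ISSN": "SN",
    "MD_YEARPUBLISH": "PY",
    "MD_PUBLISHER": "PB",
    "MD_PLACEPUBLISH": "PP",
    "MD_EDITION": "ET",
    "MD_LANGUAGE": "LA",
    "CURRENTNO": "VL",
    "MD_INFORMATION": "AB",
    "MD_NOTE": "N1",
    "MD_GEOKEYWORD": "KW",
    "MD_PERSONKEYWORD": "KW",
    "MD_WORKKEYWORD": "KW",
    "NUMPAGES": "SP",
    "NUMVOLUMES": "NV",
    "PI_TOPSTRUCT": "CN",
}


def to_ris(doc: dict, record_type: str = "BOOK") -> str:
    """Render a Solr document (given as a dict) as a single RIS record."""
    # Assumption: the record type is passed in; the matrix does not define it.
    lines = [f"TY  - {record_type}"]
    for field, tag in RIS_TAGS.items():
        value = doc.get(field)
        if value is None:
            continue
        # Solr fields may be single- or multi-valued.
        values = value if isinstance(value, list) else [value]
        for item in values:
            lines.append(f"{tag}  - {item}")
    lines.append("ER  - ")  # end-of-record marker
    return "\n".join(lines)


if __name__ == "__main__":
    print(to_ris({"MD_TITLE": "Faust", "MD_CREATOR": ["Goethe, Johann Wolfgang von"]}))
```

Fields without a RIS tag in the matrix (marked -/-) are simply omitted from the mapping.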

COinS are output on record pages and search hit pages. HighWire Press and DublinCore metatags are output on record pages.
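The HighWire Press and DublinCore columns can be read in the same way. The sketch below builds the corresponding HTML meta tags for a record page from a Solr document. It is an illustration only: the function meta_tags and the dictionary-based document are hypothetical, and the markup actually produced by the Goobi viewer Core may differ.

```python
from html import escape

# Solr field -> meta tag name, copied from the matrix above.
HIGHWIRE_TAGS = {
    "MD_TITLE": "citation_title",
    "MD_CREATOR": "citation_author",
    "MD_ISBN": "citation_isbn",
    "MD_ISSN": "citation_issn",
    "MD_YEARPUBLISH": "citation_publication_date",
    "MD_LANGUAGE": "citation_language",
    "CURRENTNO": "citation_volume",
    "MD_INFORMATION": "description",
}

DUBLIN_CORE_TAGS = {
    "MD_TITLE": "DC.title",
    "MD_CREATOR": "DC.creator",
    "MD_YEARPUBLISH": "DC.date",
    "MD_PUBLISHER": "DC.publisher",
    "MD_LANGUAGE": "DC.language",
    "MD_INFORMATION": "DCTERMS.abstract",
    "URN": "DC.identifier",
    "ACCESSCONDITION": "DC.rights",
}


def meta_tags(doc: dict) -> list[str]:
    """Return HighWire Press and DublinCore meta tags for a Solr document."""
    tags = []
    for mapping in (HIGHWIRE_TAGS, DUBLIN_CORE_TAGS):
        for field, name in mapping.items():
            value = doc.get(field)
            if value is None:
                continue
            # Solr fields may be single- or multi-valued.
            values = value if isinstance(value, list) else [value]
            for item in values:
                content = escape(str(item), quote=True)
                tags.append(f'<meta name="{name}" content="{content}">')
    return tags


if __name__ == "__main__":
    for tag in meta_tags({"MD_TITLE": "Faust", "MD_LANGUAGE": "de"}):
        print(tag)
```

Escaping the values keeps the generated attributes valid even when a title contains quotation marks or ampersands.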
