December 2018

Developments and innovations to Goobi workflow in December 2018

Extended support of standard data via DANTE

In addition to the metadata fields for texttareas, select menus, read-only fields or the connection of GeoNames and GND, which have been available for a long time, the GBV DANTE standard data service has been connected. Dante currently provides about 90 different vocabularies which can all be used with this revised implementation.

To use the DANTE API the file goobi_metadataDisplayRules.xml must be extended by the following configuration:

<!-- dante configuration for the metadata field DocLanguage-->
<dante ref="DocLanguage">
    <!-- name of the vocabulary, a list of all vocabularies can be found here:, the vocabulary name can be found inside of the element notation -->
    <!-- the list of fields to import the value from. The list is ordered and the first value that was found will be used for the display. Can be overwritten inside of the UI -->
    <field>NORM_LABEL_de, NORM_LABEL_en, NORM_LABEL_fr, NORM_LABEL_es</field>

The field configured in each case is then available in the METS editor. The search form is opened by means of the magnifying glass icon so that the desired value can be entered within the selected standard data vocabulary. The desired hit can then be selected from the displayed search hit list. Within the desired hit, the value to be imported can also be selected from a selection list and confirmed by clicking on the green button.

Correction of the determination of directory sizes when using S3

The integration of Amazon S3 as a storage system for storing large amounts of data has already made massive progress in recent months. During this work, further improvements and error corrections were always necessary. In addition to a reduction of file accesses, the determination of directory sizes, which did not always provide correct results when using S3, has also been revised.

New plugin for importing province data from the GBV catalogue

If province data have already been entered in the GBV catalogue, they can now be imported as well. Since this metadata is not contained within the MARC record, it cannot be imported during the initial import of the metadata. The import therefore takes place in a second step. For this purpose, the EPN of the A-record is used to search for the corresponding Pica+ record. The system then checks whether provenance data is available for the copy used. This check takes place by matching the EPN in field 092B$2 and the library symbol in field 092B$1.

To be able to use the plugin, a metadata group Provenience must exist in the rule record:

    <language name="de">Code</language>
    <language name="en">ProvenienceCode</language>
  <MetadataType normdata="true">
    <language name="de">Besitzer</language>
    <language name="en">ProveniencePrevOwner</language>
    <language name="de">Characteristik</language>
    <language name="en">ProvenienceCharacteristic</language>
    <language name="de">Datum</language>
    <language name="en">ProvenienceDate</language>
    <language name="de">Erläuterung</language>
    <language name="en">ProvenienceExplanation</language>
    <language name="de">GND</language>
    <language name="en">ProvenienceGND</language>
    <language name="de">Provenienz</language>
    <language name="en">Provenience</language>

Further adjustments are not necessary. The plugin itself is part of the collection goobi-plugin-opac-marc.jar and can be accessed via the name HaabProvenienceOpac. An activation within the configuration file goobi_opac.xml is not possible, because the plugin only delivers the provenance metadata and no structural elements are created. To integrate it into the workflow as a step after the initial import, the step plug-in intranda_step_provenience_request from the goobi-plugin-step-catalogue-request repository is also required.

Import EAD records directly into Goobi

Goobi is now able to import individual records from EAD find books. To do this, the EAD data records available in XML are periodically imported into the BaseX XML database. From now on, this database serves as a dynamically updated EAD memory and can be queried with the help of a special OPAC plug-in via the RESTXQ query language instead of a typical library catalog.

The special feature compared to a normal XML-based import is that the tectonics of the find book are preserved, since they are also stored within the XML database. If the tectonics is changed and the previously imported data record is moved to another structure, then only the EAD file must be read into the XML database in updated form. Since the identifiers of the data records are retained, presentation software such as the Goobi viewer, which also has access to this EAD memory, can query the current status of the tectonics live.

Further detailed information about the implementation including installation instructions can be found here:

Display metadata within tasks and in task details

Goobi workflow has been enhanced to allow you to display specific metadata from the METS file within individual accepted tasks as well as properties.

The display of this metadata integrates itself into the range of properties that can also be displayed and edited and can be individually configured for each task of individual workflows. Such a configuration looks like the following within the file goobi_processProperties.xml:

<metadata name="CatalogIDDigital">
    <showStep name="Print docket" />
<metadata name="DocLanguage">
    <showStep name="Print docket" />
<metadata name="TitleDocMain">
    <showStep name="Print docket" />

The display of descriptive metadata is also possible within the process details. For this purpose, a new area has been created below the tasks that lists these metadata.

With the help of this new functionality and future developments, redundant storage of some metadata will become superfluous in the future. From now on, information that is currently still available as a property as well as within the METS file can only be stored in the METS file and read from there. It is important that the checkbox Update metadata index when finishing is activated within the individual tasks of the workflow to ensure efficient caching of the data from the METS file for display.

For a complete update of the cache for the fast display of this metadata within the web interface, the existing GoobiScript updateDatabaseCache can of course also be called for several processes simultaneously.

Last updated