April 2020
Developments and innovations to Goobi workflow in April 2020
In the coming weeks we will probably focus on the following points:
- Refactorization of the LayoutWizzard plugin for automatic image cropping
- Conversion of the Goobi code to use CDI
The LayoutWizzard as a plugin for convenient cropping of images is becoming more and more popular. This was accompanied by the wish that its use should also be possible within the Amazon Cloud. For this reason, LayoutWizzard has now been revised so that it can be used on the S3 storage system in addition to a regular file system, just like Goobi workflow.

The LayoutWizzard plugin for cropping images using the example of double pages
For more information about the LayoutWizzard, please refer to the detailed documentation for operation and configuration options:
https://docs.goobi.io/goobi-workflow-plugins-en/step-plugins/layoutwizzard
docs.goobi.io
https://docs.goobi.io/goobi-workflow-plugins-en/step-plugins/layoutwizzard
Goobi workflow has been supporting the handling of 3D objects for several months now. This has already proved to be very useful for various data formats. However, handling very large files and their high-performance delivery and display was repeatedly a particular challenge. With the newly added functionality to support
DRACO
compression, the visualisation of qualitative 3D models has now become much better, as the file sizes are considerably smaller than in the previously supported formats.More information about compression using DRACO can be found here:
There is no significant change for the user in handling these new and already supported 3D formats. Objects can still be displayed, zoomed and rotated within the QA plug-in as well as within the METS editor as before.

3D objects visualised within the METS editor of Goobi workflow
Within the METS Editor, a user has also been able to manipulate the image stack for a long time, for example, to change the order of the files, delete files or even rename them. In the past, however, the operation of these functions was sometimes problematic, because after changes to the image stack, the METS file always had to be saved as well. With the revisions we have now made in this area, this is no longer necessary. Changes to files within the METS Editor are now always saved in the METS file as well, so that the METS file can no longer become asynchronous to the file names. In addition, we have conducted extensive testing to ensure that changes made will always affect all folders. For example, when renaming or deleting master files, derivative files, thumbnails, OCR results, LayoutWizzard files and also PDF files are immediately changed so that the changes made have the same effect in all versions of the digitised material.

Manipulations to the image stack in the METS Editor immediately affect all folders and the METS file
There was an error within the GoobiScripts
addStep
that prevented its execution. The GoobiScript has now been revised and is working again.
The GoobiScript createStep and its associated parameters
Over the last few weeks, we've worked hard to ensure that Goobi workflow now supports audio and video data directly within the METS Editor and also displays and plays back such files. In this way, Goobi now provides even better support for handling audio and video material and also generates the correct file groups for exporting METS files, allowing published digitised material to be displayed directly in the Goobi viewer, for example.

Playback of audio material within the METS editor of Goobi workflow

Displaying video material within the METS editor of Goobi workflow
To ensure that Goobi handles audio and video data correctly, some minor adjustments are required within the ruleset used to enable
audio
and video
within an object in addition to pages
, image areas
and 3D objects
. An example of such an adjustment in the ruleset is shown below:<DocStrctType>
<Name>page</Name>
<language name="de">Seite</language>
<language name="en">Page</language>
<language name="rusdml">page</language>
<language name="es">Página</language>
<allowedchildtype>area</allowedchildtype>
<metadata num="1m">logicalPageNumber</metadata>
<metadata num="*">_ucc_id</metadata>
<metadata num="1m">physPageNumber</metadata>
<metadata num="1o">_urn</metadata>
</DocStrctType>
<DocStrctType>
<Name>audio</Name>
<language name="de">Audio</language>
<language name="en">Audio</language>
<metadata num="1o">_urn</metadata>
<metadata num="1m">logicalPageNumber</metadata>
<metadata num="1m">physPageNumber</metadata>
</DocStrctType>
<DocStrctType>
<Name>video</Name>
<language name="de">Video</language>
<language name="en">Video</language>
<metadata num="1o">_urn</metadata>
<metadata num="1m">logicalPageNumber</metadata>
<metadata num="1m">physPageNumber</metadata>
</DocStrctType>
<DocStrctType>
<Name>object</Name>
<language name="de">3D Objekt</language>
<language name="en">3D Object</language>
<metadata num="1o">_urn</metadata>
<metadata num="1m">logicalPageNumber</metadata>
<metadata num="1m">physPageNumber</metadata>
</DocStrctType>
<DocStrctType>
<Name>area</Name>
<language name="de">Bildbereich</language>
<language name="en">Area</language>
<metadata num="*">_ucc_id</metadata>
<metadata num="1o">_urn</metadata>
<metadata num="1o">_COORDS</metadata>
<metadata num="1o">_SHAPE</metadata>
<metadata num="1m">physPageNumber</metadata>
<metadata num="1m">logicalPageNumber</metadata>
</DocStrctType>
<DocStrctType>
<Name>BoundBook</Name>
<language name="de">Gebundenes Buch</language>
<language name="en">Bound Book</language>
<allowedchildtype>page</allowedchildtype>
<allowedchildtype>audio</allowedchildtype>
<allowedchildtype>video</allowedchildtype>
<allowedchildtype>object</allowedchildtype>
<metadata num="1o">PhysicalLocation</metadata>
<metadata num="1o">datedigit</metadata>
<metadata num="1o">shelfmarksource</metadata>
<metadata num="1o">mediumsource</metadata>
<metadata num="1o">FormatSourcePrint</metadata>
<metadata num="*">_ucc_id</metadata>
<metadata num="1o">shelfmarkarchiveimageset</metadata>
<metadata num="1o">pathimagefiles</metadata>
<metadata num="1o">imagedescr</metadata>
<metadata num="1o">copyrightimageset</metadata>
<metadata num="1o">_urn</metadata>
<metadata num="1o">_representative</metadata>
</DocStrctType>
Depending on which audio and video formats you are working with at this point, they can be played back within the web browser without the need for any additional plugins. However, as this varies from browser to browser, it has proven to be best to use the derivatives to mp3 for audio files and mp4 for video files. The following is a list of which browsers directly support which file formats:
Video formats:
Browser | MP4 | WebM | Ogg |
Edge/Internet Explorer | YES | NO | NO |
Chrome | YES | YES | YES |
Firefox | YES | YES | YES |
Safari | YES | NO | NO |
Opera | YES | YES | YES |
Audio formats:
Browser | MP3 | WAV | OGG |
Edge/Internet Explorer | YES | NO | NO |
Chrome | YES | YES | YES |
Firefox | YES | YES | YES |
Safari | YES | YES | NO |
Opera | YES | YES | YES |
The already existing plugin for the uniform adaptation of anchor data sets was thoroughly revised and documented. It allows you to select a central Goobi process, e.g. for a multi-volume work or a journal, which should form the master process for all other Goobi processes. Changes within its anchor file in the Mets Editor therefore mean that all the associated Goobi processes can be updated with the identical data in the central anchor record.

Interface of the administration plugin for adopting the data in the anchor record as the master for all associated Goobi operations
Besides some small optical adjustments, a detailed documentation has been created for this plugin. This is available here:
https://docs.goobi.io/goobi-workflow-plugins-en/administration/plugin_intranda_administration_copyanchor
docs.goobi.io
https://docs.goobi.io/goobi-workflow-plugins-en/administration/plugin_intranda_administration_copyanchor
The City Archive Duderstadt was very progressive very early on and in the 1990s digitized the entire archive and created an enormous data stock including metadata and transcriptions. We have been working with this data for a long time and have now imported all data. The proprietary data was converted into an EAD format and the individual objects were described in a METS file, so that all deeply indexed metadata can now be republished.

Finding aid from the Stadtarchiv Duderstadt within the METS-Editor
The entire data stock of the Duderstadt city archive has also already been prepared for publication within a Goobi viewer and is already usable as a prototype. Before the URL is announced, however, some primarily visual work needs to be completed here. This will take place relatively soon, however, and will then be documented here once again.
Goobi workflow has long had an extendable REST interface to allow external systems to communicate with Goobi. The existing documentation for this within the manual has been revised. This means that an up-to-date version of the documentation on how to activate the interface for selected endpoints specifically for selected IP addresses is now available at the following address
https://docs.goobi.io/goobi-workflow-en/admin/7/7.5-goobi_rest.xml
docs.goobi.io
https://docs.goobi.io/goobi-workflow-en/admin/7/7.5-goobi_rest.xml
Last modified 2yr ago