Metadata Cleaning
Goobi Step Plugin for Manipulating and Cleaning Metadata for the City Archive Kiel
Overview
Identifier
intranda_step_kiel_archive_cleanup
Repository
Licence
GPL 2.0 or newer
Last change
25.07.2024 11:57:14
Introduction
This documentation describes the installation, configuration and use of the Step Plugin for cleaning metadata for the Kiel City Archive. In concrete terms, this is about the automated evaluation of metadata that are to be separated into individual fields, such as information on the scales of historical maps. Furthermore, this plugin also copies the corresponding image files into the master folder of the respective process.
Installation
The plugin consists of the following files to be installed:
The first file must be installed in the following directory:
In addition, there is a configuration file that must be located in the following place:
Overview and functionality
The plugin is usually executed fully automatically within the workflow. It first determines whether there is a block in the configuration file that has been configured for the current workflow with regard to the project name and work step. If this is the case, the METS file is opened and the necessary changes to the METS file take place. Finally, the plugin determines those images that contain a prefix in the file name, which is also read from the METS file in order to copy them into the master folder of the workflow.
This plugin is integrated into the workflow in such a way that it is executed automatically. Manual interaction with the plugin is not necessary. To use it within a step of the workflow, it should be configured as shown in the screenshot below.
Configuration
The configuration of the plugin is done via the configuration file plugin_intranda_step_kiel_archive_cleanup.xml
and can be adjusted during operation. The following is an example configuration file:
project
This parameter determines for which project the current block <config>
should apply. The name of the project is used here. This parameter can occur several times per <config>
block.
step
This parameter controls for which workflow steps the block <config>
should apply. The name of the workflow step is used here. This parameter can occur several times per <config>
block.
<importFolder>
This parameter specifies the directory from which the images are to be copied.
<fieldForImagePrefix>
This parameter controls which metadata of the METS file is decisive for the selection of the images to be copied as prefix.
<stepToSkipIfImagesAvailable>
Here you can define how the workflow should behave in case of missing images.
<size>
Determination of the scale field to be evaluated.
<sizeWidth>
Definition of the field to be generated for the width
<sizeLength>
Determination of the field to be generated for the length
<sizeScale>
Determination of the field to be generated for the scale
Last updated