OCR page selection

This is the technical documentation for the Goobi plugin that selects individual pages to include in or exclude from OCR processing.

Introduction

This documentation describes the installation, configuration and use of a page selection plugin for downstream OCR processing. The plugin lets you determine, page by page, which images of a process are sent to OCR and which font is used for each of them.

Precondition

The plugin requires Goobi workflow version 3.0.4 or higher. It must be correctly installed and configured, and correctly integrated into the desired work steps of the workflow. In addition, a separate plugin is required for the actual OCR process and for merging the results.

Installation and Configuration

The following files must be installed to use the plugin:

/opt/digiverso/goobi/plugins/step/plugin_intranda_step_ocrselector.jar
/opt/digiverso/goobi/static_assets/plugins/intranda_step_ocrselector/css/style.css
/opt/digiverso/goobi/static_assets/plugins/intranda_step_ocrselector/js/app.js
/opt/digiverso/goobi/static_assets/plugins/intranda_step_ocrselector/js/riot.min.js
/opt/digiverso/goobi/static_assets/plugins/intranda_step_ocrselector/js/tags.js
/opt/digiverso/goobi/static_assets/plugins/intranda_step_ocrselector/js/ugh.js

The first file is the Java part of the plugin; all other files are needed for the graphical display. The plugin has no configuration of its own. Instead, it reads the default value for all pages from the metadata of the process.
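To confirm that all of the files listed above are in place, a short check like the following may help. This is only a minimal sketch: the function name `missing_plugin_files` and the default base directory argument are illustrative assumptions and not part of the plugin or of Goobi workflow. Only the file paths themselves are taken from the list above.

```python
from pathlib import Path

# Paths copied from the file list above, relative to the Goobi base directory.
REQUIRED_FILES = [
    "plugins/step/plugin_intranda_step_ocrselector.jar",
    "static_assets/plugins/intranda_step_ocrselector/css/style.css",
    "static_assets/plugins/intranda_step_ocrselector/js/app.js",
    "static_assets/plugins/intranda_step_ocrselector/js/riot.min.js",
    "static_assets/plugins/intranda_step_ocrselector/js/tags.js",
    "static_assets/plugins/intranda_step_ocrselector/js/ugh.js",
]


def missing_plugin_files(goobi_root="/opt/digiverso/goobi"):
    """Return the required plugin files that do not exist under goobi_root.

    Hypothetical helper for illustration; it only checks for file presence.
    """
    root = Path(goobi_root)
    return [str(root / rel) for rel in REQUIRED_FILES if not (root / rel).is_file()]


if __name__ == "__main__":
    missing = missing_plugin_files()
    if missing:
        print("Missing plugin files:")
        for path in missing:
            print("  " + path)
    else:
        print("All plugin files are installed.")
```

Run on the Goobi server, the script lists any file that is absent. It does not verify file versions or permissions.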

Settings in Goobi

Usage
