Loading...
Loading...
Loading...
Loading...
Loading...
Loading...
Loading...
Loading...
Loading...
Loading...
Loading...
Loading...
Loading...
Loading...
Loading...
Loading...
Loading...
Loading...
Loading...
When you have completed the task of pagination, the next step in the workflow involves structuring.
Working again in the Metadata Editor, the aim is to identify and record all the relevant structure elements of the material in a tree-like list. In this step of the workflow, all the structure elements to be indexed for that source material are marked using as precise a descriptor as possible and allocated to the corresponding structure element within the hierarchy. By way of example, for a monograph this can mean that several chapters, a foreword, an afterword, an index and other structure elements can be added under the structure element Monograph
. Further sub-elements can then be added under these structure elements and so on. In fact, the Metadata Editor allows you to create a hierarchy of elements with as many levels as you wish, including sub-chapters within chapters.
To start the task of organising the structure elements using the Metadata Editor, first select the Structure data
option from the navigation bar. Goobi will display an overview of the available options. It is important to remember that all changes to the structure data depend on which structure element is currently selected in the structure tree on the left (highlighted in bold).
This section explains a number of additional functions in the METS Editor that involve directly modifying the file system for the images. For this reason you should take great care when using the functions described below.
Pagination is a key element of digitisation projects. It involves matching page labels
(printed page numbers in the source material) to the scanned images. In older works, the page numbering often changes within the actual source text. Often, there is no printed page number, or the same page number is used more than once, and sometimes the pagination can change several times within a single work. A typical example of altered pagination occurs when the preface or introduction makes use of Roman numerals only to be replaced by Arabic numerals after the index.
To edit the pagination for the source work, open the pagination area in the Metadata Editor by clicking on the option in the navigation bar. Goobi will calculate the number of images in the current process folder and produce a vertical list. The box immediately to the right is used to allocate a pagination number to each image. As you can see below in the box entitled Page selection
, Goobi has initially determined that all the pages should be numbered consecutively using Arabic numerals, beginning with the number 1.
Using either the keyboard combinations or the image navigation bar above the scanned image, you can now navigate through the entire set of images to obtain the printed page number for each image. In most cases, the first few pages of books do not contain a page number. Click the first checkbox for image 1 in the Page selection
box. Next, in the Define pagination
box, select unnumbered
from the drop-down list and then click on the link From first selected page
. For pagination purposes, you have now told Goobi to treat the volume as though it did not contain any printed page numbers. Next, move to the first image that shows a printed page number.
Tip: While you are navigating through the images, if you come across a page where the pagination changes, just click on the image. This will automatically select the corresponding checkbox in the Page selection
list. To actually mark the checkbox, press the space bar on your keyboard. Alternatively, you can of course identify the corresponding image in the Page selection
list and then mark the checkbox using your mouse.
When you are selecting pages, please remember that you must go by the image number and not by the printed page number in the source. In Goobi, you can always tell which is the image number. Two numbers are always shown for each image, separated by a colon. The number on the left is the number of the file within the file system, i.e. the image number. The number to the right of the colon is the printed page number in the source, known as the page label. You should always take the image number (to the left of the colon) as your guide to avoid accidentally choosing the wrong pages. It is worth noting that in some source material the same printed page number (page label) can occasionally be used more than once.
Once you have identified the page where the pagination of the source material changes and marked the corresponding checkbox, choose the required pagination type in the Define pagination
box. Next, enter the current page number/label and click once again on the link From first selected page
. This will instruct Goobi to allocate the required page number to each image, beginning consecutively with the page you have selected.
Sometimes, the pagination of the source material can change more than once. Pages without a page number and repeated instances of the same page number occur frequently. Pages on which a new structure element begins (e.g. new chapters) but that do not contain a printed page number (although page numbering is continued on subsequent pages) are marked with a simulated pagination. This simulated pagination can be recognised in that a logical numbering is assumed despite the absence of a page number, and consequently the assumed page number is shown in brackets.
Example: If a chapter starts on a page without a printed page number and the next page contains the printed page number 4
you would give the first page of the chapter a simulated (i.e. assumed) page number, in this case a number 3. Simulated page numbers are based accordingly on an interpretation derived from the missing page identifier and the available page identifiers on subsequent pages of the source. Simulated page numbers are shown in square brackets. In this example, the designation used for the first page of the chapter would therefore be 3
.
For efficient pagination, use the keyboard combinations described in section Overview of the keyboard combinations. Especially if the source is particularly large and you cannot or do not wish to check every page, you can ensure that your pagination is actually correct by checking a selection. In most cases, it is relatively easy to detect mismatches in the pagination. To do so, use the keyboard combinations to navigate 20 pages at a time through the set of images, comparing the printed page number on your selected page with the pagination numbers allocated through the automatic consecutive numbering option. You can identify mismatches where the pagination sequence no longer coincides with the sequence of printed page numbers.
Alternatively, you can at any time click the icon next to the page number in the Page selection
box to display the corresponding image in the image display area on the right.
Goobi supports different page numbering methods for pagination purposes. As well as allocating Arabic and simulated Arabic page numbers, Roman and simulated Roman page numbers, free text and unnumbered pages, you can also specify the sequence to be used for consecutive numbering. Using the symbols in the Define pagination
box, you can determine how the page sequence in the book should actually appear.
Goobi supports the following page sequences:
Pagination types supported by the Metadata Editor
Tip: If you want to make a change within an existing pagination sequence, you can use one of Goobi’s features that that does not automatically overwrite all the subsequent pages using automatic numbering. To do so, select one or more pages in the Page selection
_**_box, enter the required numbering and pagination type and then clink on the link entitled Only the selected pages
. In this way, only the selected pages will be affected by the change.
There are several ways in which you can subsequently modify the structure of a document.
One of these involves moving structure elements to a different position. To do this, first select the structure element from the structure tree. Next, in the Selected docstruct
box, select the function Move docstruct up
or Move docstruct down
to move the selected element by one position in the required direction.
Icon | Description |
---|---|
However, if you want to move the structure element to a different position within the hierarchy, you will need to select the function Move docstruct to other location
. This will open a dialogue box in which you can specify exactly where you want to place the structure element. You will be able to select only from those locations permitted by the Goobi ruleset.
Icon | Description |
---|---|
There are various keyboard combinations available in the metadata editor. These are summarised here once again:
Keyboard combination | Description of function |
---|
Icon | Description |
---|---|
Icon | Description |
---|---|
Pagination type | Description of pagination type |
---|---|
Page numbering
A page number is printed on each page.
Column numbering
Each page contains two individually numbered columns. Accordingly, each page will have two pages numbers. In the Page selection
box, you will notice that in such cases the consecutive numbering advances in twos, since each column on each page is allocated a number.
Sheet numbering
In the case of sheet numbering, each physical sheet in the book is given a sheet number. Unlike the page numbering method, no distinction is made between verso and recto. Accordingly, page numbers are not allocated per image. Automatic consecutive numbering is based on every second image. Two images are allocated to each page number before adding the next page/sheet number in the sequence.
Recto/verso sheet numbering
Using this method, page numbers are allocated as for sheet numbering
, except that each page number is followed by an r for recto or v for verso.
Recto/verso page numbering
Here, pages are allocated as for page numbering (see above) with the addition of an r
for recto or v
for verso after the number.
| Move to previous image |
| Move to next image |
| Move 20 image forward |
| Move 20 images back |
| Move to first images |
| Move to last image |
| Activate or deactivate the checkbox for the currently displayed image for pagination. |
| Save the METS file |
| carry out the validation |
Display image for the selected page
Selection of pagination type
Move the structure element one place higher in the same hierarchical level
Move the structure element one place lower in the same hierarchical level
Move the structure element to another location in the hierarchy
Goobi allows you to copy structure elements from the METS file of one process to another process. To do this, in the Selected docstruct
box, select the function Import structure elements from another process
. Goobi will display a tree view from which you can copy the required structure. You can then specify the target location within the current METS file.
First enter the name of the process from which you want to obtain the data, then click Search for process
.
You can now choose one or more structure elements from the selected process by clicking on them. Once you have selected the required elements, click on the Next
button to return to the structure tree for the process you are currently working on.
In this view, you can now select the structure element into which you want to insert the structure elements you have just copied from the other process. Please note that you can only select a target structure element that is allowed by the ruleset to contain the copied elements. Also, this function allows you to copy only logical structure elements from another process. This means that you cannot use this function to copy images and page allocations.
This function can be used to download a file from the current process folder to the user’s own computer. The user can select the file to be stored locally using the computer’s file browser.
The metadata editor has an editor for OCR results with which the ALTO XML files in the process can be edited. This editor can be accessed via a button in the upper menu bar:
The ALTO editor overlays the usual interface of the metadata editor. The interface consists of two main elements. The digital copy is displayed on the left and the automatically recognised OCR text on the right. If you move the mouse over the image or over the text on the right, the recognised line and the recognised word are highlighted in the image:
The editor allows you to correct the words recognised by the OCR. To do this, simply click in the text on the right and change the words as you are used to doing with text editors. When you edit a word, it is also marked on the left of the digital copy. If a word is difficult to read, you can zoom in the image display with the mouse wheel. In addition, the page to be edited can be selected via the navigation below the image.
To save the results, just click the green button at the bottom right. To undo all changes, the editor can simply be closed.
The main area of the Metadata Editor also contains a navigation bar to perform a range of steps during metadata indexing. Selecting Pagination
, Structure data
, Metadata
and File replacement
allows you to switch between the individual editing modes in the Metadata Editor. Each of these modes is explained in greater detail below and should be performed in the specified order.
If you wish, you can modify previously created pagination sequences. Goobi allows you to move or delete individual pages or several pages at the same time. To do this, you first need to select the required pages in the Page selection
box. Once you have highlighted at least one page in this way, you will be able to choose from the following functions:
Icon | Description |
---|---|
Regardless of whether or not you select a page, you can also use the following function:
Icon | Description |
---|---|
If one or more pages are moved up, each page will displace its predecessor in the pagination sequence and will take over all the settings and page allocations of the predecessor. If a page is moved down, its original place it taken by the next page in the pagination sequence.
If the changes affect the image currently being displayed, the image number in the page display is updated automatically. There is no change in the displayed image.
As well as moving files, you can also delete selected files. This action deletes the selected page completely from the metadata file. It affects both filename allocation and allocation to structure elements or allocated metadata.
The file is also deleted from all the process folders. You can search for files that you wish to delete from all the process folders using the selected filename. Goobi will search the available folders for files that match the filename (but not necessarily the file extension). The file will be deleted in all the subfolders of the ocr
and images
directories. If the currently displayed image has been deleted, Goobi will display the first image. If not, the image number will be updated.
Goobi also allows you to generate new filenames so that systems that are not based on the sequence of filenames in the METS file can continue to display the images in the correct sequence. All the files are numbered using an eight-digit system based on the sequence in the METS file. The files will be renamed in all the process folders.
Goobi’s Metadata Editor is one of the core elements of digitisation projects. Most work involving the Metadata Editor is performed by trained librarians. There are two ways to open the Metadata Editor. Administrators and project managers can open the screen for processing structure data and metadata at any time from any individual process listed in the process view page, regardless of its workflow status.
Users with a more restricted level of authorisation can access the Metadata Editor through the My tasks
area but can only perform those workflow steps for which they have been designated as the responsible user.
Once you have opened the Metadata Editor, you have full access to all the editing options relating to the pagination, structure data and metadata of the digitised material. The Metadata Editor is divided into a number of sections.
The grey-shaded area on the left of the screen contains the structure tree, where you can see in hierarchical form all the structure elements that have already been obtained from the source material. When you select a structure element, it will appear in bold in the tree view. For each structure element, the descriptor is based on the type chosen for that element.
Icon | Description |
---|---|
Click on the icon just in front of the structure element to expand or collapse individual sections of the structure tree. If you hold the cursor over the small symbol next to each structure element, you will see a pop-up with further details of that structure element without having to open it.
The list of elements from which you can select in the New structure element box will vary depending on which structure element has been activated. You can choose the position where you want to insert the new structure element from the following options:
Positions for new structure elements
Position | Description of position for the new structure element |
---|---|
The list of structure elements from which you can select will vary depending which of the four position options you choose. The range of elements available for selection depends on how the Goobi administrator has specifically defined the system for the current project using the freely configurable rulesets. Within certain structure elements, this means that structure data and metadata can be specified that are admissible within a given hierarchy. For example, within the structure element Monograph
it is not possible to assign the structure element Periodical article
, although you could assign a Chapter
, which is a perfectly normal component of a monograph. Depending on this configuration, the list display will be based on the structure element currently selected in the left-hand navigation tree.
The fastest way of adding new structure elements in Goobi is to choose the option As last sub-element
As last sub-element First page
input box. Move forward until you come to the last page for the structure element you wish to add, and this time click on the image symbol next to the Last page
input box. Clicking the image symbols instructs Goobi to apply the page number currently selected on the right of the image display. Using the little arrows on either side of the image icon, you can enter the page number for the preceding or following page. This allows you to allocate pages without spending an inordinate length of time moving between pages in the image display section. Next, click on Add structure element
to insert this new structure element as the last sub-element within the current structure element. This is an efficient method of adding structure elements.
If you want to create additional sub-elements as part of a structure element that you have already added, first select the existing element to which you want them to belong. You can now use the same method as described above by adding each new structure element as the last sub-element.
This single step of the workflow therefore involves providing two important pieces of information. As well as identifying and recording the digital structure of the source material (to reflect its logical structure in the form of chapters, sub-chapters, forewords, afterwords, indices and any other structure elements that can be configured individually in your Goobi installation), when you add structure elements you are also providing the corresponding page ranges for each one of those elements.
Clicking on the small arrow symbols next to the First page
and Last page
text boxes instructs Goobi to set these as the beginning and end pages for that structure element. As the metadata file grows in the background, it will eventually be perfectly clear which page ranges correspond to which structure elements. If the digitised material is later made available for viewing, e.g. on a website, researchers will then be able to display all the pages corresponding to a particular sub-element, and sub-chapters downloaded in pdf format will always contain the correctly allocated pages.
For some structure elements, Goobi will display input fields below the selected type so that metadata can be allocated directly to the structure element. Which fields are displayed (e.g. main title, keywords) will depend on the structure type that has been selected and on the ruleset configurations in Goobi.
Goobi offers a wide range of functions that simplify the task of working with structure data and metadata. These options can be found in the top menu bar and at the right-hand edge of the structure tree pane in the Metadata Editor.
The following functions are available from the top menu bar. The corresponding icons can be found on the right-hand side of the bar.
At any time while you are working with the Metadata Editor, you can use the Validate
option to check whether you have complied with the rules provided by Goobi for the structure data and metadata. These rules are defined individually for each project by the Goobi administrator. Before closing the step you are working on, Goobi will conduct a further validation check. If this validation fails, Goobi will not let you close the task. It is therefore advisable to perform regular validation checks in the Metadata Editor to ensure that you have complied with the rules. If you click Validate
and Goobi then determines that your metadata and structure data do not comply with the established rules, it will display an explanatory warning message to that effect.
Icon | Description |
---|---|
By choosing this function you can prevent the image from scrolling in the Metadata Editor when you are dealing with long lists of metadata or during the allocation of pagination sequences. If you activate this function, the selected image will remain in place on the right hand side of the window together with its navigation bar and the zoom options, even if you scroll up or down on the right of the Metadata Editor.
Icon | Description |
---|---|
This menu bar icon provides several options for additional settings. Simply click the icon to display the available functions.
Icon | Description |
---|---|
The Save
option allows you to save your work in the Metadata Editor at any time. You should always click on this button when you want to make sure your data cannot be lost.
Click this menu bar icon to exit the METS Editor. You then have two options.
The Exit
option exits the Metadata Editor without saving your work, so you should usually ensure that you save your data before selecting this option. This function can also be used, for example, to exit the Metadata Editor without having to save your work in the event that you unintentionally delete or change the data.
The option Save and exit
saves all your work and closes the Metadata Editor. Goobi will then return you to the screen from which you launched the Metadata Editor. If you accessed the Metadata Editor from My tasks, for example, you will find yourself back in that area. If you launched the Metadata Editor from the Processes area, selecting Save and exit will automatically return you to the same point in that area.
If you select the menu bar icon Settings, Goobi will display a drop-down list containing several additional functions.
Click on this function to display information about the hierarchical level of each structure element in the structure tree on the left. This function is particularly useful in cases where the hierarchy of the structure tree contains many levels. An indication of the hierarchical level can help you determine which level you are currently viewing.
Using this function, you can switch between the fully expanded and fully compressed versions of the structure tree on the left hand side of the Metadata Editor. This allows you to fully expand even the most complex structure trees with a single click without having to click and expand each section within the tree individually.
You can select the Hide image
option at any time to prevent the image from being displayed. Conversely, the Show image
option causes the image to be shown again after being hidden.
This option allows you to recommence pagination. You may wish to use this function to reset all the pagination data and start indexing the pagination sequences again from the beginning.
This function allows you to work with the physical structure tree rather than the logical structure tree. This allows you, for example, to record metadata for individual pages or edit metadata stored in the physical metadata area.
The structure tree pane contains a number of additional icons/functions that can be used to modify the way the structure tree is displayed.
The function gives you an option to link the structure tree on the left of the Metadata Editor to the image. If this function has been activated, the image display will change to the start page of the structure element you have selected in the structure tree.
If you activate this function, Goobi will display the names given to individual structure elements directly within the structure tree. This may give you a better overview, e.g. to help you locate chapters by their name in the structure tree.
Select the Tree page number
to display the page ranges allocated to each structure element in the structure tree on the left. The first and last page of the range will appear in brackets next to each structure element. Each page range displayed contains the image number and the page label (printed page number).
Select this icon if you wish to print out the structure tree.
This function can be used to upload a file from the user’s computer to a selected process folder.
The user will need to specify a file and choose the position within the pagination sequence where the new file is to be inserted. Next the user can choose whether this new file is to be inserted as an uncounted
page or whether it should be integrated into the existing pagination. Click the Upload file
button to insert the file into the currently selected process folder. If this is the media folder, the pagination sequence will be updated. The file will be inserted in the selected position and either created as a uncounted page or integrated into the existing pagination sequence. If the file is integrated, the page number for each of the following pages will increase by one. In this context, however, it is important to note that the last file in the pagination sequence is not allocated a page number. It remains unpaginated and if necessary will again have to be changed separately in the pagination sequence.
If there is already a file in the selected folder with same name as the file being updated, the file cannot be uploaded and the user will receive an error message.
Icon | Description |
---|---|
Icon | Description |
---|---|
Icon | Description |
---|---|
Icon | Description |
---|---|
Icon | Description |
---|---|
Icon | Description |
---|---|
Icon | Description |
---|---|
Icon | Description |
---|---|
Icon | Description |
---|---|
Icon | Description |
---|---|
Icon | Description |
---|---|
Icon | Description |
---|---|
Menu item to help users navigate to and select the first or last page of a new structure element
Set the current image as the first or last page
Set the previous image as the first or last page
Set the next image as the first or last page
Validate metadata against designated ruleset
Fix the image display to avoid image scrolling
This menu bar icon allows you to choose various settings from a sub-menu.
Save the current structure data and metadata
Verlassen des METS-Editors mit oder ohne vorheriges Speichern
Display the hierarchical level in the structure hierarchy
Fully collapse the structural hierarchy
Hide image / Show image
Reset pagination completely
Show physical structure tree / Show logical structure tree
Link the image with the selected structure element
Show main title in the structural hierarchy
Display the assigned pages in the structure hierarchy
Print the structure element hierarchy
Move the selected page up
Move the selected page down
Delete the selected page
Generate new filenames
Symbol for collapsing and expanding the structure element hierarchy
Before current element
Choose Before current element
to insert a structure element immediately before the currently selected element.
After current element
Choose After current element
to insert the new structure element on the same hierarchical level directly after the current element.
As first sub-element
Choose As first sub-element
to insert the new structure element as a sub-element within the hierarchy before any other sub-elements that are already in place within the currently selected structure element.
As last sub-element
Choose As last sub-element
to insert the new structure element as a sub-element of the current element. The new structure element will be inserted after any other elements that are already in place.
The right-hand side of the Metadata Editor gives you an overview of the individual digital images that form part of the current process together with the number of pages, the current magnification level, the currently displayed image number and information about which derivative of the available digitised material you are currently viewing.
You can move between individual pages using the Forward
and Previous
links just above the image. You can also move quickly to the previous and next images by selecting them directly.
The current page is shown in the middle of the page range with the previous and next two pages on each side (provided they are available for display). Clicking on any of the pages will take you directly to the corresponding scanned image. Goobi also features a number of keyboard combinations for repeated navigation between different pages of the source material.
Keyboard combination for navigating between images in the Metadata Editor
Keyboard combination | Description of function |
---|---|
Using these keyboard combinations allows you to move quickly and easily through the digitised material, even over large areas. Another navigation option allows you to move directly to a specific image by entering that image number in the Go to image
box and pressing return. Goobi will then automatically display the requested image.
In addition to all the above navigation options, you can also change the way Goobi displays the scanned image. To do so, just click on either magnifying glass symbol below the page navigation bar to increase or reduce the magnification/zoom level. The current magnification is shown between the two magnifying glass symbols. If you want to select a particular level of zoom, simply click between the two magnifying glass symbols, enter a figure and press the Enter key to confirm.
Goobi also allows you to rotate the image in 90 degree stages to view information that can only be read in landscape format. To do so, just click on the arrow pointing to the right to rotate the image clockwise and on the arrow pointing left to rotate the image anti-clockwise.
Underneath the image you will find an option to select from different derivatives of the image (where available). In the drop-down list entitled Folder, Goobi lists all the image folders linked to the currently selected process. For example, if you already have a number of derivative images for a specific process (e.g. master images, scaled or compressed versions of images or derivatives with a different tone such as bitonal images), you can switch between these derivative images simply by selecting the corresponding folder.
Within the METS Editor you can specify a particular image that can be used to represent the entire work. This is usually the title page. Most digitisation portals (e.g. Europeana, ZVDD, VD18) and the Goobi viewer use the first image in the METS file as the representative image, unless a different image has been specifically chosen for that purpose.
In order to specify the representative image, first select the required image from the available list. Click on the corresponding icon make this the representative image. Within the METS file, the selected image will now have the attribute USE=“banner“
.
Provided that the server-based target folder is not empty, you can use the Server-based import
box to select a folder from which previously exported images can now be imported into the current process.
In order to do this, you will need to specify the point in the pagination sequence where you want to insert the new file. You can then choose whether the new files are to be inserted as uncounted
pages or whether they are to be integrated into the existing pagination.
For every subfolder in the target folder, Goobi will then search for a counterpart in the current process folder. The selected files are then imported to this folder. Once the files have been successfully imported, the subfolder in the target folder is deleted.
A subfolder entitled fileupload
is automatically generated on the Goobi server in the configured folder named tempfolder
. This subfolder is used to exchange files between processes on the server.
In this area of the METS Editor, you can select one or more pages for export to the exchange folder on the server. A subfolder with the current process title is created in the exchange folder. Goobi will search for the selected pages in all the image directories (derivatives) for the current process and export them to the process subfolder in the exchange folder.
You can also choose to delete all the selected pages from the current process when you export them. This deletion will also be applied to the filename allocation, all allocations to structure elements and all the allocated metadata. The file will also be deleted from all the process folders.
If the currently displayed image has been deleted, Goobi will display the first image for that process. If not, the image number will simply be updated.
Icon | Description |
---|---|
Rotate the displayed image 90 degrees to the left or right
Ctrl
+ Shift
+ Cursor left
Move to previous image
Ctrl
+ Shift
+ Cursor right
Move to next image
Ctrl
+ Shift
+ Cursor up
Move 20 images forward
Ctrl
+ Shift
+ Cursor down
Move 20 images back
Ctrl
+ Shift
+ Pos 1
Move to first image
Ctrl
+ Shift
+ End
Move to last image