SobekCM Technical Aspects
SobekCM METS Editor

Batch Processing through a set of Directories

Overview

Selecting the third option (Step through a series of directories) from the Batching Menu lets the SobekCM METS Editor create or update METS files throughout a deep folder structure. All resource files in the folders are added to the METS files as well. This is a particularly useful way to take METS files generated from spreadsheets, CSV files, or a Marc21 data report and add related resource files in bulk.

Process Instructions

Selecting the third option (Step through a series of directories) from the Batching Menu launches the following form:

Figure 1: Batch Import Directory Recursion Form

This form allows you to select the parent directory to recurse through.

You must also select the type of metadata source file to read. This can be a pre-existing METS file, a MODS file, a MarcXML file, or a Dublin Core description file. After selecting the type, enter or select the filter used to determine which file to read in each directory. The filter can be a wildcard pattern like the pre-populated examples, such as '*.mets', '*.mets.xml', or '*_dc.xml'. If all your metadata files have the same name, you can also enter the full name, such as 'dc.xml'.
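To illustrate how such a filter selects metadata files across a folder tree, here is a minimal Python sketch. It is not the METS Editor's own code: the function name find_metadata_files is hypothetical, and the case-insensitive comparison is an assumption about how the filter behaves.

```python
import fnmatch
import os


def find_metadata_files(parent_dir, file_filter):
    """Yield (directory, filename) for each file under parent_dir matching the filter.

    Hypothetical helper for illustration only. The filter may be a wildcard
    pattern such as '*.mets.xml' or a full file name such as 'dc.xml'.
    """
    pattern = file_filter.lower()
    for dirpath, _dirnames, filenames in os.walk(parent_dir):
        for name in sorted(filenames):
            # Assumption: names are compared without regard to case.
            if fnmatch.fnmatchcase(name.lower(), pattern):
                yield dirpath, name
```

A full name such as 'dc.xml' contains no wildcard characters, so it matches only files with exactly that name, while '*_dc.xml' matches 'item_dc.xml' but not 'dc.xml'.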

If you select any source file type other than METS, you will also need to choose how the METS ObjectID should be assigned. If you choose to have consecutive ObjectIDs assigned in accordance with the SobekCM BibID pattern, the first BibID you enter must adhere to the following rules.

SobekCM Bibliographic Identifier (BibID) Rules

The BibID is a ten-character alphanumeric identifier used by SobekCM libraries. It begins with at least two characters and ends with at least four digits. Whatever you enter becomes the beginning of the final BibIDs, which also act as part of the ObjectID for the METS files.

If you enter a string shorter than ten characters, it is used as the prefix, and the remaining positions are filled with a zero-padded counter. For example, entering 'ca008' will cause the first BibID assigned to be 'CA00800000', the second 'CA00800001', the third 'CA00800002', etc.

Entering a full BibID, such as 'MANIOC0123', splits the entry into two parts, and each subsequent BibID is computed by counting up from the number provided. For example, entering 'MANIOC0123' will cause the first BibID assigned to be 'MANIOC0123', the second 'MANIOC0124', the third 'MANIOC0125', etc.
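The two cases above can be sketched in a few lines of Python. This is an illustration of the rules as described here, not the editor's actual implementation: the name bibid_sequence is hypothetical, and splitting a full BibID at its trailing run of digits is an assumption about how the two parts are determined.

```python
import re

BIBID_LENGTH = 10  # a BibID is always ten characters long


def bibid_sequence(start, count):
    """Return `count` consecutive BibIDs derived from the entered string.

    Hypothetical helper for illustration only.
    """
    start = start.upper()
    if len(start) > BIBID_LENGTH:
        raise ValueError("a BibID has at most ten characters")
    if len(start) < BIBID_LENGTH:
        # Short entry: use it as the prefix and count up from zero.
        prefix, first = start, 0
    else:
        # Full BibID (assumption): split off the trailing digits and count up.
        match = re.fullmatch(r"(.*?)(\d+)", start)
        if match is None:
            raise ValueError("a full BibID must end in digits")
        prefix, first = match.group(1), int(match.group(2))
    width = BIBID_LENGTH - len(prefix)
    bibids = []
    for number in range(first, first + count):
        if len(str(number)) > width:
            raise OverflowError("counter no longer fits in ten characters")
        bibids.append(f"{prefix}{number:0{width}d}")
    return bibids
```

With this sketch, bibid_sequence('ca008', 3) yields 'CA00800000', 'CA00800001', 'CA00800002', and bibid_sequence('MANIOC0123', 3) yields 'MANIOC0123', 'MANIOC0124', 'MANIOC0125', matching the examples above.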

When you are done entering the necessary information, press the CONTINUE button.

Process Description

Once the process begins, the form will become larger, and a progress bar and current status will be displayed throughout the process.

Figure 2: Batch Processing

The process will run through the following steps:

  1. Metadata File Read (one of two options here)
    • METS file read
    • MODS, MarcXML, or Dublin Core file read
      In this case, because a new METS file is being built, all the defaults set in your metadata preferences are applied. In addition, an ObjectID is assigned according to the rule you set in this form.
  2. All files are added to the METS, in either the page image or the download structure map
  3. If your current settings call for checksums, the checksums are computed
  4. The METS file is written within the directory
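As a rough illustration of the checksum step, the following Python sketch computes a checksum for every resource file in one directory while skipping the metadata file itself. The function names are hypothetical, and the use of MD5 is an assumption: this page does not state which algorithm the METS Editor uses.

```python
import hashlib
import os


def file_checksum(path, algorithm="md5", chunk_size=65536):
    """Return the hex digest of one file, read in chunks to limit memory use."""
    digest = hashlib.new(algorithm)  # "md5" is an assumed default
    with open(path, "rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def resource_checksums(directory, metadata_name, algorithm="md5"):
    """Map each resource file name in `directory` to its checksum.

    Hypothetical helper: the metadata file and any subdirectories are skipped.
    """
    checksums = {}
    for name in sorted(os.listdir(directory)):
        path = os.path.join(directory, name)
        if name == metadata_name or not os.path.isfile(path):
            continue
        checksums[name] = file_checksum(path, algorithm)
    return checksums
```

Reading in fixed-size chunks keeps memory use flat even for large image files, which matters when stepping through many directories in one batch.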