Skip to content. | Skip to navigation

Personal tools

You are here: Home / About the library / Projects / Minimal Effort Ingest

Minimal Effort Ingest

Ingest more for less!

NEWS

Minimal Effort Ingest

An expensive part of ingesting digital collections into digital repositories is the quality assurance (QA) phase. Traditionally, data and metadata are quality assured before ingest,
to ensure that only data which complies with the repository data formatting and documentation standards is preserved.

In Minimal Effort Ingest, we postpone the QA of data and metadata until after the data has been ingested and even further, if resources are not available. This approach makes
it possible to secure the incoming data quickly.

This approach was presented at iPres2015 using this poster (pdf) and this abstract (pdf) and won best poster award.

Minimal Effort Ingest was also presented at the DPC briefing day "Practical Preservation and People: a briefing about metadata" in Belfast in December 2015 using this Minimal Effort Ingest presentation.

At the State and University Library, Denmark, we have implemented Minimal Effort Ingest as the workflow for our Newspaper Digitization Project.

The State and University Library, Denmark has also incorporated Minimal Effort Ingest into both it’s Digital Preservation Policy (pdf) and Strategy (pdf).

Autonomous Preservation Tools

At iPres2016 we present Autonomous Preservation Tools in Minimal Effort Ingest.

 

An Autonomous Preservation Tool is a preservation tool, which can discover AIPs (Archive Information Packages) to process on its own. We assume that AIPs maintain an account of past events. In Digital Preservation such an account can be important for showing data authenticity and provenance, so many repository systems implement this already. From this
account the Autonomous Preservation Tool can determine whether it has already processed an AIP or not.
This approach brings a great deal of flexibility:
  • Removing an Autonomous Preservation Tool is a local operation. No other Autonomous Preservation Tools or workflows will be affected.
  • When a new Autonomous Preservation Tool is added, it will automatically start processing old AIPs as well as new. No migration steps are needed.
  • When an Autonomous Preservation Tool is changed, it can be marked as a new Autonomous Preservation Tool, and thus start processing all previously processed AIPs. Alternatively, the tool can be configured to continue where it left off, and thus only process new AIPs.

The NEWSPAPER PROJECT AND THE OPEN SOuRCE CODE

The homepage for the newspaper project is en.statsbiblioteket.dk/newspaper-digitization. There is online access to the digitised newspapers through the State and University service Mediestream The homepage for the implementation project is sbforge.org/display/NEWSPAPER/Newspaper+digitisation+Home. The open source code is on Github github.com/statsbiblioteket?utf8=✓&query=newspaper. Start on the homepage for the implementation project to get an overview of all the involved code repositories.

Minimal contact

Contact

Bolette Ammitzbøll Jurik
+45 8946 2322
baj@statsbiblioteket.dk

Asger Askov Blekinge
+45 8946 2100
abr@statsbiblioteket.dk

Kåre Fiedler Christiansen
+45 8946 2036
kfc@statsbiblioteket.dk