Our digital preservation projects
Bulk Ingest Pipeline
The amount of digital material in the Library’s collections continues to increase, so we need to ingest larger volumes of it into the National Digital Heritage Archive (NDHA) for preservation. To help with this, we are working on a Bulk Ingest Pipeline project.
The Bulk Ingest Pipeline project aims to redesign the process and scale up the technical infrastructure needed to process and import digital material into the NDHA. Its design goals include automating and centralising common functions where possible, rather than maintaining multiple tools with overlapping features or relying too heavily on manual processes.
The NDHA has been in production for 10 years, and the Library is now planning its next-generation version. The focus will be on re-architecting and modernising the back-end technical infrastructure so that it can better support the preservation needs of the next decade.
Web Curator Tool Development
The Library is collaborating with the National Library of the Netherlands to enhance the Web Curator Tool (WCT). The WCT is used in the Web Archiving programmes of both institutions.
The WCT is a free, open-source tool used to collect websites for archival purposes.
We use the WCT to harvest websites. Harvested websites are preserved in the NDHA and are available to researchers. The archive includes websites that are no longer available online.