Blogs: Preservation Risks

Blog posts filtered by the Preservation Risks subject tag.

Browse blogs by subject

All subjects Access Analysis Android apache tika ApacheTika AQuA ARC ARC to WARC archives archiving audiovisual Benchmark benchmarking best practice best practices Bit rot bitcurator board game British Library Characterisation Community compression Corpora CSV-Validator curation Database Database Archiving Database Preservation Delivery Digital Forensics digital preservation digitisation Disk Images DROID E-ARK E-ARK Project EaaS eArchiving Education Emulation epub Experimentation extensible Fido File Formats FLAC Flashback floppy disk floppy disks floppy drive Format Identification Format Registry GitHub Hackathon Hardware obsolescence help httpreserve Identification IDPD17 IMPACT Internet Standards iPRES. community survey isolyzer jhove job JP2 JPEG2000 jpylyzer LZW magnetic media Matchbox MediaConch Members Metadata metadate Migration Monitoring Normalisation OCR open Open Planets Foundation Open Preservation Foundation Open source OPF diary Optimization Packaging PDF PDF/A Planets policy PREFORMA PREMIS preservation Preservation Actions preservation planning Preservation Risks Preservation Strategies Preservia Process Projects PRONOM Provenance pywb recordkeeping records Representation Information Research data research infrastructure Resources RFC Rogues Gallery Rosetta Roy SCAPE Server Siegfried Signature Development significant properties Software Software benchmarking SPARQL specification specifications spreadsheets SPRUCE standards technical technical registry testing TIFF Tika Tools training validation veraPDF Virtual Machines w3c WARC Watch WAV WAVE Web Archiving Web Publications wget Wikidata Workflow Workflows Zip

Disclaimer: I am by no means an expert on TIFF (or anything else). This blog (series) is me sharing my recent look into TIFF errors. Please feel free to comment, point out errors, suggest better fixes, etc. At the end of the day, we’re all in this together and here to learn from each other! […]

By Micky Lindlar, posted in Micky Lindlar's Blog

19th Mar 2020  11:36 PM  647 Reads  1 Comment

Some four years ago I wrote a blog post that demonstrated how Apache Preflight (the PDF/A validator tool that is part of Apache PDFBox) can be used to detect features in a PDF that are potential preservation risks. A follow-up blog applied Schematron rules to the Preflight output in an attempt at doing policy-based assessments. […]

By johan, posted in johan's Blog

1st Jun 2017  1:53 PM  2279 Reads  No comments

Join us to help improve JHOVE, an open source, identification, characterisation, and validation tool widely-used by the digital preservation community. Building on the fantastic community collaboration from our first online JHOVE Hack Day, we are happy to announce that registration is open for our second JHOVE Hack Day at: https://jhoveonlinehackday-spring2017.eventbrite.co.uk. During our second online hack day […]

By Becky McGuinness, posted in Becky McGuinness's Blog

27th Apr 2017  8:00 AM  0 Reads  No comments

Many factors contribute to the long-term preservation of and access to digital collections. And typically, the endpoint for this material is a repository—or other type of preservation system. But what happens to content after it is stored? How do digital preservationists ensure that content is correct and valid when ingested as well as remains unchanged […]

By caylinsmith, posted in caylinsmith's Blog

23rd Jan 2017  12:57 PM  4437 Reads  1 Comment

In my previous blog post I addressed the detection of broken audio files in an automated workflow for ripping audio CDs. For (data) CD-ROMs and DVDs that are imaged to an ISO image, a similar problem exists: how can we be reasonably sure that the created image is complete? In this blog post I will […]

By johan, posted in johan's Blog

13th Jan 2017  3:30 PM  12155 Reads  5 Comments

While browsing ArchiveTeam's File Formats Wiki earlier this week, I came across some entries I created there on Quattro Pro spreadsheets two years ago. At the time I had also contributed some old Quattro Pro for DOS spreadsheets (here and here) from my personal archives to the OPF format corpus. Seeing those files again, I […]

By johan, posted in johan's Blog

29th Oct 2014  2:59 PM  26894 Reads  2 Comments

We’ve been doing legacy disk extracts at Archives New Zealand for a number of years with much of the effort enabling us to do this work being done by colleague Mick Crouch, and former Archives New Zealand colleague Euan Cochrane – earlier this year, we received some disks from New Zealand’s Department of Conservation (DoC) which we successfully imaged and […]

By ross-spencer, posted in ross-spencer's Blog

23rd Sep 2014  8:14 AM  15289 Reads  4 Comments

Over the last three and a half years, the SCAPE project worked in several directions in order to propose new solutions for digital preservation, as well as improving existing ones. One of the results of this work is the SCAPE preservation environment (SPE). It is a loosely coupled system, which enables extending existing digital repository […]

By jmaferreira, posted in jmaferreira's Blog

19th Sep 2014  1:51 PM  12355 Reads  No comments

I would like to draw your attention to the new QA tool for finger detection on scans: https://github.com/openplanets/finger-detection-tool. This tool was developed by AIT in scope of the SCAPE project.   Checking to identify fingers on scan manually is a very time-consuming and error-prone process. You need a tool to help you: Fingerdet. Fingerdet is […]

By Roman Graf, posted in Roman Graf's Blog

10th Jul 2014  11:49 AM  11246 Reads  No comments

Hi, this is my first blog post in which I want to introduce the project I am currently working on: Flint. history Flint (File/Format Lint) has developed out of DRMLint, a lightweight piece of Java software that makes use of different third party tools (Preflight, iText, Calibre, Jhove) to detect DRM in PDF-files and EPUBs. […]

By alecs, posted in alecs's Blog

2nd Jul 2014  12:53 PM  12630 Reads  No comments