Can Digital Alone Preserve the Record?

Can Digital Alone Preserve the Record?

DIGITAL PRESERVATION

A Research Review on Format Obsolescence, Authentication, AI Risk,

Vendor Independence, and the Emerging Non-Digital Answers

April 2026    Based on Current Institutional Research

 

ABSTRACT

The global archival community is investing heavily in digital preservation frameworks, blockchain authentication, and synthetic DNA storage research. Yet the aggregate evidence from those same research streams reveals a consistent finding: no currently deployable digital technology alone satisfies all five critical preservation criteria simultaneously — format independence, AI resilience, single source of truth, authentication integrity, and freedom from private vendor dependency. This review synthesises the latest institutional research to examine where each technology stands, what its structural limits are, and what the evidence says about the role of non-digital preservation media in a comprehensive long-horizon strategy.

 

1. Format Migration — The Treadmill Problem Has No Finish Line

 

 

The most authoritative current body of work comes from the Digital Preservation Unit at the U.S. National Archives and Records Administration (NARA), which now formally tracks 742 file format versions across 16 electronic record categories. Their Q4 2024 Digital Preservation Framework update is instructive not as a sign of progress but as evidence of the ecosystem's ongoing instability:

“Four Record Category Preservation Action Plans were revised... covering Digital Audio, Digital Cinema, Digital Video, and Digital Still Image.”  — NARA Digital Preservation Framework Update, Q4 2024 [1]

A preservation framework that must be revised quarterly to keep pace with format changes is, by structural definition, not a solved problem. The science confirms why. ACS Nano research into alternative storage media states plainly:

“Current storage technologies are not suitable for long-term storage beyond 50 years.”  — ACS Nano / PMC — Emerging Approaches to DNA Data Storage [2]

This means that every digital record, regardless of how well it was originally created, must be actively migrated to newer media multiple times across a single human lifetime — and an indefinite number of times across institutional timescales. Queen's University Belfast's Digital Preservation Strategy 2025–2030 represents the state of institutional best practice today: checksum validation at deposit, annual format migration reviews, geographic replication, audit trails. It is competent, professional work. It is not a solution. It is perpetual maintenance.

2. AI Changes — A New Attack Surface on Preservation Integrity

 

 

Artificial intelligence introduces two structurally distinct threats to digitally preserved records that current preservation research is only beginning to address.

2a. AI-Generated Content Contamination

As AI-produced text, images, and documents become indistinguishable from authentic records at the byte level, the ability to assert that a digitally preserved file is what it claims to be is fundamentally undermined. The blockchain community's answer — cryptographic hashing and on-chain provenance registration — addresses this partially but only for records registered before AI manipulation was possible. Retroactive authentication of existing digital archives at scale remains an unsolved problem.

2b. AI-Driven Format and Codec Fragmentation

AI systems are increasingly the primary consumers of archived data, for training, retrieval, and autonomous analysis. This imposes format and encoding requirements on archives that differ substantially from those optimised for human readability. As Radiant Archives observed in their 2025 review of emerging preservation technologies:

“Machine learning models can predict when file formats are likely to become obsolete, enabling proactive migration strategies.”  — Radiant Archives — The Future of Digital Archiving, January 2025 [3]

The irony is that the same AI systems being proposed as preservation tools are also accelerating the obsolescence cycle they are meant to manage. Formats optimal for 2025-era AI ingestion may be meaningless to 2040-era systems. The migration treadmill now runs on two tracks simultaneously.

3. Single Source of Truth — Blockchain Is the Best Digital Answer, But It Has Structural Limits

 

 

Research consensus in 2025 identifies blockchain as the most credible digital mechanism for establishing a tamper-evident, decentralised single source of truth for records. Cornell University's blockchain programme summarises the core claim:

“Using the blockchain makes your workflow and work life a lot easier because it's a single source of truth. It's immutable, it can't be changed and it's transparent.”  — Cornell University / Ava Labs — Blockchain Chronicle, August 2025 [4]

The most architecturally sophisticated archival model is the academic TrustChain framework, which proposes combining an immutable blockchain-based archival system with incrementally evolving metadata to maintain the archival bond between records over time. The blockchain embedded authentication capability is well-established:

“Blockchain embeds authentication into the document itself, and protects against tampering and modification.”  — Anderson Archival — Taking Blockchain Technology to Preservation [5]

However, blockchain for archival purposes carries structural limits that the research community acknowledges but institutions frequently understate:

  Blockchain consensus can be overridden. As the Springer/Cluster Computing review notes: “blockchain technology is resistant to tampering, but it can be altered if consensus is reached among participants.” [6] This is not merely theoretical for state-level threat actors.

  Blockchain stores hashes of records, not the records themselves. The cryptographic fingerprint proves a file has not changed since registration; it does not preserve the file’s readability or interpretability across decades. The underlying file must still be stored and migrated elsewhere.

  Public blockchain infrastructure depends on network participation incentive structures — typically cryptocurrency economics — controlled by private entities and subject to regulatory and geopolitical disruption.

  Long-horizon blockchain viability assumes continuous global internet infrastructure, electricity, and consensus protocol maintenance. None of these can be guaranteed across 50–500-year archival timescales.

 

4. Vendor Independence — Where Digital Preservation Is Most Exposed

 

 

Every layer of the digital preservation stack — cloud infrastructure, software rendering environments, codec libraries, operating systems, hardware fabrication, undersea cable networks — involves private companies and international supply chains dominated by a small number of jurisdictions. NARA's own framework acknowledges this constraint implicitly, measuring migration feasibility by the availability of tools maintained by private entities that carry no legally binding preservation obligations to future generations.

Institutions are responding with their feet. The global digital-to-microfilm service market, which converts born-digital records onto archival-grade microfilm for long-term physical preservation, is growing:

“The global Digital to Microfilm Service market... is projected to grow from USD 178 million in 2025 to USD 236 million by 2031.”  — Intel Market Research — Digital to Microfilm Service Market, 2025 [7]

These are not nostalgia purchases. They represent deliberate risk-mitigation decisions by banking, government, insurance, and healthcare institutions that have conducted lifecycle cost analysis and concluded that digital-only preservation is structurally exposed. The same report notes that commercial enterprises account for nearly 42% of demand, “reflecting growing adoption in banking and insurance sectors for compliance document preservation.”

5. DNA Data Storage — The Frontier Technology With a Millennial Promise

 

 

The most scientifically significant emerging preservation technology is synthetic DNA data storage, which by mid-2025 has crossed from laboratory demonstration to early commercial prototyping. The 2025 Journal of AI in Bioinformatics roadmap paper summarises the trajectory:

“DNA storage — from provocative demonstrations into mature technologies with early end-to-end prototypes.”  — Chen, Li & Cao — DNA Storage in Biological Data Management, 2025 [8]

The headline preservation figure is remarkable. ACS Nano research comparing media types states:

“The maximum preservation time is 50 years for digital memories and 500 years for paper, while it is millennia for inorganic matrix-encapsulated DNA.”  — ACS Nano / PMC — Emerging Approaches to DNA Data Storage [2]

The SNIA DNA Data Storage Technology Review (June 2025) adds a critical architectural advantage over both digital and conventional analog media:

“Future reading technologies can be applied to DNA data archives... the reader need not be packaged away with the data.”  — SNIA — DNA Data Storage Technology Review v1.0, June 2025 [9]

This addresses the format obsolescence problem at a physical layer: unlike digital files that must be continuously migrated to remain readable, DNA archives can, in principle, be read by sequencing technologies that do not yet exist, without any action on the stored data itself.

Critical Unsolved Problems for Practical Archival Deployment

  Write costs and latency remain commercially unviable for most institutional use. The 2025–2030 roadmap for DNA storage is still a roadmap, not a deployed solution.

  Random access at scale is constrained. The SNIA review notes that current top-end sequencing platforms can only access approximately 312 GB at a time — below the requirements of most modern institutional archives.

  Standardisation is embryonic. The DNA Data Storage Alliance's sector specifications are first-generation.

  Environmental sensitivity — temperature, humidity, UV exposure, pH — means DNA archives require controlled conditions analogous to microfilm vaults, without microfilm’s 150-year institutional practice track record.

  Commercial DNA synthesis and sequencing infrastructure is concentrated in a small number of private companies, recreating the vendor dependency problem it is meant to solve.

6. The Analog Benchmark — What Microfilm Actually Delivers

 

 

Against this landscape, the properties of silver halide microfilm on polyester base are not a sentimental argument for an old technology. They are a rigorous performance specification that no currently deployable alternative meets across all five preservation criteria simultaneously.

“Microfilm boasts an impressive life expectancy of up to 500 years — a significant advantage for institutions tasked with long-term preservation.”  — LIS Academy — Microfilming for Preservation: Pros and Cons, November 2025 [10]

The technology-independence argument is equally precise:

“Microfilm requires no electricity or specialized software... making it essentially ‘future-proof’ against the rapid changes in technology that challenge digital preservation efforts.”  — LIS Academy — Microfilming for Preservation: Pros and Cons, November 2025 [10]

On security and authentication, microfilm’s analog nature provides guarantees that no digital system can replicate without continuous active maintenance:

“As an analog medium, microfilm cannot be remotely accessed or altered... not affected by electromagnetic pulses... reading microfilm requires only optical magnification.”  — LIS Academy — Microfilming for Preservation: Pros and Cons, November 2025 [10]

On long-term cost, the lifecycle analysis from the global market review is direct:

“Microfilm can be 50–150 times more cost-effective than digital storage for long-term preservation applications.”  — Micrographics Data Online — Microfilm Renaissance: Global Case Studies, 2025 [11]

The standard practice at the world’s leading tier-1 institutions reinforces the point. Cornell University’s preservation documentation states: “Preservation microfilm is intended to be permanent, and for this reason, three generations are produced: the camera or master negative, the print master negative, and the positive-use copy.” The British Library, the Library of Congress, and the National Archives of multiple nations all operate on this same master/duplicate/access-copy model.

 

7. The Verdict: Can We Trust Digital Alone?


 

The aggregate evidence from current institutional and academic research produces a clear answer: no. Digital preservation alone cannot satisfy the full set of long-horizon requirements simultaneously, and the community’s own research documents precisely why.

The root structural problem is that digital preservation conflates two fundamentally different functions — access and permanence — and optimises overwhelmingly for access. Every architecture decision that makes a digital record easier to retrieve, search, and share also increases its exposure to format obsolescence, vendor dependency, cyberattack, electromagnetic risk, and the continuous cost of active maintenance.

The most intellectually honest assessment is that the world needs a layered preservation architecture in which different layers carry fundamentally different threat profiles. The research supports this directly:

 

Recommended Layered Preservation Architecture

Layer

Medium

Role

Vendor Dependency

Access & Discovery

Digital (cloud/local)

Daily use, search, AI ingestion

High — private companies, internet

Authenticated Provenance

Blockchain hash anchoring

Chain of custody, tamper evidence

Medium — distributed network

Redundant Master Archive

Silver halide microfilm on polyester base

500-year preservation, no tech dependency

Zero — optics only

Future Cold Store

Synthetic DNA (when viable)

Millennial-scale density storage

High — biotechnology infrastructure

 

The digital layers serve access. The analog layer serves permanence. DNA storage, when commercially viable, extends the cold-archive layer to geological timescales. Blockchain anchors provenance and chain-of-custody integrity. Each layer does what it is physically and architecturally suited to do.

The field’s current moment is one of honest reckoning. The institutions spending the most on digital preservation research are the ones most clearly documenting why digital preservation alone is insufficient. The organisations converting born-digital records back to microfilm are not acting from ignorance. They are acting from rigorous lifecycle analysis of risk, cost, and the physics of long-term information survival.

The question is not whether analog has a role in the 21st century. The question is whether institutions are willing to build preservation architectures that match the actual threat environment, rather than the one that is most convenient to fund, procure, and manage within a single organisation’s technology refresh cycle.

Sources & References

 

 

[1]  NARA Digital Preservation Unit. “Digital Preservation Framework Updates, October–December 2024.” Fixity Check Blog, January 8, 2025. fixity-check.blogs.archives.gov

[2]  ACS Nano / PMC. “Emerging Approaches to DNA Data Storage: Challenges and Prospects.” pubs.acs.org/doi/10.1021/acsnano.2c06748

[3]  Radiant Archives. “The Future of Digital Archiving: Emerging Technologies and Best Practices.” January 15, 2025. radiant-funds.com

[4]  Cornell University Chronicle. “Blockchain platform securely digitizes public records.” August 2025. news.cornell.edu

[5]  Anderson Archival. “More Than Bitcoin: Taking Blockchain Technology to Preservation.” andersonarchival.com

[6]  Springer / Cluster Computing. “Blockchain-based access control and privacy preservation in healthcare.” August 2025.

[7]  Intel Market Research. “Global Digital to Microfilm Service Market.” 2025. intelmarketresearch.com

[8]  Chen, R., Li, X., & Cao, B. “The Future of DNA Storage in Revolutionizing Biological Data Management.” Journal of AI in Bioinformatics 1(2), 2025.

[9]  SNIA. “DNA Data Storage Technology Review Version 1.0.” June 30, 2025. snia.org

[10]  LIS Academy. “Microfilming for Preservation: Pros and Cons.” November 2025. lis.academy

[11]  Micrographics Data Online. “The Microfilm Renaissance: Global Case Studies and Cost Analysis of Analog Preservation in the Digital Age.” 2025.

This research review is compiled from publicly available institutional and academic sources as cited. April 2026.

Kembali ke blog

Tulis komentar

Ingat, komentar perlu disetujui sebelum dipublikasikan.