Federal and territory archivists are under mounting pressure to tackle a sprawling duplicate-image problem in Canberra's public sector digital repositories, with storage costs, retrieval reliability and long-term preservation all identified as concerns by records management professionals this week.
The issue has sharpened in mid-2026 for a specific reason: the ACT Government's whole-of-government cloud migration program, which moved significant volumes of agency data onto centralised platforms over the past 18 months, inadvertently copied tens of thousands of image files multiple times during batch transfers. That migration, coordinated through the ACT's Digital, Data and Technology Solutions directorate, has left some agency folders carrying three or four copies of the same scanned document or photograph.
What the Experts Are Pointing To
Records management specialists at the Australian Society of Archivists, which has its national office in the ACT, have flagged duplicate-image proliferation as one of the sector's most underestimated ongoing costs. Storage isn't free: enterprise cloud storage for government in Australia runs broadly in the range of a few hundred dollars per terabyte per year at scale, and duplicated image files — which tend to be large — compound that bill quietly. A single high-resolution scan of an A3 heritage document can run to 50 megabytes or more; multiplied across a repository of millions of records, redundant copies represent a non-trivial budget drain.
At the Australian National University's School of Computing, researchers working on digital preservation methods have been developing hash-based deduplication tools designed specifically for archival collections. The approach works by generating a unique fingerprint for each image file; if two fingerprints match, one copy is marked for deletion after human review. Academics there have described the methodology in recent conference papers as both mature and deployable at relatively low cost, though implementation in government settings requires careful integration with existing records classification frameworks.
The National Archives of Australia, headquartered at the Parkes precinct near Queen Victoria Terrace, manages the Commonwealth's permanent digital record. The Archives has published technical guidance on deduplication as part of its Digital Continuity 2025 policy framework, which formally expired at the end of 2025 and whose successor framework is still being finalised. That gap in overarching policy has left some agencies uncertain about their obligations when duplicate files are discovered.
Local Implications for the Public Service
For Canberra's dominant public-service workforce, the practical consequences are less abstract than they might sound. Staff at agencies clustered in Barton, Woden and the CBD who rely on shared document management systems sometimes encounter multiple versions of the same image with no clear indication of which is the authoritative copy. That slows down Freedom of Information responses, complicates internal audits, and creates headaches for teams preparing briefings that draw on photographic or scanned-document evidence.
The ACT's own records authority, operating under the Territory Records Act 2002, requires agencies to maintain accurate and accessible records. Duplication does not automatically breach that requirement, but it does complicate the demonstration of compliance — a point that auditors at the ACT Auditor-General's Office have raised in broader digital governance contexts in previous annual reports.
The University of Canberra's Faculty of Arts and Design, which runs postgraduate programs in archives and records management from its Bruce campus, has been fielding increased enrolment inquiries for its digital preservation subjects. Practitioners working in Canberra-based agencies appear to be seeking formal credentials to manage exactly these kinds of emerging technical challenges.
For agencies or teams facing the problem now, the practical path forward involves three steps that specialists broadly agree on: run a checksum audit to identify exact duplicates before touching anything; apply a records retention schedule to determine which copy qualifies as the authoritative version; and document the deduplication process so that any future audit can confirm nothing was deleted improperly. Getting those steps sequenced correctly — rather than rushing to free up storage — is where most implementations go wrong. The digital equivalent of throwing out the original while keeping the photocopy is an error that, in a government records context, can prove very difficult to undo.