ACT government agencies, the Australian National University and several federal bodies headquartered in Canberra are sitting on digital image archives that contain significant proportions of duplicate files — wasting storage budgets, slowing down retrieval systems and complicating Freedom of Information requests. The push to fix it is now a live policy conversation in the capital.
The issue has sharpened in 2026 as the federal government's digital record-keeping obligations under the Archives Act 1983 come under renewed scrutiny. The National Archives of Australia, based on Queen Victoria Terrace in Parkes, has been expanding its digitisation program for physical records, and that work is flushing older, unmanaged image libraries into systems that were never designed to deduplicate at scale. The result is repositories where the same photograph, scan or graphic can appear dozens of times under different file names, different folders and different metadata tags.
Experts in digital asset management point to a structural problem: agencies built their image libraries organically over two decades, often without a common taxonomy or a single ingestion point. Every machinery-of-government change — and Canberra has had several since 2022 — shuffled shared drives between departments, copying files rather than migrating them cleanly.
What the Institutions Are Saying
At the Australian National University's Chifley Library precinct on Acton Peninsula, library and information science researchers have been studying the deduplication challenge in cultural heritage collections. Academic work published through ANU's College of Arts and Social Sciences this year identified that large cultural repositories can carry duplicate rates of between 15 and 40 percent depending on how long a collection has been accumulating without governance intervention — a range that aligns with anecdotal figures circulating among ACT government IT managers.
The University of Canberra, on Kirinari Street in Bruce, has a digital humanities research unit that has flagged similar concerns about the image libraries used in publicly funded research projects. Researchers there argue that deduplication is not merely a storage cost issue — it affects reproducibility, because published research that cites a specific image file can lose its reference chain when duplicates are pruned without proper version control.
The ACT Government's Digital, Data and Technology Solutions directorate, which manages shared ICT infrastructure for territory agencies, has acknowledged the problem exists but has not publicly released figures on the scale of duplication across its systems. Procurement records on the ACT Government's tender portal show the directorate sought vendor expressions of interest in digital asset management platforms in late 2025, with contract notices referencing requirements for automated deduplication functionality — a signal that the territory is moving toward a procurement-based solution rather than a manual audit.
The Cost Question and What Comes Next
Storage is not free. Cloud infrastructure costs for Australian government agencies typically run on whole-of-government panel arrangements, and duplicated files consume real budget. Industry benchmarks cited in digital governance literature suggest that eliminating redundant image files in a mid-sized government repository can reduce raw storage consumption by 20 to 30 percent, though the labour cost of auditing and remediating the collection often offsets short-term savings.
For Canberra specifically, the stakes include more than efficiency. The National Capital Authority manages image libraries covering the parliamentary triangle, Anzac Parade and Canberra's designed landscape — records with heritage significance. Losing provenance through careless automated deletion is a legitimate risk that archivists here take seriously.
The practical path forward, according to digital records specialists, involves three steps: an automated scan to identify candidate duplicates using hash-matching tools, a human review workflow to confirm deletion decisions on flagged files, and a governance policy that prevents the problem recurring at ingestion. Several vendors have been pitching exactly this sequence to ACT and federal agencies over the past six months.
Agencies and institutions that want to get ahead of an eventual whole-of-government mandate would be wise to start the audit work now, before a centralised policy locks in a single vendor or methodology that may not suit their collections. The conversation in Canberra has moved well past whether to act. The argument now is about who leads, who pays and how fast it happens.