ACT government agencies and local research institutions are sitting on vast libraries of duplicate digital images — redundant files that inflate storage costs, muddy public records, and complicate everything from Freedom of Information requests to infrastructure planning. The push to clean up those libraries is gaining traction in 2026, driven by tighter agency budgets and a growing body of professional advice about what systematic deduplication actually involves.
The timing is not accidental. The ACT Budget delivered in June 2026 kept a close eye on digital infrastructure expenditure across the public service, and records managers inside the Territory have been under pressure to justify storage allocations that have ballooned alongside the proliferation of high-resolution photography, drone survey imagery, and scanned document collections. Duplicate files — sometimes numbering in the hundreds of thousands across a single department — sit at the centre of that conversation.
What the Professionals Are Telling Agencies
Records and information management specialists advising ACT government bodies have been consistent on one point: automated deduplication tools alone are not sufficient. The professional guidance circulating among digital archivists — including through programs run by the Australian Society of Archivists, which holds regular workshops in the ACT region — stresses that any replacement workflow must include human sign-off before originals are deleted, particularly where images form part of a legal or evidentiary record.
The concern is practical. A duplicate detection algorithm identifies files by hash value or pixel similarity, but it cannot determine which version of an image is the authoritative one for record-keeping purposes. A planning photograph taken during a Gungahlin corridor assessment, for example, might exist in three slightly different crop formats across the ACT Planning directorate, the National Capital Authority's consultation files, and a contractor's project folder. Which one gets kept matters — and that decision requires policy, not just software.
At the Australian National University on Acton Peninsula, library and digital preservation staff have been grappling with the same problem at scale. The ANU's Chifley Library and its affiliated digital repository have been working through image deduplication as part of a broader digitisation review, a process that specialists say typically takes between 12 and 18 months for a mid-sized institutional collection. Staff there have noted publicly — in sessions at the ANU Centre for Digital Humanities Research — that the governance framework matters as much as the technical tooling.
The Cost of Getting It Wrong
Storage is not cheap, even at government rates. Enterprise cloud storage for ACT agencies typically runs through whole-of-government arrangements managed via Shared Services, but those costs still land on agency budgets. Industry benchmarks suggest that unmanaged image duplication routinely inflates storage volumes by 20 to 40 percent in organisations that have not run a structured deduplication program. For a directorate holding several terabytes of survey, monitoring, and communication imagery — not an unusual figure for a department like Transport Canberra and City Services — that represents a non-trivial annual expenditure.
The broader risk is reputational as much as financial. When agencies respond to FOI requests or prepare evidence for ACT Civil and Administrative Tribunal proceedings, submitting multiple versions of the same image without clear metadata provenance creates confusion and can draw criticism from applicants and adjudicators. Information rights advocates in Canberra have pointed to this as a transparency problem, not merely an IT one.
For smaller entities — community organisations in Belconnen, local business councils, even the University of Canberra's institute programs on Bruce Campus — the calculus is simpler but the core advice from digital management professionals is the same: audit before you delete, document your methodology, and keep at least one backup of every original file for a defined retention period before a replacement workflow is considered complete.
The practical next step for ACT agencies, according to guidance from the National Archives of Australia's digital records standards, is to start with a controlled pilot on a discrete image collection rather than attempting a department-wide sweep. Agencies that have begun that work are being encouraged to document their methodology so it can be shared across the ACT public service — turning what has been a fragmented, directorate-by-directorate problem into something closer to a common solution.