Canberra's public sector is facing renewed pressure to clean up how it manages digital image libraries, after years of duplicate, mislabelled, and orphaned files quietly inflating storage costs and creating compliance headaches across agencies. The problem is unglamorous, but insiders say it is costing taxpayers money and slowing down the kind of digital-services reform the Albanese government has staked significant political capital on delivering.
The issue has sharpened in 2026 as agencies push to consolidate legacy systems ahead of a broader Australian Government Architecture (AGA) review deadline later this year. When departments migrate records into centralised repositories, duplicate images — the same photograph or scanned document stored multiple times under different file names or metadata tags — surface in large numbers. Managing them properly, or failing to, has direct consequences for Freedom of Information compliance, records integrity, and the cost of cloud storage contracts.
Why Canberra Is Ground Zero for the Debate
The ACT government's own Digital Strategy, which runs through to 2025–2026, flagged data quality as a priority. But the Civic-based team within the ACT Digital, Data and Technology Solutions directorate has been candid in briefing materials that image deduplication remains an area where neither automated tooling nor human review processes have been standardised across agencies. A spokesperson for the directorate declined to provide figures on the volume of affected records when contacted by The Daily Canberra this week.
At the federal level, the National Archives of Australia, headquartered on Queen Victoria Terrace in Parkes, has long grappled with the problem in its digitisation programs. Archives staff processing physical records for the RecordSearch database — which holds tens of millions of items — routinely encounter situations where a scanned image has been ingested more than once, either through batch-processing errors or because multiple units submitted the same document independently. The Archives published revised digitisation standards in late 2024, but implementation guidance specific to image deduplication was not included in that release.
Researchers at the Australian National University's School of Computing, based on the Acton campus, have been working on machine-learning tools designed to flag near-duplicate images in large institutional collections. The approach uses perceptual hashing — a technique that generates a fingerprint for each image and compares them at scale — rather than relying on file-name matching alone, which experts say catches only a fraction of true duplicates. The University of Canberra's Centre for Creative and Cultural Research, on Kirinari Street in Bruce, has explored similar questions in the context of cultural heritage collections, noting that the stakes extend beyond efficiency to questions of historical accuracy when duplicate records carry conflicting metadata.
What Experts and Officials Want to See
The conversation among practitioners in Canberra right now is less about whether deduplication matters and more about who owns the problem. Records managers argue it sits with IT procurement teams who approved cloud migration contracts without deduplication clauses. IT teams say it belongs with information governance. Neither group has the budget authority to mandate a fix across portfolio agencies.
Digital records specialists consulted for this article — who work across several Commonwealth departments but were not authorised to speak on the record — described the situation as one where the Australian Government Information Management Office (AGIMO) functions, now absorbed into the Australian Public Service Commission and the Digital Transformation Agency in Canberra's Barton precinct, left a governance gap that has never been properly filled.
The Digital Transformation Agency, which sits on King Edward Terrace in Parkes, has not issued a public standard on image asset management as of July 2026. Its Digital Service Standard covers service delivery but does not extend to back-end data hygiene at the asset level.
For agencies looking to move now, practitioners point to three practical steps: conduct an audit using perceptual hashing tools before any cloud migration, establish a single authoritative image repository with access controls that prevent parallel ingestion, and assign a named data steward — not just a team — with sign-off authority. The ACT government's next Digital Strategy iteration, expected to be released for public consultation in the second half of 2026, is seen as an opportunity to lock those expectations into policy for territory agencies at least. Whether the federal government follows suit is another question entirely.