Federal agencies headquartered in Canberra are sitting on digital record systems riddled with duplicate images, and the people responsible for fixing the problem say the window for action is narrowing fast. The issue — redundant scanned documents, duplicated photo assets and replicated digital files clogging government databases — has drawn scrutiny from archivists, freedom of information practitioners and digital governance specialists in recent weeks.
The timing matters. The National Archives of Australia, based on Queen Victoria Terrace in Parkes, is midway through a multi-year digitisation push. At the same time, the ACT Government's Directorate of Digital, Data and Technology Services has been expanding its shared services platforms across directorates in Civic and Fyshwick. When duplicate images proliferate at that pace, the downstream problems multiply: inflated storage costs, unreliable search results, and — most critically for a government town — compromised Freedom of Information responses.
The Australian National University's School of Computing, located on Acton Peninsula, has published research examining image deduplication algorithms applicable to large institutional repositories. Separately, the University of Canberra's Faculty of Arts and Design has worked with cultural institutions on metadata standards that help prevent duplicates forming in the first place. Neither institution has been formally contracted to advise on the current federal problem, but their work is cited in discussions happening inside agencies.
Records management professionals point to a specific technical bottleneck: most Australian Government agencies procured their document management systems under contracts awarded between 2015 and 2019, before deduplication was a standard feature requirement. Replacing or upgrading those systems carries significant cost. The Digital Transformation Agency, which sits on Mort Street in the CBD, sets whole-of-government policy on this infrastructure, and agency insiders say updated guidance on image asset governance is expected before the end of the 2026 calendar year.
What the Specialists Are Recommending
Those working in the field are converging on a handful of practical positions. First, retrospective deduplication — running automated tools across existing archives — is now technically feasible and far cheaper than it was five years ago, with some commercial platforms quoting per-terabyte processing fees that have dropped sharply since 2022. Second, procurement policy needs to mandate deduplication capability in any new document management contract. Third, staff training is just as important as software: many duplicates originate from human behaviour, not system failure.
The ACT Government's own archives program, administered through the Territory Records Office in Phillip, faces a version of the same challenge at the sub-national level. Growth in the Gungahlin and Belconnen corridors has generated planning documents, environmental assessments and infrastructure records at record rates as the city's population, which passed 470,000 in 2025 according to ACT Government estimates, continues to climb. Each development application generates multiple scanned attachments, and without active deduplication controls, the same drawing or report can appear in a system dozens of times under different reference numbers.
For public servants navigating this in practice, the immediate advice from digital governance practitioners is straightforward: audit before you migrate. Any agency planning to move records to a new platform in the next 12 months should run a duplication analysis on existing holdings first, or risk embedding the problem permanently in a new environment. The cost of that audit, according to publicly available tender data from similar state government projects interstate, typically runs between $80,000 and $250,000 depending on archive size — a fraction of what remediation costs once a migration is complete.
The National Archives has not yet announced a formal policy position on deduplication standards. Its next scheduled update to the Digital Continuity 2020 policy framework — the document that governs how agencies manage and preserve born-digital and digitised records — is expected later this year.