Thousands of duplicate digital images are sitting inside ACT government databases, public records systems and institutional archives — redundant files that waste storage, confuse cataloguers and, in some cases, risk burying the authoritative version of a document beneath layers of identical copies. Records managers across Canberra are now pushing for clearer policy on how those duplicates get identified, ranked and replaced.
The pressure has sharpened this year as agencies prepare for a broader digital asset overhaul tied to the ACT Government's whole-of-government cloud migration, a program that has been rolling through directorates since late 2024. When agencies move legacy databases into centralised cloud storage, duplicate image files multiply the migration cost and complicate version control — particularly in environments where a single photograph or scanned document may have been uploaded, re-saved and re-tagged dozens of times across different business units.
Why Canberra's Institutional Landscape Makes This Harder
The problem is especially acute in the capital for structural reasons. The Australian National University holds one of the largest research image repositories in the southern hemisphere, spanning decades of scientific fieldwork and policy documentation. The University of Canberra's research data services team manages separate repositories covering social science and urban planning imagery, some of which overlaps with ACT government planning records held on Nishi and London Circuit. Both institutions operate under federal and territory privacy frameworks simultaneously, meaning any replacement or deletion decision touching personally identifiable imagery requires sign-off across multiple compliance layers.
The ACT Archives Authority, based on Rudd Street in Fyshwick, sets the retention schedule that governs when a record can be altered or destroyed — and that schedule does not yet have a specific provision addressing automated duplicate detection. Records managers working within the authority's framework have been lobbying for an amendment since at least early 2025, arguing that the existing rules were written before machine-learning deduplication tools became standard procurement items.
At the federal level, the National Archives of Australia on Queen Victoria Terrace in Parkes faces a parallel issue at far greater scale. Its digitisation program — which has been converting physical holdings to digital formats for several years — generates derivative image files at multiple resolutions, and each derivative is technically a distinct record under current policy. The result is that a single scanned photograph can exist in five or six forms in the same system, none of which can be deleted without a formal disposal authority.
What the Experts and Officials Are Saying
Records management professionals speaking at a May 2026 forum hosted by the Records and Information Management Professionals Australasia chapter in Canberra argued that the sector needs a national standard distinguishing between a true duplicate — a byte-for-byte identical file — and a derivative copy, which carries independent archival value. Without that distinction codified in policy, agencies default to keeping everything, which defeats the purpose of any cleanup exercise.
Digital preservation specialists have pointed to hash-verification tools as the most defensible method for identifying true duplicates, because the process is auditable and does not rely on subjective metadata matching. Several ACT directorates have begun trialling such tools internally, though no whole-of-government rollout date has been announced. Procurement records from Access Canberra's digital services panel, published on the ACT Government's tender portal, show contracts for data quality and deduplication services valued at under $200,000 have been let to at least two suppliers since January 2026.
The practical advice coming from records professionals is consistent: agencies should not wait for a comprehensive policy update before acting on obvious true duplicates, but they should document every replacement decision with a disposal log that references the applicable retention schedule. Any image tied to ongoing litigation, a freedom of information request, or a heritage listing — such as photographs connected to properties on the ACT Heritage Register — should be quarantined from any automated replacement process until legal clearance is obtained.
For Canberra's institutions, the next pressure point is the end of the 2025–26 financial year migration window. Directorates that have not resolved their duplicate image inventories before consolidation will carry the problem into the new cloud environment — and at that point, fixing it becomes significantly more expensive.