Canberra's public sector holds one of the highest concentrations of digital image archives per capita of any city its size in the world. Across Commonwealth departments, the National Archives of Australia on Queen Victoria Terrace, the National Library on Parkes Place, and the Australian Institute of Aboriginal and Torres Strait Islander Studies in Acton, duplicate image files are consuming storage budgets and slowing down public-facing digital services. The problem has a name in records management circles — duplicate image proliferation — and Canberra is now confronting it at a scale that smaller capitals can only guess at.
The timing matters. The federal government's Digital Records Modernisation Program, which entered its second phase in early 2026, has pushed agencies to audit their digital asset holdings before a Commonwealth-wide cloud migration deadline later this year. That audit work has exposed what archivists and IT procurement officers have known for years: the same scanned photograph, heritage image, or departmental graphic often exists in dozens of slightly different file versions, each stored separately, each accruing hosting costs. For a city where roughly one in three workers is employed by the Australian Public Service, the inefficiency is not abstract — it lands directly on taxpayer-funded storage contracts.
What Canberra Is Actually Doing About It
The Australian National University's digital humanities team, based at Chifley Library on the Acton campus, has been piloting automated deduplication tools since late 2025 as part of a broader research data management project. The work involves applying perceptual hashing — a technique that identifies visually similar images even when file names and metadata differ — across collections that run to hundreds of thousands of assets. The University of Canberra's library and information science faculty in Bruce has separately published methodology guidance for government clients, though the scale of uptake across departments remains uneven.
The National Archives, which manages records under the Archives Act 1983, confirmed in its most recent corporate plan that digital storage rationalisation is a stated priority. It did not publish a dollar figure for current duplicate-related waste, but comparable agencies in comparable jurisdictions offer a rough guide. The UK National Archives reported in 2024 that deduplication exercises across three major collections freed up storage equivalent to roughly 18 percent of total holdings. If the same ratio applied to Canberra's consolidated Commonwealth holdings — a rough and conservative extrapolation — the savings would be substantial enough to register in annual budget reporting.
Wellington, Ottawa, and the Benchmark Problem
Wellington and Ottawa are the most instructive comparisons. Both are mid-sized federal capitals with dominant public sectors and centralised cultural collections, and both have further along deduplication programs than Canberra does. Statistics New Zealand's digital asset unit completed a full deduplication sweep of its image holdings in 2023, cutting stored file volume by 22 percent. Library and Archives Canada ran a similar exercise across its photographic collection between 2022 and 2024, reporting a reduction in active storage nodes from 14 to nine.
Canberra has not yet produced a comparable whole-of-government figure. Part of the reason is structural: unlike in Ottawa, where Library and Archives Canada operates as a single consolidated body, Canberra's cultural institutions — the National Library, National Gallery on King Edward Terrace, National Museum in Acton, and the Archives — each manage their own digital infrastructure. That federation of systems makes a unified deduplication sweep administratively complicated. A cross-agency working group under the Digital Transformation Agency has been examining the coordination question, though no public report has been released as of July 2026.
For anyone managing images inside an ACT government agency or a federally funded institution right now, the practical advice from the sector is consistent: start with an audit of assets created before 2015, when consistent file-naming standards were not yet widespread. Collections held in older content management systems, particularly those migrated across two or more platforms, are most likely to carry duplicates. The cost of doing nothing is no longer just a storage line item — it affects search performance, public access times, and the integrity of digital finding aids that researchers at institutions like ANU and UC rely on daily.