Canberra's public sector holds more duplicate digital images than almost any comparable city in the developed world, and the bill for storing them is no longer trivial. Across ACT government directorates and Commonwealth agencies concentrated in the parliamentary triangle, digital asset audits conducted over the past 18 months have flagged redundant image files as one of the fastest-growing contributors to unstructured data bloat — a problem that translates directly into storage procurement costs, slower systems, and compliance headaches under the Australian Government Recordkeeping Metadata Standard.
The timing matters because the federal government's data centre consolidation program, centred on facilities in Fyshwick and Hume, is pushing agencies to rationalise what they actually store before migrating to shared infrastructure. Duplicate images — identical or near-identical photograph files saved multiple times across different folders, SharePoint libraries, and legacy drives — are consistently identified in internal technology reviews as low-hanging fruit that nobody gets around to cutting.
What the Numbers Look Like on the Ground
Digital asset management specialists working across the ACT public sector estimate that duplicate and near-duplicate image files can account for between 30 and 40 per cent of an organisation's total image library in the absence of active deduplication policies. For a mid-size Commonwealth agency employing 800 to 1,200 staff — the typical footprint along Northbourne Avenue or in the Barton precinct — that translates to tens of thousands of redundant files. At current enterprise cloud storage rates, which have hovered around $0.023 per gigabyte per month for tiered Australian data centre hosting, the annual overhead on a single agency's duplicate image problem can reach tens of thousands of dollars before labour costs for retrieval and version confusion are factored in.
The Australian National University, which manages one of the largest digital image repositories in the ACT through its heritage collections and research data infrastructure, has publicly committed to a staged asset rationalisation program as part of its broader digital strategy review. The University of Canberra, based in Bruce, has similarly flagged data hygiene as a priority under its information governance framework. Neither institution publishes granular figures on duplicate file volumes, but both are active participants in the Australian Research Data Commons, which sets interoperability and deduplication standards for member institutions.
For ACT government specifically, the Digital Strategy 2025–2028 sets measurable targets around data quality, though the framework does not isolate image duplication as a standalone metric. The ACT Auditor-General's Office reviewed ICT asset management practices across three directorates in its 2024–25 performance audit cycle, with findings touching on unstructured data governance — a category that encompasses image libraries held by agencies such as Transport Canberra and City Services, which accumulates significant photographic records through infrastructure inspections along routes including Flemington Road in Gungahlin and the expanding light rail corridor.
Practical Steps Agencies Are Taking
Deduplication is not a glamorous fix, but it is measurable. Hash-based deduplication tools — software that generates a unique fingerprint for each image file and flags exact copies — can typically process a 500,000-file library in under 24 hours on standard server hardware. Perceptual hashing, which catches near-identical images that differ only in resolution or compression, takes longer but catches the category of duplicates that hash matching misses.
Agencies on the ACT Whole of Government Microsoft 365 tenancy, which covers most directorates operating out of offices in Civic and Woden, have access to SharePoint's built-in storage analytics dashboard, which surfaces duplicate file alerts as a standard feature. The gap, according to information management professionals in the sector, is not tooling — it is policy. Without a mandatory deduplication review cycle written into records management procedures, the audits simply do not happen.
The practical advice for agencies heading into the second half of 2026 is straightforward: schedule a baseline audit before the next budget cycle, establish a retention schedule that specifically addresses image assets, and assign a named records officer accountability for the outcome. The Fyshwick data centre migration window, expected to accelerate through late 2026 and into 2027, is the most logical forcing function available. Agencies that arrive at migration with clean image libraries will spend less, migrate faster, and face fewer compliance questions from the National Archives of Australia on the other side.