Canberra's public sector has a clutter problem measured in terabytes. Across ACT government agencies, federal departments headquartered in the Parliamentary Triangle, and institutions such as the Australian National University, digital asset managers are confronting a mundane but expensive reality: duplicate images stored across shared drives, content management systems and cloud platforms are consuming storage that costs real money every single month.
The issue is more urgent now because the ACT government's ongoing digital transformation program — which centralised a significant portion of agency data infrastructure from 2023 onward — has forced IT teams to audit legacy file stores for the first time in years. What they are finding, according to publicly available digital asset management literature and records management guidance issued by the National Archives of Australia, is that duplicate and near-duplicate image files routinely account for between 20 and 40 per cent of total media storage in large organisations.
The Numbers Behind the Redundancy
Storage is not free. Commercial cloud providers typically charge Australian government clients in the range of $20 to $35 per terabyte per month for standard object storage, depending on contract tier. For an agency managing even 50 terabytes of media assets — a modest estimate for a department with years of accumulated communications and policy imagery — redundant files alone could represent a recurring cost of several hundred dollars a month, compounding annually. Multiply that across the roughly 20 directorates that sit under the ACT Public Service, and the aggregate becomes a genuine budget line.
The Australian Bureau of Statistics, located on Benjamin Way in Belconnen, publishes periodic data on government digital expenditure. Federal agencies more broadly spent over $10 billion on ICT in the 2022–23 financial year, according to the Australian Government's annual ICT expenditure report, with data storage forming a significant share. Canberra, as the epicentre of that federal workforce, hosts a disproportionate concentration of that infrastructure.
At ANU in Acton, the university's library and research data services teams manage image collections spanning decades of academic publishing and archival digitisation. Institutions of that scale deal with duplicate detection as a routine but resource-intensive task. The problem is compounded whenever staff turnover occurs — a persistent feature of Canberra's public service churn — because new employees frequently re-upload files already sitting elsewhere on the network.
What Duplicate Detection Actually Involves
Modern duplicate image detection relies on perceptual hashing algorithms, which generate a short numeric fingerprint for each image and flag files with fingerprints falling below a defined similarity threshold. Unlike simple file-size or filename matching, perceptual hashing catches images that have been resized, recompressed or slightly cropped — the kind of near-duplicates that accumulate when communications teams pull assets from shared libraries across multiple campaigns.
The University of Canberra's Information Sciences faculty at Bruce has published research touching on digital preservation and metadata integrity, areas directly relevant to the scale of this problem in government settings. The practical challenge for agencies is not identifying that duplicates exist — software can do that — but deciding which version of a file is the authoritative one, particularly when images carry embedded metadata such as copyright notices, geotags or accessibility descriptions that differ between copies.
For ACT government teams working under the Territory Records Act 2002, the stakes are not only financial. Records legislation requires that official images attached to policy documents, ministerial briefings or public communications are retained in their approved form. Deleting the wrong version of a duplicate is not just a storage error; it can constitute a records management breach.
Agencies reviewing their digital asset holdings before the end of the 2025–26 financial year — July 31 is the standard deadline for many ACT government annual reporting cycles — are being advised by records management professionals to run deduplication audits before migrating to any new platform, rather than after. Moving a bloated, redundant library into a new system simply relocates the problem at migration cost.
The practical upshot for Canberra's large institutional employers is straightforward: assign a records officer to audit image repositories against the National Archives' Digital Continuity Policy, run a perceptual hash comparison across asset libraries before the next contract renewal with your cloud provider, and document which file version carries the authoritative metadata. The savings are real. The compliance risk of not acting is equally real.