Canberra's public sector organisations are sitting on vast libraries of duplicate digital images — redundant files costing storage budgets, complicating records management and, in some cases, creating compliance risks under federal archiving rules. The issue has sharpened this year as agencies face tighter ICT budgets after the May federal budget, and as the ACT Government pushes departments to consolidate digital infrastructure ahead of a broader cloud-migration program due for completion by mid-2027.
For a city where the public service is the dominant employer, the problem is not abstract. The Australian Public Service Commission, which has offices on Constitution Avenue in Reid, maintains guidance for agencies on digital records hygiene, but implementation has been uneven across the roughly 100 entities covered by the Public Governance, Performance and Accountability Act 2013. Duplicate images — from scanned correspondence, satellite data, research outputs and citizen-facing portals — accumulate quietly until a migration or audit forces a reckoning.
Why It Matters Right Now
The timing is pointed. The National Archives of Australia, based on Queen Victoria Terrace in Parkes, has been updating its digital preservation framework, and agencies must demonstrate that records held digitally are authentic, complete and retrievable. A single master image being stored as four or five untagged duplicates across different departmental drives doesn't just waste space — it can muddy provenance, making it harder to confirm which version of a document is the authoritative one.
At the Australian National University in Acton, researchers working with large geospatial and scientific datasets have long grappled with the same issue at an institutional scale. ANU's library and digital collections teams have piloted deduplication workflows as part of their digital research infrastructure work, though the university has not publicly released figures on storage reclaimed. The University of Canberra in Bruce has similarly flagged data governance as a priority in its 2024–2027 strategic plan, noting the need to improve how research data is catalogued and maintained.
Digital records specialists — speaking generally, not about any specific agency — point to three recurring causes: staff uploading files through multiple channels (email, shared drives, content management systems), migrations that copy rather than consolidate, and the absence of automated deduplication at the point of ingest. The problem compounds over time. An agency that ingests 50,000 scanned documents per year and has no deduplication in place can theoretically accumulate hundreds of thousands of redundant files within a decade.
What Experts and Practitioners Are Recommending
Practitioners in Canberra's govtech community — a cluster that includes consultancies along Northbourne Avenue and in the Civic precinct — generally advocate for three interventions: implementing perceptual hashing tools that flag near-identical images at upload, establishing a single authoritative digital asset management system per agency, and scheduling regular reconciliation audits tied to the National Archives' Digital Continuity 2020 policy, which has been in force since 2016.
The Digital Transformation Agency, which operates out of offices in Mort Street, Braddon, has published guidance on whole-of-government platforms that can help, including shared cloud storage arrangements under the Whole of Australian Government cloud panel. Agencies that have moved to those arrangements report cleaner deduplication outcomes because ingestion rules can be enforced centrally rather than left to individual teams.
Cost is real. Commercial cloud storage in Australian government-certified environments typically runs at rates between $0.02 and $0.05 per gigabyte per month, depending on access tier and provider. For a medium-sized agency storing tens of terabytes of unnecessarily duplicated image data, that is a recurring and avoidable expense every financial year.
For Canberra organisations navigating the issue, the practical starting point is a storage audit against the National Archives' Check-up Digital tool, which agencies are required to complete annually. Identifying where duplicate images live — whether in a SharePoint environment, a legacy TRIM system, or a departmental intranet — is the prerequisite before any remediation can begin. The next scheduled reporting cycle under that framework falls in the first quarter of 2027, giving agencies roughly eight months to get their houses in order.