A quiet but costly problem has been building inside Canberra's government agencies and cultural institutions: thousands of duplicate digital images clogging storage systems, complicating public records obligations, and draining IT budgets that are already under pressure. The issue has moved from a back-office nuisance to a policy decision point, with several ACT government directorates now weighing whether to pursue automated deduplication tools, manual audits, or a hybrid approach before the end of the 2026–27 financial year.
The timing matters. The ACT Government's Digital Strategy, which sets benchmarks for data integrity and storage efficiency across directorates, flags compliance reviews scheduled for the September 2026 quarter. Agencies that cannot demonstrate orderly digital asset management risk being flagged in the annual Territory Records Office compliance report — a finding that carries both reputational and administrative consequences.
Where the Problem Is Sharpest
Two institutions illustrate the scale of the challenge. The Australian National University's library system, which manages one of the largest digital collections in the Southern Hemisphere, has been working through a multi-year digitisation program that has inadvertently produced overlapping image files across separate project batches. Meanwhile, the ACT Heritage Library on Parkes Place — the territory's principal repository for historical photographic collections — flagged in its most recent annual report that storage costs had risen materially due to redundant file accumulation across its digitisation grants program.
The problem is not unique to heritage bodies. The ACT Education Directorate, which manages digital resources across more than 90 public schools in suburbs stretching from Gungahlin in the north to Tuggeranong in the south, has been consolidating its learning management systems onto a single platform. That consolidation has surfaced duplicate image assets that were uploaded independently by schools over several years, creating version-control headaches and inflating cloud storage costs.
Commercial storage pricing adds urgency. Enterprise cloud storage in the AWS Sydney region — used by several ACT government agencies — was sitting at approximately AUD $0.025 per gigabyte per month for standard-tier storage as of mid-2026. For an agency carrying tens of thousands of redundant high-resolution image files, each running between 20 and 50 megabytes, the monthly bill from duplicate data alone can reach five figures. That is money the ACT Budget, which recorded a deficit of $878 million in 2025–26 according to the Mid-Year Economic and Fiscal Outlook released in December 2025, cannot absorb without scrutiny.
The Decisions That Will Define the Outcome
Three choices are now in front of decision-makers, and the window for making them is tighter than it looks. First, agencies must decide whether to deploy automated deduplication software — tools that scan file hashes and flag identical or near-identical images — or rely on staff-led audits. Automated tools are faster but require procurement approval and, in the case of sensitive heritage imagery, raise questions about whether algorithmic decisions to delete files satisfy the Territory Records Act 2002.
Second, there is the question of governance. The Chief Digital Officer's office, based on London Circuit in Civic, has been developing cross-agency data standards, but deduplication policy has not yet been formally incorporated into those standards. Whether that changes before the September review will depend on how much political bandwidth exists inside the ACT's digital reform agenda given competing priorities — light rail stage 2 planning, the Northbourne Avenue corridor redevelopment, and ongoing housing approvals in Belconnen are all consuming ministerial attention simultaneously.
Third, institutions like ANU and UC will need to decide whether they align with the ACT government's emerging framework or continue managing their own image repositories independently. Both universities receive ACT government research funding and digitisation grants, which creates at least an indirect lever for alignment.
The practical path most likely to gain traction is a staged one: automated flagging of obvious duplicates by October 2026, followed by human review of edge cases through the first quarter of 2027. Agencies that move early stand to reduce storage overheads before the next budget cycle and to avoid the compliance scrutiny that is coming regardless. Those that wait may find the decision made for them.