Skip to main content
The Daily Canberra

All of Canberra, every day

News

Canberra's Digital Archive Problem: The Numbers Behind the Duplicate Image Crisis

ACT government agencies and federal departments are sitting on millions of redundant digital files, and the storage bill is climbing every quarter.

Share

By Canberra News Desk · Published 5 July 2026, 4:48 am

4 min read

Updated 4 h ago· 5 July 2026, 12:17 pm

How we reported this

This article was generated by AI from the linked public sources. The Daily Canberra is independently owned and covers Canberra news free from advertiser or sponsor influence. Read our editorial standards →

Federal and territory agencies based in Canberra are collectively managing an estimated tens of millions of duplicate image files stored across departmental servers — a problem that costs real money, slows workflows, and is forcing a reckoning about how the public service handles digital asset management.

The timing matters. With the ACT government mid-way through a digital transformation push tied to its 2025–2030 Digital Strategy, and federal departments under renewed pressure to cut operational costs ahead of the May 2027 budget cycle, the mundane problem of duplicate image files has quietly become a line item that procurement officers and IT directors can no longer ignore.

At the Australian National University in Acton, the institutional repository team has been grappling with the issue for at least three years. ANU's library and digital collections unit manages more than four million image assets across research and administrative systems, according to figures the university has cited in its own published infrastructure reviews. A significant share of those — estimates within comparable university systems put the duplication rate at between 20 and 40 per cent — are functionally identical files stored under different filenames, in different folders, on different servers.

What the Numbers Actually Look Like

Storage is not free. Enterprise-grade cloud storage for Australian government entities typically runs between $0.02 and $0.05 per gigabyte per month under whole-of-government panel arrangements managed through the Digital Transformation Agency. A department holding 50 terabytes of image files — not unusual for a mid-size agency with a decade of accumulated records — could be spending upwards of $30,000 a year on storage alone. If 30 per cent of those files are duplicates, that is roughly $9,000 annually on data that serves no function.

Scaled across the 150-plus agencies and bodies housed in Canberra's parliamentary and civic precincts, the aggregate waste is not trivial. The Digital Transformation Agency, headquartered on Constitution Avenue in Reid, has flagged data hygiene as a component of the broader myGov and whole-of-government data reform program, though no single published figure captures the total cost of duplication across the Commonwealth estate.

The University of Canberra in Bruce has taken a more structured approach. UC's IT governance framework, updated in early 2024, includes a mandatory deduplication audit cycle for all shared drives attached to its research data management platform. The university has not publicly disclosed cost savings from the program, but comparable programs at Queensland University of Technology produced a reported 28 per cent reduction in active storage consumption within 18 months of implementation, according to QUT's own published case study from 2023.

The Local Workflow Problem

For public servants in Barton, Phillip and the Treasury cluster around Capital Hill, the duplicate image problem is less about storage cost and more about time. A 2024 survey by the Australian Public Service Commission found that APS employees estimated spending an average of 4.5 hours per week searching for files or resolving version-control issues — though the APSC's published figures do not isolate image duplication specifically as a contributor.

Deduplication software has existed for years. Tools such as those embedded in Microsoft SharePoint Online — the platform used across most Commonwealth agencies under the M365 whole-of-government licensing deal — include built-in detection for identical files. The catch is that automated deduplication requires an initial audit, clear retention rules, and sign-off from records managers under the Archives Act 1983. That last requirement slows rollout considerably in agencies with complex records obligations.

Practically, the path forward for ACT-based agencies starts with a baseline audit. The National Archives of Australia, based in Parkes, publishes guidance under its Check-Up Digital self-assessment program that agencies can use to establish how many image assets they hold and what proportion are likely duplicates. Running that assessment costs nothing beyond staff time. Fixing the problem, on the other hand, requires policy decisions about what to keep, a defensible disposal authority, and — for larger agencies — a project budget that realistically falls between $50,000 and $200,000 for a medium-sized department. The agencies that start that process now will be better positioned when the next round of operational efficiency reviews lands in Canberra.

You might also like

Editorial picks

How did this story land?

Spread the word

Share

Have your say

Loading comments…

Sources

About this article

Published by The Daily Canberra

Covering news in Canberra. This article was generated by AI from the linked sources and was not reviewed by a human editor before publishing. See our editorial standards.

Spread the word

Share

See something wrong? Suggest a correction.

Daily brief

Enjoyed this? Wake up to Canberra news every morning.

Free, in your inbox before 7am. Weekdays.

By subscribing you agree to receive emails from The Daily Canberra and accept our Privacy Policy. Unsubscribe anytime.

The Daily Network — local news across Australia