Skip to main content
The Daily Canberra

All of Canberra, every day

News

Canberra's Digital Archives Are Drowning in Duplicate Images — and the Numbers Tell a Damning Story

ACT government agencies and local institutions are sitting on terabytes of redundant image files, and the cost of doing nothing is quietly mounting.

Share

By Canberra News Desk · Published 5 July 2026, 5:06 am

4 min read

Updated 4 h ago· 5 July 2026, 1:00 pm

How we reported this

This article was generated by AI from the linked public sources. The Daily Canberra is independently owned and covers Canberra news free from advertiser or sponsor influence. Read our editorial standards →

Canberra's public sector has a clutter problem measured in terabytes. Across ACT government agencies, university research libraries and National Capital Authority digital collections, duplicate image files now account for a significant share of managed storage — redundant photographs, scanned documents and design assets that exist in two, three, sometimes a dozen identical copies across separate servers and cloud environments. The push to finally address the problem is gaining momentum in 2026, and the data behind it is hard to ignore.

The timing matters. The federal government's data storage footprint has grown sharply since the pandemic-era shift to remote work. Australian Bureau of Statistics figures published in early 2026 showed the Commonwealth held more than 10 exabytes of data across its agencies, with unstructured content — documents, images, video — making up the fastest-growing category. For the ACT's own institutions, the overlap between federal and territory digital infrastructure creates a particularly messy environment where the same image can live simultaneously on an agency's internal drive, a shared SharePoint instance and an offsite backup cluster.

What Redundancy Actually Costs

Storage is not free. Enterprise cloud storage through providers contracted to the Australian Government typically costs between $0.02 and $0.05 per gigabyte per month under current whole-of-government arrangements, according to pricing frameworks published by the Digital Transformation Agency. At scale, duplicate image libraries inflate those bills in ways that IT budget managers are now being pushed to justify to finance committees. A collection of 500,000 high-resolution images — not unusual for a department like the Australian Institute of Aboriginal and Torres Strait Islander Studies, based in Acton, or the National Archives of Australia at its Mitchell facility — can occupy several terabytes before duplication is factored in. Industry estimates used in government procurement discussions suggest 20 to 40 percent of unstructured storage in large organisations consists of exact or near-exact duplicates.

At ANU's Chifley Library, digital collections staff have been working through a remediation program since late 2025, using automated deduplication tools to flag redundant image files before migrating holdings to a new archival platform. The university has not publicly disclosed figures, but the broader sector benchmark is instructive: the Digital Preservation Coalition published guidance in 2024 suggesting institutions running active deduplication workflows typically recover 15 to 30 percent of their allocated storage capacity within the first 12 months. For a mid-sized agency paying roughly $8,000 a month in cloud storage, that recovery rate translates to real money.

Local Programs Trying to Get Ahead of It

The ACT Government's Service Canberra digital transformation agenda, operating out of offices on Northbourne Avenue, has flagged data hygiene as a workstream priority for the 2026–27 budget cycle. Territory Records Office guidelines updated in March 2026 now include specific instructions for agencies on identifying and disposing of duplicate digital assets before transferring records to long-term custody. That guidance covers photographic records explicitly — a category that had previously fallen into a grey zone between IT asset management and formal records management.

Organisations in Belconnen's town centre precinct, including several Commonwealth regulatory bodies with large public-facing image libraries, have begun auditing holdings ahead of the July 2026 end-of-financial-year IT expenditure reviews. The practical prompt is straightforward: storage budgets are being scrutinised, and duplicates are an easy target for cuts that don't require deleting anything of genuine value.

For smaller ACT entities — community organisations, suburban councils, venues around Braddon and Kingston — the problem is less about cloud bills and more about workflow. When the same photograph exists in five slightly different export sizes across three staff members' drives, finding the right version wastes time and occasionally means the wrong one gets published. Free and low-cost deduplication tools, including open-source options compatible with Windows and macOS, can scan a local drive and flag duplicates in under an hour for collections below 100 gigabytes. The ACT Government's Digital Canberra resources page lists several approved vendor options for agencies operating under the territory's ICT procurement rules. Starting the audit before the next budget cycle closes is, by any measure, the cheaper option.

You might also like

Editorial picks

How did this story land?

Spread the word

Share

Have your say

Loading comments…

Sources

About this article

Published by The Daily Canberra

Covering news in Canberra. This article was generated by AI from the linked sources and was not reviewed by a human editor before publishing. See our editorial standards.

Spread the word

Share

See something wrong? Suggest a correction.

Daily brief

Enjoyed this? Wake up to Canberra news every morning.

Free, in your inbox before 7am. Weekdays.

By subscribing you agree to receive emails from The Daily Canberra and accept our Privacy Policy. Unsubscribe anytime.

The Daily Network — local news across Australia