Skip to main content
The Daily Canberra

All of Canberra, every day

News

Canberra's Digital Archive Problem: The Numbers Behind a City Drowning in Duplicate Images

ACT government agencies and local institutions are sitting on terabytes of redundant image files, and the cost of cleaning up the mess is quietly mounting.

Share

By Canberra News Desk · Published 5 July 2026, 5:00 am

4 min read

Updated 4 h ago· 5 July 2026, 12:54 pm

How we reported this

This article was generated by AI from the linked public sources. The Daily Canberra is independently owned and covers Canberra news free from advertiser or sponsor influence. Read our editorial standards →

Canberra's public sector has an image problem — literally. A growing body of digital asset audits across ACT government agencies shows that duplicate image files now account for a significant share of storage overhead in government content management systems, with some departmental libraries carrying duplication rates above 40 percent. The problem has been quietly expensive for years. Now, with the ACT Government's broader digital transformation push accelerating through 2026, the reckoning is arriving.

The context matters. Federal and territory agencies alike have been migrating legacy content systems — some running on platforms first deployed in the early 2010s — onto cloud-based infrastructure. When those migrations happen without rigorous deduplication, old files get dragged along wholesale. A single photograph of, say, the Questacon building on King Edward Terrace might exist in five different crops, three different resolutions, and two different file formats, each saved by a different communications officer over a decade. Multiply that across dozens of agencies and the waste compounds fast.

What the numbers actually look like

Digital asset management specialists working across the Canberra market — including firms contracted to both the Australian Public Service and the ACT Government Directorate network — have flagged duplication as a systemic issue rather than an isolated one. Industry benchmarks suggest that in large content libraries without active deduplication protocols, between 30 and 60 percent of stored image files are redundant copies or near-duplicates. At standard AWS S3 storage rates, which hovered around USD $0.023 per gigabyte per month as of mid-2026, a department holding even 10 terabytes of redundant image data is spending roughly $2,760 per month — or more than $33,000 per year — storing files nobody needs.

The Australian National University's digital collections team, which manages photographic archives across the Acton campus and multiple off-site repositories, has been working since late 2024 on an internal deduplication project tied to its library system upgrade. The University of Canberra's institutional repository on Bruce campus faces a similar inventory challenge, particularly for research image datasets that are frequently duplicated across departmental servers when shared between faculties.

At the territory level, the ACT Government's Digital Strategy, which was updated in 2025, explicitly calls for improved data stewardship — but the specific problem of image duplication sits in a gap between IT asset management and communications workflows. Nobody owns it cleanly. Comms teams upload, IT teams store, and procurement teams pay the bills without always connecting those three functions.

The local cost beyond the server bill

The dollar figure on storage is only part of the story. The harder cost is human time. When a communications officer at a directorate office on London Circuit needs a photograph for a media release, a duplicated, untagged library means search time blows out. Estimates from digital workflow consultancies place the productivity drag at between 15 and 30 minutes per image search in poorly organised libraries — trivial per instance, but significant when multiplied across hundreds of staff and thousands of content requests per year.

Gungahlin and Belconnen community groups working with ACT Government grant portals have also reported friction when submitting image documentation for community project applications, partly because the backend systems these portals connect to lack consistent image validation — meaning the same photograph submitted in two formats creates two stored records.

The practical fix is neither glamorous nor particularly expensive. Automated perceptual hashing — software that identifies visually similar images even when file names differ — can process large libraries in days. Several APS agencies have piloted this approach since 2025, though full rollout across the Canberra public service remains patchy. The ACT Government's Procurement and Capital Works directorate is understood to be reviewing digital asset management contracts later this year, which would be the natural moment to bake deduplication requirements into new service agreements.

For local institutions that can't wait for a government-wide solution, the advice from digital archivists is blunt: audit before you migrate. Moving a bloated, duplicated image library to new infrastructure does not solve the problem — it just makes it more expensive to store.

You might also like

Editorial picks

How did this story land?

Spread the word

Share

Have your say

Loading comments…

Sources

About this article

Published by The Daily Canberra

Covering news in Canberra. This article was generated by AI from the linked sources and was not reviewed by a human editor before publishing. See our editorial standards.

Spread the word

Share

See something wrong? Suggest a correction.

Daily brief

Enjoyed this? Wake up to Canberra news every morning.

Free, in your inbox before 7am. Weekdays.

By subscribing you agree to receive emails from The Daily Canberra and accept our Privacy Policy. Unsubscribe anytime.

The Daily Network — local news across Australia