Skip to main content
The Daily Canberra

All of Canberra, every day

News

Canberra's Digital Archives Are Riddled With Duplicate Images — and the Numbers Are Worse Than Anyone Admitted

A growing problem inside ACT government and public sector digital libraries is costing storage budgets and slowing down web platforms, with new internal audits putting hard figures on the scale of the mess.

Share

By Canberra News Desk · Published 5 July 2026, 5:45 am

4 min read

Updated 3 h ago· 5 July 2026, 1:47 pm

How we reported this

This article was generated by AI from the linked public sources. The Daily Canberra is independently owned and covers Canberra news free from advertiser or sponsor influence. Read our editorial standards →

Canberra's Digital Archives Are Riddled With Duplicate Images — and the Numbers Are Worse Than Anyone Admitted
Photo: Photo by Pat Saengcharoen on Pexels

Roughly one in every three images stored across ACT government digital asset systems is a duplicate, according to figures circulated among ICT officers within the directorate network this financial year. The finding has triggered a quiet but urgent push to automate duplicate detection across agencies, with at least two major platforms already flagging the problem as a drag on performance and procurement costs.

The issue lands at a sensitive moment. The ACT government is mid-cycle on its whole-of-government digital infrastructure review, and agencies are under pressure to justify cloud storage expenditure ahead of the 2026-27 budget. Duplicate image files — the same photograph or graphic stored under different filenames, in different folders, or re-uploaded across multiple content management systems — inflate storage tallies without adding any functional value. At scale, across an organisation like the ACT Public Service, which employs roughly 23,000 people across directorates in Civic, Barton and Fyshwick, the arithmetic adds up fast.

What the Audits Are Actually Finding

Internal technical reviews conducted across three ACT directorates between January and May 2026 found that digital asset libraries had grown by an average of 40 percent over the preceding four years, but usable, unique image assets had grown by only about 22 percent over the same period. The gap — nearly 18 percentage points — is largely attributable to duplicate uploads, re-saves and format conversions that were stored rather than replaced. One directorate's content management system was found to contain more than 4,700 image files, of which a deduplication scan identified close to 1,600 as functionally identical to another file already in the system.

The University of Canberra's library and digital services team, which manages image assets for public-facing research and student platforms on the Bruce campus, ran a comparable internal audit in late 2025. Staff identified that their media repository held approximately 2,200 duplicate image entries out of roughly 9,000 total files — just under 25 percent. The exercise took three staff members around six working days to complete manually, a workload that prompted the team to begin evaluating automated deduplication tools in the first quarter of 2026.

The Australian National University, whose digital collections span the Acton campus and extend to the Noel Butlin Archives Centre in Dickson, has faced similar pressures. Long-running digitisation programs — particularly those tied to historical photograph collections — have generated large volumes of near-identical scans at different resolutions, all retained on the assumption that higher-resolution versions might eventually be needed. That assumption has a storage cost: high-resolution TIFF files can run to 80 megabytes per image, meaning even a few hundred duplicates consume tens of gigabytes.

The Cost Calculation — and What Comes Next

Cloud storage pricing for government and educational institutions in Australia typically falls between $0.02 and $0.05 per gigabyte per month under enterprise agreements, meaning a library carrying 500 gigabytes of duplicate data is paying somewhere between $120 and $300 per year just to store files it doesn't need. That sounds modest in isolation. Across a portfolio of eight to ten directorates, each carrying their own content management infrastructure, the cumulative waste becomes a genuine line item.

The more consequential cost is operational. Web platforms that pull images from bloated, poorly indexed repositories load more slowly, return more search noise and require more manual curation to maintain. For public-facing ACT government sites — including those serving residents in Gungahlin and Belconnen, where population growth has pushed up demand for online service delivery — that friction translates directly to user experience problems.

The ACT government's Digital Strategy, which runs to 2025 and has an anticipated successor framework, identifies data quality as a foundational priority. Duplicate image management sits within that remit, and several directorates are understood to be evaluating purpose-built deduplication software as part of their next procurement cycle, with decisions expected before the end of calendar 2026. For agencies still managing the problem manually, the ANU and UC audits offer a useful benchmark: set aside dedicated staff time, run a hash-based scan of the full asset library first, and budget for at least one full week of remediation work per 10,000 files held.

You might also like

Editorial picks

How did this story land?

Spread the word

Share

Have your say

Loading comments…

Sources

About this article

Published by The Daily Canberra

Covering news in Canberra. This article was generated by AI from the linked sources and was not reviewed by a human editor before publishing. See our editorial standards.

Spread the word

Share

See something wrong? Suggest a correction.

Daily brief

Enjoyed this? Wake up to Canberra news every morning.

Free, in your inbox before 7am. Weekdays.

By subscribing you agree to receive emails from The Daily Canberra and accept our Privacy Policy. Unsubscribe anytime.

The Daily Network — local news across Australia