Skip to main content
The Daily Canberra

All of Canberra, every day

News

Canberra's Digital Archive Problem: The Numbers Behind the Territory's Duplicate Image Crisis

ACT government agencies and local institutions are sitting on enormous backlogs of redundant digital files, and the storage bill is growing every quarter.

Share

By Canberra News Desk · Published 5 July 2026, 6:17 am

4 min read

Updated 3 h ago· 5 July 2026, 1:52 pm

How we reported this

This article was generated by AI from the linked public sources. The Daily Canberra is independently owned and covers Canberra news free from advertiser or sponsor influence. Read our editorial standards →

Canberra's Digital Archive Problem: The Numbers Behind the Territory's Duplicate Image Crisis
Photo: Photo by Daniel Morton-Jones on Pexels

The ACT government holds tens of thousands of duplicate images across its digital asset libraries — and nobody agrees on exactly how many. What is clear, from procurement records and IT sector benchmarks, is that the problem costs money, slows workflows, and is getting harder to ignore as storage contracts come up for renewal across multiple directorates in the second half of 2026.

Duplicate image accumulation is not unique to Canberra, but the Territory's structure makes it a particularly acute local issue. With a public service workforce that turns over faster than almost any comparable jurisdiction — as federal and ACT agencies compete for the same pool of graduates coming out of the Australian National University and the University of Canberra — institutional knowledge about what files exist, where they live, and whether a colleague uploaded the same photograph two years ago tends to walk out the door with departing staff.

What the data shows about redundant files

Industry analysis from digital asset management consultancies — whose findings have been cited in multiple Australian government procurement discussions — consistently finds that between 30 and 40 per cent of images stored in enterprise content management systems are exact or near-exact duplicates. Applied to mid-sized government operations, that proportion translates to a measurable drag on storage budgets. Cloud storage for government-grade environments in Australia was pricing at roughly $0.023 per gigabyte per month as of late 2025, according to published rate cards from major providers. An agency holding 10 terabytes of image content and carrying a 35 per cent duplication rate is effectively paying for 3.5 terabytes of nothing every single month.

The ACT's own digital records obligations compound the issue. Under the Territory Records Act 2002, agencies cannot simply delete files without following an authorised disposal schedule. That means duplicates often sit in systems for years, pending a formal culling process that requires staff time and sign-off from the ACT Territory Records Office, based in Greenway. The backlog, across directorates ranging from Transport Canberra to the Suburban Land Agency, has grown as agencies added remote-work infrastructure after 2020 and staff began uploading assets from personal devices and shared drives without consistent naming conventions.

Local organisations starting to act

At the University of Canberra's Bruce campus, the library and IT services division began a structured duplicate-detection audit in early 2026, using automated perceptual hashing tools that compare images at a pixel-cluster level rather than relying on file names or metadata. The approach, now increasingly standard in the sector, can process thousands of images per hour and flag matches for human review rather than automatically deleting anything. UC has not publicly disclosed the scale of its findings, but the methodology is now being discussed inside at least two ACT government directorates as a potential model.

The National Archives of Australia, headquartered on Queen Victoria Terrace in Parkes, faces a related but distinct version of the same problem. Its digitisation programs have produced millions of image files since the early 2000s, and deduplication has been an ongoing challenge as different scanning batches produced slightly different versions of the same source document. The Archives has flagged digital preservation as a priority area in public budget submissions, though specific internal duplication figures have not been released.

For smaller ACT government units and statutory bodies, the practical path forward involves three steps that IT procurement officers are increasingly building into tender specifications: mandatory deduplication audits before any cloud migration, minimum metadata standards at the point of upload, and quarterly automated scans using tools that can distinguish true duplicates from intentional variants — say, a cropped version of a photograph held alongside the original. Gungahlin-based agencies that moved into newer office fit-outs in the past three years and adopted fresh content management platforms have a cleaner starting point than older inner-north directorates still migrating from legacy systems.

The next round of ACT whole-of-government ICT procurement discussions is scheduled for the fourth quarter of 2026. Whether deduplication standards end up embedded in those contracts will shape how the Territory manages its digital image problem for the better part of the next decade.

You might also like

Editorial picks

How did this story land?

Spread the word

Share

Have your say

Loading comments…

Sources

About this article

Published by The Daily Canberra

Covering news in Canberra. This article was generated by AI from the linked sources and was not reviewed by a human editor before publishing. See our editorial standards.

Spread the word

Share

See something wrong? Suggest a correction.

Daily brief

Enjoyed this? Wake up to Canberra news every morning.

Free, in your inbox before 7am. Weekdays.

By subscribing you agree to receive emails from The Daily Canberra and accept our Privacy Policy. Unsubscribe anytime.

The Daily Network — local news across Australia