Skip to main content
The Daily Canberra

All of Canberra, every day

News

Canberra's Digital Archives Are Riddled With Duplicate Images — Here's What Officials and Experts Are Saying About It

Government agencies, researchers at ANU and local heritage bodies are pushing for a coordinated fix to a growing problem in public digital collections.

Share

By Canberra News Desk · Published 5 July 2026, 5:16 am

4 min read

Updated 3 h ago· 5 July 2026, 1:22 pm

How we reported this

This article was generated by AI from the linked public sources. The Daily Canberra is independently owned and covers Canberra news free from advertiser or sponsor influence. Read our editorial standards →

Canberra's Digital Archives Are Riddled With Duplicate Images — Here's What Officials and Experts Are Saying About It
Photo: Photo by Pat Saengcharoen on Pexels

Canberran public institutions are sitting on tens of thousands of duplicated digital image files — and the people responsible for managing them say the problem has quietly ballooned for years. The ACT Government's digital records directorate, the National Archives of Australia on Queen Victoria Terrace, and university libraries are all dealing with overlapping copies of the same images stored across incompatible systems, inflating storage costs and making legitimate records harder to find and verify.

The issue has sharpened in mid-2026 because several federal agencies are mid-cycle on mandatory digital asset audits required under updated Commonwealth records management frameworks. For institutions already stretched by public service workforce pressures, the administrative burden of identifying and removing duplicate image files is landing at a particularly awkward moment.

What the Experts Are Pointing To

Archivists and digital records specialists at the University of Canberra's iSchool — located on Kirinari Street in Bruce — have been examining the practical costs for smaller agencies. Their work points to a recurring pattern: images are uploaded multiple times through separate intake portals, then tagged inconsistently, which means automated deduplication tools flag only a fraction of the actual duplicates. The rest stay buried, consuming server space and muddying search results.

At ANU's Chifley Library, staff managing the university's digital collections have grappled with similar issues in photographic and map archives. The Menzies Research Collection alone holds digitised items going back to the 1960s, and librarians have noted publicly in conference papers that manual review remains the only reliable method for catching near-duplicate images — photographs of the same subject taken seconds apart, or scans made at different resolutions from the same physical document.

Industry specialists in digital asset management broadly agree that AI-assisted deduplication tools have improved significantly since 2023, but that implementation requires upfront investment and staff retraining that many Canberra-based agencies have been slow to prioritise. Procurement cycles in the ACT public service typically run 18 to 24 months for software of that category, which means decisions made now would not deliver working systems until late 2027 at the earliest.

Local Costs and the Housing Parallel

Storage is not free. Enterprise cloud storage contracts for mid-sized federal agencies typically run to several hundred thousand dollars annually, and duplicated files that go unaddressed compound those costs year on year. For an ACT government already managing tight capital budgets — the 2025-26 ACT Budget allocated funding for digital infrastructure upgrades across multiple directorates — the inefficiency is one that records managers have been raising through internal channels for at least two years.

There is also a heritage dimension. The ACT Heritage Library, based at Woden, holds photographic records of Canberra's development from the early twentieth century. Duplicates in that collection do not just cost money — they create uncertainty about which version of an image is the authoritative one for citation and publication. When researchers at institutions like the Australian Institute of Aboriginal and Torres Strait Islander Studies on Lawson Crescent request access to specific historical photographs, receiving multiple near-identical versions with different metadata creates genuine complications for their work.

Digital records consultants working with Canberra agencies say the most practical near-term step is a phased hash-matching audit — a process that compares files mathematically to find exact copies — before moving to the more complex task of identifying near-duplicates. That first phase can typically be completed in-house using existing tools, and several ACT directorates are understood to be scoping that work for the second half of 2026.

For institutions weighing next steps, the message from the records management community is consistent: waiting for a perfect AI solution is costlier than starting with a manual or semi-automated approach now. The Gungahlin and Belconnen satellite offices of several federal departments, which have their own local document management systems, are among the sites flagged as priorities in internal agency planning documents for digital clean-up rounds scheduled before the end of the financial year.

You might also like

Editorial picks

How did this story land?

Spread the word

Share

Have your say

Loading comments…

Sources

About this article

Published by The Daily Canberra

Covering news in Canberra. This article was generated by AI from the linked sources and was not reviewed by a human editor before publishing. See our editorial standards.

Spread the word

Share

See something wrong? Suggest a correction.

Daily brief

Enjoyed this? Wake up to Canberra news every morning.

Free, in your inbox before 7am. Weekdays.

By subscribing you agree to receive emails from The Daily Canberra and accept our Privacy Policy. Unsubscribe anytime.

The Daily Network — local news across Australia