Thousands of duplicate image files have been identified within the ACT Heritage Library's digital catalogue this week, creating a bottleneck that administrators say is delaying public access to newly scanned historical records. The problem surfaced during a scheduled audit of the Canberra Images Online database, a joint initiative between ACT Libraries and the Canberra Museum and Gallery on London Circuit in Civic.
The timing matters. The ACT Government committed earlier this year to expanding digital access to its heritage collections by the end of the 2026 financial year, a deadline that has now become harder to meet. Duplicate image files — some records show the same photograph appearing as many as four or five times under different catalogue numbers — mean curators must manually review and merge entries before new material can be published cleanly to public-facing portals.
How the Problem Built Up
The duplication issue is not new, but its scale became visible this week when a batch of approximately 14,000 newly digitised images from the Noel Butlin Archives Centre at the Australian National University on Acton Peninsula was being prepared for integration. Metadata inconsistencies between the ANU's own cataloguing system and the ACT Libraries format meant automated ingestion tools flagged conflicts, generating multiple entries for identical files rather than consolidating them.
Digitisation projects routinely grapple with this kind of interoperability problem, particularly when multiple institutions contribute material to a shared repository. The Canberra Images Online platform draws from at least six separate source collections, including holdings from the Canberra District Historical Society based in Mitchell and photographic records from the former National Capital Development Commission. Each collection arrived with its own naming conventions, and the software used to deduplicate entries has not kept pace with the volume of incoming material.
The ACT Budget handed down in June 2026 allocated $2.3 million over two years to digitisation and digital preservation across ACT Cultural Facilities Corporation programs. Whether that funding covers the additional remediation work now required has not been confirmed publicly by the directorate responsible.
What Curators Are Doing About It
Staff at the Canberra Museum and Gallery have begun a manual triage process, cross-referencing catalogue entries against original accession records. The work is expected to take several weeks. Researchers at the University of Canberra's Faculty of Arts and Design on Kirinari Street in Bruce, who use the Canberra Images Online database for urban history and planning research, have been advised informally that some search results may return incomplete or redundant results during the remediation period.
The duplication problem also has a practical cost beyond inconvenience. Storage for duplicated high-resolution image files — each archival-quality scan can run to 80 megabytes or more — accumulates quickly. Industry benchmarks from the Digital Preservation Coalition suggest duplicate file bloat can inflate storage costs by 15 to 30 percent in mid-sized institutional repositories, though the ACT Libraries directorate has not released figures specific to its own holdings.
Longer term, the incident has renewed calls from heritage professionals for a standardised metadata framework across ACT government-held collections. A working group involving representatives from ACT Libraries, the National Archives of Australia on Queen Victoria Terrace in Parkes, and the Australian Institute of Aboriginal and Torres Strait Islander Studies has been meeting since March 2026 to draft common ingestion standards, but no final guidelines have been published.
For researchers and members of the public who rely on the database, the practical advice for now is to cross-check any Canberra Images Online results against the physical catalogue at the Civic library branch on London Circuit, where staff can confirm whether a particular image has a valid, unique accession record. The directorate is expected to provide a public update on the remediation timeline before the end of July 2026.