Canberra's Digital Archives Push Hits a Snag: Duplicate Image Problem Exposed This Week
A systematic audit of the ACT government's digitisation program has surfaced thousands of duplicate images clogging shared databases, forcing agencies to pause indexing work mid-project.
This article was generated by AI from the linked public sources. The Daily Canberra is independently owned and covers Canberra news free from advertiser or sponsor influence. Read our editorial standards →
A routine quality check by ACT government archivists has uncovered a significant duplicate image problem across shared digital storage systems, with preliminary findings suggesting thousands of scanned files have been catalogued multiple times across at least three separate agency databases. The discovery, which came to light this week, has prompted a temporary pause in the broader digitisation push that the ACT Government Libraries and the Territory Records Office have been jointly running since early 2025.
The timing matters. The ACT's public service workforce — one of the largest per-capita concentrations of government employees in the country — has been pushing hard to migrate legacy paper records into searchable digital form. With Light Rail Stage 2 planning documents, Gungahlin growth corridor land assessments, and Belconnen community infrastructure files all part of the current digitisation batch, duplicated records create genuine administrative risk: staff referencing outdated or redundant versions of planning approvals could introduce errors into active project work.
What the Audit Found
The Territory Records Office, based on London Circuit in the city centre, identified the problem after a cross-agency review flagged unusual file-size bloat in the shared government content management system. Scanned documents from the former ACT Planning and Land Authority — absorbed into the current Environment, Planning and Sustainable Development Directorate — appear to account for a significant portion of the duplicates, according to background briefing materials circulated within the directorate this week. The Australian National University's digitisation partnerships program, which has assisted with several ACT government archiving projects at the Noel Butlin Archives Centre on the ANU campus in Acton, is not directly implicated but has been asked to review joint project files as a precaution.
Duplicate images in large-scale digitisation projects are not unusual. Industry benchmarks published by the Australian Society of Archivists suggest error rates of between two and five per cent are common in bulk scanning operations, with duplicates often arising when batch-scanning software fails to detect previously ingested files. At scale — and the ACT government's current program involves an estimated 1.2 million pages across its 2025-26 digitisation tranche — even a two per cent duplication rate means tens of thousands of redundant files consuming storage and distorting search results.
What Happens Next for Public Servants and Residents
The Territory Records Office is understood to be deploying automated deduplication tools this week to scrub the affected databases before indexing resumes. Staff at agencies including the Transport Canberra and City Services Directorate, which has offices at Dickson, have been advised not to rely on the shared archive portal for planning-era documents until a clean-up notice is issued — likely within the next fortnight, based on the internal project timeline.
For residents dealing with development applications or historical records requests — a common need in fast-growing suburbs like Gungahlin and along the Belconnen Way corridor — the practical advice from the Territory Records Office is to submit requests as normal but expect processing delays of up to 10 additional business days while the audit completes. Walk-in records services at the Archives reading room on Sudeley Street in Mitchell remain open, and staff there can assist with manual file retrieval as a workaround.
The episode also puts fresh scrutiny on procurement decisions underpinning the digitisation program. The ACT government's 2024-25 budget allocated funding for the multi-year archiving push, but the specific contract terms governing quality-assurance checks have not been made public. Whether the deduplication failure represents a gap in vendor contractual obligations or an internal process breakdown is a question the Territory Records Office has not yet answered publicly. A formal review of the program's quality controls is expected to be completed before the next major scanning batch begins in September 2026.
Covering news in Canberra. This article was generated by AI from the linked sources and was not reviewed by a human editor before publishing. See our editorial standards.