Skip to main content
The Daily Canberra

All of Canberra, every day

News

How Canberra's Digital Archives Ended Up Full of Duplicates — and What's Being Done About It

Years of siloed government IT systems and rapid agency expansion left ACT and federal records repositories riddled with duplicate images, and the bill to fix it is now coming due.

Share

By Canberra News Desk · Published 5 July 2026, 5:45 am

4 min read

Updated 3 h ago· 5 July 2026, 1:47 pm

How we reported this

This article was generated by AI from the linked public sources. The Daily Canberra is independently owned and covers Canberra news free from advertiser or sponsor influence. Read our editorial standards →

How Canberra's Digital Archives Ended Up Full of Duplicates — and What's Being Done About It
Photo: Photo by Katie Barget on Pexels

Canberra's public sector has a clutter problem hiding in plain sight. Across federal agencies concentrated along Constitution Avenue and the ACT government's own digital infrastructure, duplicate images — scanned documents, photographs, policy graphics, satellite maps — have accumulated across decades of overlapping IT procurement cycles, agency mergers, and emergency digitisation drives. The reckoning is now well underway, with several Commonwealth and Territory bodies actively auditing their holdings in 2026.

The problem did not emerge overnight. It traces back to at least the early 2000s, when individual directorates began scanning paper records independently, with no shared taxonomy and no central deduplication mandate. When the Australian Public Service Commission pushed agencies toward cloud storage platforms after 2015, teams migrated existing holdings wholesale rather than cleaning them first. Duplicates travelled with them.

A Legacy of Siloed Systems

The National Archives of Australia, headquartered in Parkes on Queen Victoria Terrace, holds responsibility for Commonwealth records and has been working through a multi-year digital preservation program. The ACT government's own records sit under the jurisdiction of the Territory Records Office, which operates under the Territory Records Act 2002. Neither body has historically coordinated deduplication efforts with the other, meaning images of the same Canberra infrastructure projects — think the light rail corridor through Gungahlin, or flood mapping around the Molonglo Valley — can appear in multiple repositories under different file names and metadata tags.

The Australian National University's archives and digital humanities teams in Acton have flagged the issue in their own collections too. Researchers working with historical photographic holdings have encountered thousands of near-identical scans created during different digitisation rounds, each tagged with conflicting dates or provenance notes. The University of Canberra's library digitisation unit in Bruce has reported similar friction when drawing on shared government datasets for urban planning research.

Part of the structural cause is procurement-driven. Federal IT contracts have historically been awarded agency by agency, with vendors setting their own file ingestion standards. A 2022 Australian National Audit Office report on digital record-keeping found significant inconsistencies in how agencies classified and stored digital assets — a finding that has continued to shape reform conversations inside the Department of Finance's whole-of-government digital policy work on Newlands Street in Barton.

The Cost of Cleaning Up

Storage is not free. Cloud hosting costs for Commonwealth agencies — under whole-of-government arrangements brokered through the Digital Transformation Agency — run to hundreds of millions of dollars annually across the federal estate. Duplicate images inflate that storage footprint directly. Industry estimates for large-scale deduplication projects in the Australian government context typically range from $50,000 to well over $500,000 depending on collection size, metadata complexity, and whether manual review is required for ambiguous matches.

The ACT government's digital services team, operating out of offices in Canberra City, has been piloting automated deduplication tools on selected collections since late 2025 as part of a broader data governance refresh. The tools use hash-matching and perceptual comparison algorithms to flag likely duplicates for human review rather than deleting automatically — a cautious approach designed to avoid destroying unique records that happen to look similar to others.

The timing matters because the ACT is also in the middle of a records expansion phase. Growth in Gungahlin and the Molonglo Valley means new planning documents, environmental assessments, and infrastructure photographs are being added to government holdings at a faster rate than at any point in the territory's history. Getting the deduplication baseline right now shapes how manageable those collections will be in five years.

For public servants and researchers working with Canberra's digital record base, the practical upshot is straightforward. Agencies are being advised through the Digital Transformation Agency's guidance frameworks to enforce single-source ingestion rules for new acquisitions and to document provenance at point of capture rather than retrospectively. For existing holdings, a phased audit approach — starting with the highest-volume image collections — is considered best practice. The alternative is a problem that compounds with every new scanning contract signed.

You might also like

Editorial picks

How did this story land?

Spread the word

Share

Have your say

Loading comments…

Sources

About this article

Published by The Daily Canberra

Covering news in Canberra. This article was generated by AI from the linked sources and was not reviewed by a human editor before publishing. See our editorial standards.

Spread the word

Share

See something wrong? Suggest a correction.

Daily brief

Enjoyed this? Wake up to Canberra news every morning.

Free, in your inbox before 7am. Weekdays.

By subscribing you agree to receive emails from The Daily Canberra and accept our Privacy Policy. Unsubscribe anytime.

The Daily Network — local news across Australia