The Daily Milan

Milan news, every day

News

Milan Moves to Purge Duplicate Images From Its Digital Urban Archive — and the Rest of the World Is Watching

As cities from Amsterdam to Tokyo wrestle with bloated civic image databases, Milan's cultural institutions are piloting a systematic deduplication programme that could set a new standard for European urban heritage management.

By Milan News Desk · Published 4 July 2026, 9:11 pm

3 min read

Milan Moves to Purge Duplicate Images From Its Digital Urban Archive — and the Rest of the World Is Watching
Photo: Photo by Mathias Reding on Pexels

Milan's civic digital archive holds tens of thousands of photographs documenting the city's transformation — from the rubble of post-war Porta Romana to the gleaming towers of Porta Nuova rising along Viale della Liberazione. The problem: nobody is entirely sure how many of those images are unique. A technical audit commissioned by the Comune di Milano and completed in spring 2026 identified redundant image files running into the hundreds of thousands across the city's interconnected municipal databases, according to internal planning documents reviewed by this newspaper.

The issue matters because Milan is under unusual pressure right now. With the Milan-Cortina 2026 Winter Olympics opening ceremony scheduled for February 6, 2026 — already passed — and the post-Games legacy phase now fully underway, the city's communications infrastructure is being scrutinised as never before. The regional government of Lombardy and the centre-left administration under Mayor Beppe Sala have found common ground on very little lately, but digital archive integrity is one area where both sides acknowledge the problem is real and the cost of ignoring it is growing.

The deduplication effort is being led jointly by the Archivio Fotografico del Comune di Milano, based on Via Palazzo Reale, and the digital services division of Fondazione Milano, which oversees several of the city's cultural data platforms. Their working method involves running perceptual hashing algorithms — software that generates a fingerprint for each image based on visual content, not just filename — across legacy storage systems that in some cases date to the early 2000s. The pilot phase began in March 2026 and is expected to produce a full deduplication report by October.

How Milan Compares to Amsterdam, Paris and Tokyo

Milan is not alone in confronting this. Amsterdam's Stadsarchief, which manages one of Europe's largest municipal photographic collections, completed a comparable exercise in 2023, reducing its working digital image inventory by roughly 18 percent after eliminating confirmed duplicates. Paris's Direction des Affaires Culturelles launched its own deduplication protocol across the Bibliothèque historique de la Ville de Paris in late 2024, citing storage costs running at approximately €240,000 per year for redundant data alone. Tokyo's Metropolitan Archives undertook a phased clean-up between 2022 and 2025, processing over 4 million image files.

Milan's challenge is complicated by the fashion and design sector, which has generated enormous volumes of commercial imagery now partially absorbed into civic archives through partnership agreements with institutions such as the Museo del Novecento and the Triennale di Milano on Viale Alemagna. Those partnerships brought prestige and content, but also layered duplication: the same runway photograph from a 1990s collection might exist in six formats across four separate servers. Resolving rights and ownership before deletion adds legal complexity that Amsterdam, with a simpler archive structure, did not face at comparable scale.

What Comes Next for the Archive — and for Anyone Who Uses It

For journalists, researchers and urban planners who regularly draw on Milan's public image holdings, the practical upshot is significant. Once the deduplication work concludes, the Archivio Fotografico is expected to migrate its cleaned catalogue to a unified content management system with public API access — a change that would, for the first time, allow external users to search by visual similarity rather than keyword tags. A provisional launch window of early 2027 has been discussed in municipal planning sessions, though no date has been formally confirmed.

The budget earmarked for the full programme sits at approximately €1.2 million across two fiscal years, according to figures circulated in the Comune di Milano's 2026 technology spending outline. That compares favourably with Paris's outlay but is modest relative to Tokyo's multi-year effort, which was reported by Japan's Agency for Cultural Affairs to have cost the equivalent of around €3.8 million. For a city of Milan's ambition — and its self-image as a global design capital — getting the numbers right before the archive goes public will matter as much as the technology itself.

Topic:#News

How does this story make you feel?

Spread the word

See something wrong? Suggest a correction.

Have your say

Loading comments…

Sources

About this article

Published by The Daily Milan

This article was produced by the The Daily Milan editorial desk and covers news in Milan. See our editorial standards for how we use AI.

The Daily Milan brief

The day's Milan news in a 2-minute read, every weekday morning. Free.

By subscribing you agree to receive emails from The Daily Milan and accept our Privacy Policy. Unsubscribe anytime.

Daily brief

Enjoyed this? Wake up to Milan news every morning.

Free, in your inbox before 7am. Weekdays.

By subscribing you agree to receive emails from The Daily Milan and accept our Privacy Policy. Unsubscribe anytime.

More from The Daily Milan

More in News

Enjoyed this story? Get tomorrow's briefing free.