The Daily Milan

Milan news, every day

News

Milan Leads European Push to Purge Duplicate Images From Public Records, But Rivals Are Closing the Gap

From the Porta Nuova skyline to centuries-old archival holdings in the Brera district, the city is deploying AI-assisted deduplication tools — and other global fashion and design capitals are watching closely.

By Milan News Desk · Published 4 July 2026, 9:00 pm

3 min read

Milan Leads European Push to Purge Duplicate Images From Public Records, But Rivals Are Closing the Gap
Photo: Photo by Riccardo Falconi on Pexels

Milan's municipal digital archive office formally launched its image-deduplication programme in March 2026, targeting roughly 4.2 million photographs, architectural renderings and design assets held across city-managed repositories. The initiative, run through the Comune di Milano's Direzione Sistemi Informativi, sets a binding deadline of December 31, 2026 — timed deliberately to coincide with the close of the Milan-Cortina Winter Olympics — to eliminate redundant visual records that have accumulated over two decades of digitisation drives.

The timing is not coincidental. With Olympic broadcast partners and international media organisations demanding clean, licensable image libraries from the host city, the pressure to deliver a verified, deduplicated visual archive has intensified sharply this year. The problem is neither trivial nor cosmetic: duplicated records inflate storage costs, create legal ambiguity over licensing rights, and slow down the retrieval systems that journalists, researchers and city planners rely on daily.

What Milan Is Actually Doing

The operational hub for the project sits within the Palazzo del Comune's data infrastructure team in Via Larga, with secondary processing nodes housed at the Politecnico di Milano's artificial intelligence laboratory in Bovisa. The university partnership, formalised in a January 2026 agreement, gives the city access to perceptual hashing and convolutional neural network tools developed by Politecnico researchers. Those tools can flag near-duplicate images — photographs taken seconds apart, or scans of the same archival print at different resolutions — rather than only catching pixel-perfect copies.

The Fondazione Prada's digital archive team, which manages tens of thousands of images documenting its exhibitions at the Largo Isarco venue, has been cited internally by the Comune as a private-sector model for how deduplication can be embedded into ongoing cataloguing workflows rather than treated as a one-off clean-up exercise. The foundation declined to provide figures publicly, but its methodology has informed at least part of the Comune's operational protocol.

Budget for the civic programme stands at €1.8 million across 2026, according to documents published in the city council's delibera of February 17, 2026. Roughly 60 percent of that allocation covers software licensing and cloud processing fees, with the remainder funding three contract data analysts and staff training across six municipal departments.

How Milan Compares to Amsterdam, Paris and Tokyo

Milan is not alone. Amsterdam's Stadsarchief completed a comparable deduplication exercise across its 750,000-image digital collection in late 2024, using open-source tooling developed partly by the Dutch national cultural heritage agency Rijksdienst voor het Cultureel Erfgoed. The Amsterdam project took 14 months and removed approximately 18 percent of its visual holdings as confirmed duplicates or degraded redundant copies — a benchmark that Milan's project managers regard as a realistic target for their own cull.

Paris is further behind. The Bibliothèque nationale de France announced a multi-year image audit programme in September 2025, but the first operational phase covering its Gallica digital platform is not expected to conclude before mid-2027. Tokyo's National Diet Library launched a deduplication pilot in fiscal year 2025 covering newspaper photograph archives, though the scope remains narrower than Milan's cross-departmental ambition.

Where Milan holds a specific advantage is in the fashion and design sector. The Camera Nazionale della Moda Italiana, based in Via Gerolamo Morone, began pushing member brands in early 2025 to adopt standardised image metadata schemas — a precondition for effective deduplication at scale. That industry pressure means the city's commercial image ecosystem is more structured than those of comparable European cities, giving the Comune's tools cleaner data to work with from the outset.

For organisations and individuals whose images sit inside municipal or partner archives, the practical upshot over the next six months is straightforward: the Comune's portal for image rights enquiries, accessible through the city's Servizi Digitali platform, will be updated quarterly as batches of the archive are verified and cleared. Rights holders who believe their material has been incorrectly flagged for removal have a 30-day window to contest each batch notification, a process the Direzione Sistemi Informativi says will open formally in September 2026. The Olympic deadline is fixed. The backlog is large. The clock is running.

Topic:#News

How does this story make you feel?

Spread the word

See something wrong? Suggest a correction.

Have your say

Loading comments…

Sources

About this article

Published by The Daily Milan

This article was produced by the The Daily Milan editorial desk and covers news in Milan. See our editorial standards for how we use AI.

The Daily Milan brief

The day's Milan news in a 2-minute read, every weekday morning. Free.

By subscribing you agree to receive emails from The Daily Milan and accept our Privacy Policy. Unsubscribe anytime.

Daily brief

Enjoyed this? Wake up to Milan news every morning.

Free, in your inbox before 7am. Weekdays.

By subscribing you agree to receive emails from The Daily Milan and accept our Privacy Policy. Unsubscribe anytime.

More from The Daily Milan

More in News

Enjoyed this story? Get tomorrow's briefing free.