Milan's municipal digital archive office formally launched its image-deduplication programme in March 2026, targeting roughly 4.2 million photographs, architectural renderings and design assets held across city-managed repositories. The initiative, run through the Comune di Milano's Direzione Sistemi Informativi, sets a binding deadline of December 31, 2026 — timed deliberately to coincide with the close of the Milan-Cortina Winter Olympics — to eliminate redundant visual records that have accumulated over two decades of digitisation drives.
The timing is not coincidental. With Olympic broadcast partners and international media organisations demanding clean, licensable image libraries from the host city, the pressure to deliver a verified, deduplicated visual archive has intensified sharply this year. The problem is neither trivial nor cosmetic: duplicated records inflate storage costs, create legal ambiguity over licensing rights, and slow down the retrieval systems that journalists, researchers and city planners rely on daily.
What Milan Is Actually Doing
The operational hub for the project sits within the Palazzo del Comune's data infrastructure team in Via Larga, with secondary processing nodes housed at the Politecnico di Milano's artificial intelligence laboratory in Bovisa. The university partnership, formalised in a January 2026 agreement, gives the city access to perceptual hashing and convolutional neural network tools developed by Politecnico researchers. Those tools can flag near-duplicate images — photographs taken seconds apart, or scans of the same archival print at different resolutions — rather than only catching pixel-perfect copies.
The Fondazione Prada's digital archive team, which manages tens of thousands of images documenting its exhibitions at the Largo Isarco venue, has been cited internally by the Comune as a private-sector model for how deduplication can be embedded into ongoing cataloguing workflows rather than treated as a one-off clean-up exercise. The foundation declined to provide figures publicly, but its methodology has informed at least part of the Comune's operational protocol.
Budget for the civic programme stands at €1.8 million across 2026, according to documents published in the city council's delibera of February 17, 2026. Roughly 60 percent of that allocation covers software licensing and cloud processing fees, with the remainder funding three contract data analysts and staff training across six municipal departments.
How Milan Compares to Amsterdam, Paris and Tokyo
Milan is not alone. Amsterdam's Stadsarchief completed a comparable deduplication exercise across its 750,000-image digital collection in late 2024, using open-source tooling developed partly by the Dutch national cultural heritage agency Rijksdienst voor het Cultureel Erfgoed. The Amsterdam project took 14 months and removed approximately 18 percent of its visual holdings as confirmed duplicates or degraded redundant copies — a benchmark that Milan's project managers regard as a realistic target for their own cull.
Paris is further behind. The Bibliothèque nationale de France announced a multi-year image audit programme in September 2025, but the first operational phase covering its Gallica digital platform is not expected to conclude before mid-2027. Tokyo's National Diet Library launched a deduplication pilot in fiscal year 2025 covering newspaper photograph archives, though the scope remains narrower than Milan's cross-departmental ambition.
Where Milan holds a specific advantage is in the fashion and design sector. The Camera Nazionale della Moda Italiana, based in Via Gerolamo Morone, began pushing member brands in early 2025 to adopt standardised image metadata schemas — a precondition for effective deduplication at scale. That industry pressure means the city's commercial image ecosystem is more structured than those of comparable European cities, giving the Comune's tools cleaner data to work with from the outset.
For organisations and individuals whose images sit inside municipal or partner archives, the practical upshot over the next six months is straightforward: the Comune's portal for image rights enquiries, accessible through the city's Servizi Digitali platform, will be updated quarterly as batches of the archive are verified and cleared. Rights holders who believe their material has been incorrectly flagged for removal have a 30-day window to contest each batch notification, a process the Direzione Sistemi Informativi says will open formally in September 2026. The Olympic deadline is fixed. The backlog is large. The clock is running.