Milan's municipal digital archive, maintained through the Comune di Milano's ICT directorate, has been quietly wrestling with a sprawling catalogue problem: thousands of duplicate images lodged inside public-facing databases, tourism portals, and the city's urban planning repositories. The cleanup operation, which formally began in January 2025, is now in its second phase, targeting records tied to the Milan-Cortina 2026 Winter Olympics infrastructure filings and the Porta Nuova regeneration zone documentation.
The issue matters now because the Olympics deadline is pressing. With events scheduled to begin in February 2026, Milan's digital presentation layer — the portals, press kits, and venue directories used by international journalists, accreditation bodies, and sponsors — cannot afford broken thumbnails, mislabelled photography, or duplicated aerial shots that contradict each other on matters of construction progress. Getting the image catalogue right is not a cosmetic concern; it directly affects official communications and legal filings tied to planning permissions around the Piazza Tre Torri site in Porta Nuova.
How the Duplication Problem Took Root
The origins trace back to the mid-2010s, when Milan accelerated its smart-city ambitions. The city's various departments — urban planning, tourism, heritage conservation — each built or licensed separate content management systems. The Fondazione Fiera Milano, operating out of the vast exhibition complex in Rho, maintained its own photo library. The fashion and design economy added another layer: major institutions including the Triennale di Milano on Viale Alemagna and the Camera Nazionale della Moda generated enormous volumes of event photography, some of which fed into city-promoted digital channels without standardised metadata tagging.
By the time the Comune di Milano began a formal digital asset audit in late 2023, internal assessments — cited in a council committee session that autumn — found that certain landmark images, including aerial views of the Duomo di Milano and street-level photography from the Navigli district canals, appeared in multiple versions across at least four separate internal systems. Some images carried conflicting copyright attributions. Others had been resized, cropped, and re-uploaded so many times that the original source files were effectively untraceable without forensic comparison tools.
The fashion sector's involvement complicated matters further. During Milan Fashion Week — held twice annually, drawing roughly 60,000 industry visitors each February and September according to the Camera Nazionale della Moda's published figures — the volume of imagery entering city-adjacent channels spikes sharply. Images from runway shows at locations including Palazzo Clerici on Via Clerici and temporary structures in the Tortona design district routinely end up shared across municipal tourism feeds, fashion council archives, and third-party press platforms, each copy accumulating slightly different metadata.
The Cleanup and What Comes Next
The second phase of the remediation project, running through September 2026, involves a systematic deduplication pass using perceptual hashing software across the Comune's primary content repositories. Priority is being given to images directly connected to Olympic venue documentation — specifically the PalaItalia Santa Giulia arena in the Rogoredo neighbourhood and the Pista Olimpica di bob in Cortina — where consistency between planning documents and press materials is subject to scrutiny by the Italian National Olympic Committee, known as CONI.
For organisations outside the municipal structure, the practical lesson from Milan's experience is straightforward. Any institution managing a large image library across multiple platforms — whether a fashion house on Via Montenapoleone or a design studio in the Isola district — risks the same fragmentation if asset management is not centralised from the outset. The cost of retroactive deduplication, in both staff hours and potential legal exposure over copyright misattribution, consistently exceeds the cost of proper tagging infrastructure at the point of ingestion.
The Comune's ICT office has indicated the full remediated archive will be migrated to a unified digital asset management platform before the Olympic opening ceremony. Whether the February 2026 timeline holds depends in part on how many further conflicts the perceptual hashing sweep continues to surface — a number that, as of mid-2026, remains stubbornly higher than initial projections anticipated.