The problem sounds mundane until you see the invoices. Milan's major public institutions and fashion-sector companies have spent the better part of three years quietly dismantling a sprawling mess of duplicated digital assets — identical or near-identical images filed under different names, stored across multiple servers, and in some cases licensed separately each time they were used. The cleanup has involved archive teams at the Comune di Milano, design houses operating out of the Porta Nuova district, and cultural bodies preparing digital infrastructure ahead of the Milan-Cortina 2026 Winter Olympics.
The reason it matters now is timing. With less than six months left before the Olympics open and Milan's Palazzo del Ghiaccio and the Fiera Milano exhibition complex both scheduled to host media operations serving a projected global audience in the hundreds of millions, having clean, correctly tagged, non-duplicated image libraries is not optional. A duplicated asset sitting under two different licence agreements is not just an administrative irritant — it is a potential legal and financial liability.
How the Duplication Built Up Over a Decade
The roots of the problem trace back to roughly 2014 and 2015, when institutions across Milan began digitising archive material at scale, often using different contractors working to different metadata standards. The Biblioteca Nazionale Braidense on Via del Senato digitised substantial portions of its photographic collection during that period. Fashion houses clustered around Via della Spiga and Corso Como were simultaneously building proprietary digital asset management systems to handle runway photography, lookbook images, and campaign material.
None of these projects talked to each other. Files were transferred between agencies, resized, renamed, and re-uploaded. By 2021, a routine audit at one Porta Nuova-based luxury group — details of which circulated among DAM consultants in Milan without becoming public — reportedly found that more than 30 percent of stored image files had at least one functional duplicate. That figure, while not independently verified for this article, aligns with industry benchmarks published by the Digital Asset Management Foundation, which has documented duplication rates of between 25 and 40 percent in large fashion and media organisations that migrated legacy archives without a unified taxonomy.
The Milan-Cortina organising committee, known formally as Fondazione Milano Cortina 2026, accelerated the conversation. Its communications and media operations required a single, clean image repository that partner institutions across Lombardy could draw from without risking duplicated licensing or contradictory usage rights. That operational demand effectively forced a timetable on organisations that had been treating deduplication as a background task.
What the Cleanup Actually Involves
Duplicate image replacement is not simply deleting files. Each duplicate must be assessed: Is it truly identical at the pixel level, or is it a resized derivative with its own legitimate use case? Does it carry a separate IPTC metadata record that links it to a specific publication or event? Does removing it break any existing hyperlink or embedded reference in a published web page or internal document? For institutions like the Fondazione Prada, whose digital presence spans its Largo Isarco campus and multiple international satellite sites, the answer to all three questions is frequently yes.
Software tools have matured considerably since 2020. Platforms using perceptual hashing — a technique that identifies visually similar images even when file names and formats differ — can now process libraries of several million assets in under 48 hours. The cost of a mid-market enterprise licence for such tools currently runs between roughly €18,000 and €45,000 annually, depending on storage volume, according to pricing published by vendors active in the European market. That is a manageable figure for a fashion conglomerate; it represents a genuine budget decision for a municipal archive.
For organisations still working through the process, the practical priority is establishing a single source of truth before the Olympics media influx arrives in late January 2027. That means completing metadata reconciliation, retiring redundant storage buckets, and ensuring that any image served externally carries a single, auditable licence record. Institutions that have not begun should note that the Comune di Milano's smart city and digital transformation office has published guidance under its Piano per la Trasformazione Digitale framework, which sets out recommended standards for public-sector digital asset governance. Following that framework now is considerably cheaper than resolving a licensing dispute after a duplicated image has been broadcast to a global Olympic audience.