Milan's archivists have a problem hiding in plain sight. As the city accelerates the digitisation of everything from Porta Nuova construction permits to Triennale di Milano exhibition catalogues, a growing number of institutions have discovered that their image databases contain thousands of duplicate files — the same photograph catalogued under different file names, different metadata, or in some cases different formats — eating up server space, distorting search results, and in several documented cases causing restoration teams to work from the wrong version of a source image.
The issue has sharpened in 2026, with Milan-Cortina Winter Olympics venues demanding rapid, reliable access to architectural and infrastructure imagery for ongoing fit-out work. When the same rendering of the PalaItalia Santa Giulia arena exists in three separate versions inside a contractor's digital asset system, each tagged differently, the question of which image is the master copy stops being an administrative nuisance and becomes an operational risk.
How the Duplication Problem Built Up Over Years
The roots run back to the mid-2000s, when Milan's major cultural institutions began digitising in parallel rather than in coordination. The Comune di Milano's urban planning directorate, the Archivio di Stato in Via Senato, and private-sector bodies including the Camera Nazionale della Moda Italiana all built separate digital workflows without a shared technical standard for image naming or metadata encoding. Files migrated from CD-ROMs to local servers to cloud storage multiple times over fifteen years, and each migration created new copies without reliably retiring old ones.
The fashion economy compounded the scale. Milan accounts for a disproportionate share of European luxury brand archiving activity, and houses based in the Quadrilatero della Moda — the stretch running from Via Montenapoleone through Via della Spiga — maintain image libraries of product photography, runway footage, and campaign stills that number in the tens of millions of files. Industry-wide, the cost of storing redundant digital assets is not trivial: according to figures published by the European Commission's 2024 Digital Decade progress report, unmanaged data duplication across European creative industries costs an estimated €2.3 billion annually in unnecessary storage and labour.
The public sector side tells a similar story. The Politecnico di Milano's architecture faculty, which holds one of the most extensive collections of Milanese urban development imagery in Europe, acknowledged in its 2025 annual report that a systematic audit of its digital holdings had identified duplication rates of roughly 18 percent across image files added before 2018. The faculty has since begun a remediation programme, but the work is ongoing.
What Comes Next for Institutions Still Sorting the Problem
The practical fix involves two distinct steps that many organisations are now working through simultaneously. The first is automated detection: software tools that compare image files using perceptual hashing — a technique that identifies visually identical or near-identical images regardless of file name or format — can flag probable duplicates for human review. The second, harder step is deciding which version to keep, which requires agreed metadata standards and, in institutional settings, sign-off from archivists who understand provenance.
The Comune di Milano's digital services office is reported to be piloting a unified asset management protocol across its departments as part of the broader Milano 2030 digital transformation agenda, though full rollout timelines have not been confirmed publicly. For private businesses in sectors like fashion and design, where image assets have direct commercial value, several Milan-based consultancies operating out of the Isola and Porta Nuova districts have begun offering duplicate-image audits as a standalone service — typically priced between €3,000 and €12,000 depending on library size, according to publicly listed service rates from two firms reviewed by The Daily Milan.
For the Olympics preparation teams working against a hard calendar — the Games open in February 2026 — the immediate priority is establishing a single authoritative image source for each venue, agreed upon by all contractors and logged in a shared system. The longer-term lesson for Milan, a city whose global reputation rests in no small part on the precision and quality of its visual output, is that getting the digital foundations right matters as much as the images themselves.