Milan's cultural institutions are sitting on a quietly expensive problem. Across the city's major public archives — from the Pinacoteca di Brera to the Comune di Milano's digital patrimony servers on Via Larga — duplicate images have accumulated over years of uncoordinated digitisation campaigns, and the bill for storing, cataloguing, and misidentifying them is no longer trivial.
The issue surfaced publicly this spring when the Civico Archivio Fotografico, housed inside the Castello Sforzesco complex, published an internal review acknowledging that a significant share of its online collection contained redundant image files — scans of the same physical photograph uploaded multiple times under different catalogue numbers. The review did not publish a specific figure for the duplication rate, but the problem has been widely described inside archival circles as systemic, not marginal.
Why now? Two forces are colliding in 2026. First, Milan-Cortina Winter Olympics preparation has pushed city administrators to accelerate the public digitisation of heritage assets, creating promotional and educational content at speed. Second, the post-pandemic funding cycle for EU-backed cultural digitisation under the Creative Europe programme is winding toward its next application window, and institutions with messy data face real risks of disqualification from grants worth hundreds of thousands of euros.
What the Experts and Officials Are Saying
Professionals working inside the sector are blunt about the cause. Rushed digitisation without a shared metadata standard is the root problem, according to curators and archivists who spoke at a round-table organised by the Associazione Nazionale Archivistica Italiana at its Milan chapter meeting in May 2026. The meeting, held at the Fondazione Feltrinelli on Viale Pasubio, drew representatives from the Biblioteca Ambrosiana, the Archivio di Stato di Milano on Via Senato, and several private fashion-house archives, including labels with offices in the Quadrilatero della Moda.
The fashion sector's involvement is not incidental. Milan's luxury and design economy depends on controlled, licensable image libraries. Redundant or misattributed files create legal exposure when images are licensed twice under different terms — a problem several houses have reportedly flagged to their legal departments, though none have made public statements to that effect. The Fondazione Altagamma, which represents Italy's luxury goods industry, has publicly backed a push for a unified national image-metadata standard, a position it reaffirmed at a June 2026 event in Porta Nuova's Gioia 22 tower.
At the municipal level, the Comune di Milano's Assessorato alla Cultura has indicated it is reviewing its internal digital asset management protocols, though it has not yet announced a specific programme or budget allocation. City administrators have pointed to the OpenMilano data portal, launched in 2019, as the intended backbone for a more disciplined approach — but critics in the archival community argue the portal was never designed to handle image deduplication at institutional scale.
The Cost and the Fix
Storage costs alone give a sense of the scale. Commercial cloud storage for large uncompressed image archives runs at roughly €0.02 per gigabyte per month on standard European server infrastructure, and a mid-size civic archive managing tens of thousands of high-resolution scans can easily accumulate several terabytes of redundant data. That is not a catastrophic figure in isolation, but combined with staff time spent on duplicate cataloguing — a task that can consume 15 to 20 percent of a junior archivist's working week, according to benchmarks published by the Istituto Centrale per il Catalogo e la Documentazione in Rome — the aggregate cost across a city like Milan adds up.
Technology exists to address it. Perceptual hashing tools and AI-assisted deduplication software are already used by commercial photo agencies across Europe, and at least two Milan-based tech startups with offices in the Bovisa innovation district have developed tools adapted for cultural heritage collections. Whether public institutions move quickly enough to adopt them before the next Creative Europe deadline — expected in early 2027 — is the question archivists are pressing administrators to answer now.
For individual institutions, the practical advice from specialists is consistent: audit before you upload, agree on a shared file-naming convention across departments, and do not treat digitisation as a one-off project. Milan's archives took decades to build. Cleaning them up will not happen in a single budget cycle, but starting in 2026, with Olympic scrutiny already on the city, is as good a forcing function as any.