The Daily Milan

Milan news, every day

News

How Milan's Digital Archive Crisis Reached Breaking Point: The Story Behind the Duplicate Image Replacement Drive

Years of fragmented cataloguing across the city's cultural institutions have forced a reckoning with tens of thousands of redundant image files — and now a coordinated effort is finally underway to fix it.

By Milan News Desk · Published 4 July 2026, 8:58 pm

3 min read

How Milan's Digital Archive Crisis Reached Breaking Point: The Story Behind the Duplicate Image Replacement Drive
Photo: Photo by Andrew Patrick Photo on Pexels

Milan's network of public cultural institutions is midway through a phased programme to strip out and replace duplicate digital images from its shared heritage databases, a problem that archivists at the Civico Archivio Fotografico in Via Moscova say has accumulated over more than a decade of poorly coordinated digitisation campaigns.

The scale matters because Milan is not managing a modest municipal photo library. The city's cultural holdings — spanning the Pinacoteca di Brera, the Museo del Novecento overlooking Piazza del Duomo, and the network of civic museums operating under the Fondazione Milano — run to millions of catalogued digital assets. When the same image sits in three separate database entries under different file names, curators waste hours cross-referencing records, grant applications cite inflated collection figures, and licensing revenue leaks away through administrative confusion.

How the Duplication Problem Was Built, Layer by Layer

The roots go back to roughly 2010, when a wave of European Union co-funded digitisation grants pushed Italian civic institutions to scan physical collections at speed. Milan's institutions moved quickly — Brera alone processed thousands of works between 2011 and 2014 using contractors who delivered files in incompatible formats. When those files were ingested into the city's central catalogue, the MiBAC-aligned metadata standards in use at the time did not automatically flag images already present under a different project code.

A second wave of duplication arrived with the Porta Nuova regeneration era. As the Isola and Garibaldi districts transformed through the early 2010s, neighbourhood archives, community photography projects, and corporate documentation from developers like Hines Italia were donated or loaned to civic repositories. Each batch arrived with its own internal filing logic. Librarians at the Biblioteca Centrale in Corso di Porta Vittoria, who manage a portion of the urban documentation archive, absorbed thousands of images from at least four separate Porta Nuova-era sources between 2013 and 2019 without a unified deduplication protocol in place.

The Milan-Cortina 2026 Winter Olympics preparation added fresh urgency. With international media partners and licensing agencies requesting high-resolution images of Milan's infrastructure, venues, and public spaces at volume since late 2024, the duplication errors became commercially visible in a way they had not been before. An internal audit commissioned in early 2025 — the results of which were described in general terms in a Fondazione Milano public board document circulated in March 2025 — identified redundancy rates in certain subcollections as high as 30 percent.

What the Replacement Programme Actually Involves

The current programme, running through the end of 2026, pairs each duplicate cluster with a designated canonical master file. Lower-resolution or metadata-poor duplicates are flagged for suppression rather than deletion — a distinction archivists consider important, since hard deletion of files that may carry rights-management histories creates legal exposure. The work is being carried out in phases, with photography collections from the Museo di Storia Naturale in Corso Venezia among the first subcollections to go through the process.

Institutions are also standardising on IIIF — the International Image Interoperability Framework — as the presentation layer for all outward-facing collections. The framework, already adopted by major European galleries including the Rijksmuseum in Amsterdam and the Bibliothèque nationale de France, allows a single master image to be served across multiple platforms without spawning additional file copies at the point of distribution. Milan's adoption timeline aligns with a broader Italian cultural ministry push that set a compliance target of December 2026.

For researchers, journalists, and the fashion and design industry professionals who regularly licence archival material from city collections — Zona Tortona studios and Via Tortona showrooms among the most frequent commercial users — the practical outcome should be faster search results and cleaner licensing paperwork. Institutions expect to publish updated, deduplicated image counts in their annual reports covering 2026, giving the public the first clean baseline figure for what Milan's digital cultural archive actually contains.

Topic:#News

How does this story make you feel?

Spread the word

See something wrong? Suggest a correction.

Have your say

Loading comments…

Sources

About this article

Published by The Daily Milan

This article was produced by the The Daily Milan editorial desk and covers news in Milan. See our editorial standards for how we use AI.

The Daily Milan brief

The day's Milan news in a 2-minute read, every weekday morning. Free.

By subscribing you agree to receive emails from The Daily Milan and accept our Privacy Policy. Unsubscribe anytime.

Daily brief

Enjoyed this? Wake up to Milan news every morning.

Free, in your inbox before 7am. Weekdays.

By subscribing you agree to receive emails from The Daily Milan and accept our Privacy Policy. Unsubscribe anytime.

More from The Daily Milan

More in News

Enjoyed this story? Get tomorrow's briefing free.