The Daily Milan

Milan news, every day

News

How Milan's Digital Archives Ended Up Full of Duplicate Images — And What's Being Done to Fix It

A slow accumulation of copied, re-uploaded and mis-catalogued photographs has left the city's cultural and commercial image libraries in a costly tangle, and the reckoning is now here.

By Milan News Desk · Published 4 July 2026, 8:45 pm

3 min read

How Milan's Digital Archives Ended Up Full of Duplicate Images — And What's Being Done to Fix It
Photo: Spencer, George John Spencer, Earl, 1758-1834 Dibdin, Thomas Frognall, 1776-1847 / Public domain (Wikimedia Commons)

Milan's public and institutional image archives contain tens of thousands of duplicate photographs — redundant files that inflate storage costs, confuse cataloguing systems and, in some cases, have led to the same licensed image being paid for twice. The problem, which has been building quietly for more than a decade, is now forcing a reckoning across the city's cultural sector, from the Comune di Milano's own digital infrastructure to the archival holdings at institutions along Via Brera and the design documentation libraries serving the Salone del Mobile trade network.

The issue matters now because Milan is spending heavily on its digital presence. The city is finalising venues and communication materials tied to the Milan-Cortina 2026 Winter Olympics, with key infrastructure deadlines falling before the end of this calendar year. Every duplicated asset sitting in a bloated content management system represents a liability — a potential rights conflict, an unnecessary licensing fee, or simply dead weight slowing the workflows of creative teams already under pressure.

How the Pile-Up Happened

The roots run back to the mid-2010s, when institutions and city agencies began migrating from physical photo libraries to digital asset management platforms with little standardised protocol. Files were uploaded multiple times under different naming conventions, sometimes by different departments, sometimes following server migrations that failed to deduplicate on import. The Porta Nuova district's development documentation alone — spanning construction phases from 2009 through to the district's commercial opening years — was handled by at least three separate contractors over that period, each depositing imagery into systems that did not talk to one another.

Fashion and design sit at the heart of the economic problem. Milan's creative economy, anchored in the Quadrilatero della Moda and the network of showrooms between Corso Como and the Tortona district, generates a high volume of commissioned photography every season. Agencies working on short deadlines frequently re-upload images from previous campaigns rather than pulling from a central repository. Industry estimates have put the proportion of duplicate images in mid-sized creative agency archives at between 20 and 35 percent, though figures vary by organisation and no single audit of Milan's sector-wide holdings has been published.

The Fondazione Prada and the Triennale di Milano, both of which maintain extensive digital archives of exhibition photography and design documentation, have each undertaken internal deduplication reviews in recent years. Neither institution has publicly disclosed the scale of redundancy found, but the process of standardising metadata — applying consistent author credits, usage rights and file provenance — has become a recognised operational priority in Milan's museum and cultural-foundation sector.

What a Fix Actually Looks Like

Deduplication is not simply a matter of deleting obvious copies. The technical challenge involves distinguishing between true duplicates — identical files stored more than once — and near-duplicates, which are the same image cropped, colour-corrected or exported at a different resolution. Perceptual hashing algorithms, which generate a fingerprint for each image based on visual content rather than file metadata, have become the standard tool for this work. Several digital agencies operating out of the Isola neighbourhood and the creative cluster around Via Tortona have begun offering deduplication audits as a standalone service, typically priced at between €2,500 and €8,000 depending on archive size.

For the city's public institutions, timing is critical. The Comune di Milano has a stated goal of consolidating its digital communications infrastructure before the Winter Olympics opening ceremony in February 2026 — a deadline that, as of this month, is fewer than eight months away. Procurement processes for technology services in the public sector move slowly, and any deduplication project that also involves rights clearance on third-party images requires legal review, adding weeks to the timeline.

The practical advice from archivists and digital asset specialists is consistent: start with a read-only audit before touching any files, preserve the original folder structure until duplicates are confirmed, and build a controlled vocabulary for metadata before re-ingesting cleaned assets. Getting the cataloguing right the first time is what the sector failed to do a decade ago. Milan's institutions, with the deadline pressure of a global sporting event concentrating minds, may not get a third chance to learn the same lesson.

Topic:#News

How does this story make you feel?

Spread the word

See something wrong? Suggest a correction.

Have your say

Loading comments…

Sources

About this article

Published by The Daily Milan

This article was produced by the The Daily Milan editorial desk and covers news in Milan. See our editorial standards for how we use AI.

The Daily Milan brief

The day's Milan news in a 2-minute read, every weekday morning. Free.

By subscribing you agree to receive emails from The Daily Milan and accept our Privacy Policy. Unsubscribe anytime.

Daily brief

Enjoyed this? Wake up to Milan news every morning.

Free, in your inbox before 7am. Weekdays.

By subscribing you agree to receive emails from The Daily Milan and accept our Privacy Policy. Unsubscribe anytime.

More from The Daily Milan

More in News

Enjoyed this story? Get tomorrow's briefing free.