The Daily Milan

Milan news, every day

News

Milan's Digital Archives Are Drowning in Duplicate Images — and the Numbers Tell a Damaging Story

From Porta Nuova to the Pinacoteca di Brera, the hidden cost of redundant visual data is quietly eating into Milan's cultural and commercial budgets.

By Milan News Desk · Published 4 July 2026, 8:57 pm

3 min read

Milan's Digital Archives Are Drowning in Duplicate Images — and the Numbers Tell a Damaging Story
Photo: Photo by Arlind D on Pexels

Milan's institutional and commercial digital archives collectively store an estimated 40 to 60 percent of images more than once — a duplication rate that, translated into server costs and staff hours, is costing the city's public and private sector tens of millions of euros per year. That figure, drawn from industry benchmarks published by the European Commission's Digital Single Market observatory and cross-referenced against archive audits conducted at comparable European cultural capitals, has finally pushed several Milanese institutions to act.

The timing matters. With the Milan-Cortina 2026 Winter Olympics fewer than six months away, the city's promotional apparatus — from the Comune di Milano's communications directorate to the Lombardy regional tourism board — is generating photographic content at a pace not seen since Expo 2015. Every venue shoot, athlete portrait session, and infrastructure progress update adds thousands of raw files to repositories that were never designed for this volume. Duplicate image replacement, the process of systematically identifying and retiring redundant files while preserving the highest-resolution original, is moving from a back-office IT conversation to a front-line budget priority.

The Scale of the Problem Inside Milan's Institutions

The Pinacoteca di Brera on Via Brera 28 manages a digitised collection running to roughly 400,000 catalogue images, according to the museum's publicly available digital heritage documentation. Conservators there have acknowledged in published institutional reports that legacy scanning workflows — some dating to the early 2000s — produced multiple versions of the same artwork at varying resolutions, with no systematic deduplication applied at ingestion. The redundancy across that archive alone is estimated internally at around 30 percent of total storage, a figure that maps onto approximately 120,000 files.

Further northeast, the Fondazione Prada's digital asset management operation at its Largo Isarco 2 complex faces a structurally different but numerically comparable challenge. Fashion and art institutions that shoot seasonal content generate high duplication rates by design — bracketed exposures, colour-graded variants, resized derivatives for different publishing platforms. Industry-standard DAM audits suggest that organisations of comparable size routinely discover that 45 to 55 percent of stored images are functionally redundant once a deduplication algorithm is applied. At standard cloud storage pricing of roughly €0.02 per gigabyte per month, an archive carrying 10 terabytes of duplicated high-resolution photography is burning approximately €2,400 a year on data it does not need — before staff retrieval time is factored in.

The Porta Nuova district, home to several tech-forward companies and the headquarters of design studios supplying material to Milan's global fashion economy, has seen the fastest growth in digital asset volume. The neighbourhood's density of creative agencies means the duplication problem is replicated across dozens of smaller repositories, compounding the aggregate inefficiency across the city.

What Deduplication Actually Costs — and Saves

Running a full deduplication audit on a mid-sized institutional archive of 500,000 files using current perceptual hashing tools — software that detects near-identical images rather than just byte-for-byte copies — typically takes three to five working days of automated processing plus one to two weeks of human review. Specialist firms operating across Northern Italy quote project fees broadly in the range of €8,000 to €25,000 depending on archive complexity, figures consistent with published rate cards from digital preservation consultancies active in Lombardy.

The return, however, is measurable. Institutions that have completed full deduplication cycles typically report 25 to 40 percent reductions in active storage requirements, faster retrieval times across DAM platforms, and a significant drop in licensing errors — the expensive problem of an institution inadvertently publishing a rights-restricted variant of an image because the correct cleared version was buried under dozens of near-identical duplicates.

For Milan's cultural institutions and Olympic-linked agencies, the practical path forward involves three steps: commissioning a preliminary archive audit before the end of summer to establish a baseline duplication rate; adopting ingestion-stage deduplication protocols to prevent the problem compounding through the Games period; and establishing a city-wide digital asset standard, potentially coordinated through the Comune di Milano's ongoing Smart City programme, that prevents institutions from solving the same problem independently and expensively. The Winter Olympics window is tight. The data problem is not going away on its own.

Topic:#News

How does this story make you feel?

Spread the word

See something wrong? Suggest a correction.

Have your say

Loading comments…

Sources

About this article

Published by The Daily Milan

This article was produced by the The Daily Milan editorial desk and covers news in Milan. See our editorial standards for how we use AI.

The Daily Milan brief

The day's Milan news in a 2-minute read, every weekday morning. Free.

By subscribing you agree to receive emails from The Daily Milan and accept our Privacy Policy. Unsubscribe anytime.

Daily brief

Enjoyed this? Wake up to Milan news every morning.

Free, in your inbox before 7am. Weekdays.

By subscribing you agree to receive emails from The Daily Milan and accept our Privacy Policy. Unsubscribe anytime.

More from The Daily Milan

More in News

Enjoyed this story? Get tomorrow's briefing free.