The Daily Milan

Milan news, every day

News

Milan's Digital Archives Race to Purge Duplicate Images This Week

A coordinated push by cultural institutions and design firms across the city is tackling a long-ignored backlog of duplicate visual assets before the Winter Olympics spotlight arrives.

By Milan News Desk · Published 4 July 2026, 9:00 pm

3 min read

Milan's Digital Archives Race to Purge Duplicate Images This Week
Photo: Photo by Alexandro D'Elia on Pexels

Milan's major cultural and commercial institutions moved this week to address a mounting problem that has quietly undermined digital archives for years: thousands of duplicate images clogging databases, inflating storage costs, and degrading search reliability. The push, which intensified Monday across several organisations in the city's design and fashion sectors, comes as Milan-Cortina 2026 preparations force institutions to present clean, professional digital fronts to an international audience arriving this winter.

The timing matters. With the February 2026 Winter Olympics opening ceremony now fewer than eight months away and Porta Nuova's redeveloped media hub already fielding accreditation requests from broadcasters across 40 countries, the pressure on Milan's digital infrastructure is tangible. Duplicate image files — redundant JPEGs, mirrored product shots, repeated archival photographs stored under different filenames — slow retrieval systems and create legal exposure when rights-cleared images become indistinguishable from unlicensed duplicates in the same folder.

Where the Problem Is Concentrated

The issue is not uniform across the city. Institutions with the deepest legacy archives are feeling it most acutely. The Triennale di Milano, on Viale Alemagna in Parco Sempione, manages a photographic collection spanning more than nine decades of design history. Staff there have been running deduplication protocols since late June, using perceptual hashing software that compares visual similarity rather than just file metadata — a more demanding but more accurate method than simple checksum matching.

In the Brera district, several independent fashion studios and smaller design agencies clustered around Via Solferino and Corso Garibaldi have reported similar exercises this week, prompted partly by guidance circulated by Camera Nazionale della Moda Italiana ahead of the September fashion week cycle. Storage costs on commercial cloud platforms have risen sharply; Amazon Web Services standard S3 storage in the EU West region currently runs at roughly €0.023 per gigabyte per month, a figure that compounds quickly when a mid-sized fashion archive holds 60 to 80 terabytes of unaudited image files.

Fondazione Prada, whose angular campus on Largo Isarco in the southern Ortortona zone houses one of the city's most technically sophisticated digital teams, began a full deduplication audit on 1 July. The foundation declined this week to provide figures on how many files are under review, but the exercise is understood to be part of a broader digital asset management overhaul connected to planned new online exhibition infrastructure.

What the Technology Actually Does

Duplicate image replacement is not simply deletion. The better practice — now standard in professional digital asset management — involves identifying the canonical version of an image, updating all internal links and metadata references to point to that single file, and then archiving or destroying the redundant copies. Get it wrong and internal publishing systems break, pulling blank spaces into printed catalogues or web pages where images should appear.

That risk is particularly pointed for organisations producing physical print runs. Skira Editore, the Milan-based art publisher founded in 1928 and still operating from offices near the Duomo, has long maintained rigorous version-control discipline for exactly this reason. Errors in a digital archive translate directly into production errors in books that sell at €60 to €120 per copy.

The wider Italian context adds urgency. Italy's data protection authority, the Garante per la Protezione dei Dati Personali, has signalled in its 2026 work programme a focus on how institutions manage image files that may contain identifiable individuals — a category that intersects significantly with duplicate archives where consent documentation is often attached only to the original file, not its copies.

For Milan's institutions, the practical advice emerging from this week's activity is consistent: run a deduplication audit before September, not after. Fashion week generates tens of thousands of new image assets in a compressed window, and adding that volume to an already bloated archive without first clearing the backlog compounds the problem exponentially. Tools including Adobe Bridge's cache management functions, Google's Vision API for visual similarity clustering, and open-source options like dupeGuru are all in active use across the city this week. The window to resolve legacy problems quietly, before the cameras of the world are pointed at Milan, is closing.

Topic:#News

How does this story make you feel?

Spread the word

See something wrong? Suggest a correction.

Have your say

Loading comments…

Sources

About this article

Published by The Daily Milan

This article was produced by the The Daily Milan editorial desk and covers news in Milan. See our editorial standards for how we use AI.

The Daily Milan brief

The day's Milan news in a 2-minute read, every weekday morning. Free.

By subscribing you agree to receive emails from The Daily Milan and accept our Privacy Policy. Unsubscribe anytime.

Daily brief

Enjoyed this? Wake up to Milan news every morning.

Free, in your inbox before 7am. Weekdays.

By subscribing you agree to receive emails from The Daily Milan and accept our Privacy Policy. Unsubscribe anytime.

More from The Daily Milan

More in News

Enjoyed this story? Get tomorrow's briefing free.