
Find asset duplicates

You can use our duplicate finder to locate duplicate assets in your portal. Duplicates are detected by pHash comparison. For each duplicate set, you can decide which file should be marked as the original and delete the other files. Currently, there is no option to resolve or ignore duplicates.

To start the finder, open your asset overview and click the duplicate finder button in your action bar.

You need to have the Edit media right to use the finder.

pHash comparison

We find duplicates based on pHash (perceptual hash) comparison. We convert the image to black and white, scale it down, and calculate a hash from the result. When you run the duplicate finder, the image hashes are compared, and images with the same hash are considered duplicates.
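The steps above can be sketched in a few lines of Python. This is an illustration only, not the finder's actual implementation: it uses a simple average hash (a close relative of pHash that skips the frequency transform), and the names average_hash, to_gray, shrink, and half_and_half are made up for this example. Images are plain nested lists of (R, G, B) tuples so that no image library is needed.

```python
from typing import List, Tuple

Pixel = Tuple[int, int, int]   # (R, G, B), each 0..255
Image = List[List[Pixel]]      # rows of pixels


def to_gray(image: Image) -> List[List[int]]:
    """Step 1: drop the color information (standard luminance weights)."""
    return [[(299 * r + 587 * g + 114 * b) // 1000 for (r, g, b) in row]
            for row in image]


def shrink(gray: List[List[int]], size: int = 8) -> List[List[float]]:
    """Step 2: scale down to size x size by averaging blocks of pixels."""
    height, width = len(gray), len(gray[0])
    small = []
    for by in range(size):
        y0 = by * height // size
        y1 = max((by + 1) * height // size, y0 + 1)
        row = []
        for bx in range(size):
            x0 = bx * width // size
            x1 = max((bx + 1) * width // size, x0 + 1)
            block = [gray[y][x] for y in range(y0, y1) for x in range(x0, x1)]
            row.append(sum(block) / len(block))
        small.append(row)
    return small


def average_hash(image: Image, size: int = 8) -> int:
    """Step 3: one bit per cell, set when the cell is brighter than the mean."""
    cells = [v for row in shrink(to_gray(image), size) for v in row]
    mean = sum(cells) / len(cells)
    bits = 0
    for value in cells:
        bits = (bits << 1) | (1 if value > mean else 0)
    return bits


def half_and_half(right: Pixel, size: int = 16) -> Image:
    """Hypothetical test image: left half black, right half a solid color."""
    black = (0, 0, 0)
    return [[black if x < size // 2 else right for x in range(size)]
            for _ in range(size)]


red_version = half_and_half((255, 0, 0))
green_version = half_and_half((0, 255, 0))

# Same layout, different color: the hashes match, so a finder that
# compares hashes for equality would report these as duplicates.
print(average_hash(red_version) == average_hash(green_version))  # True
```

Because the color is discarded before hashing, two images with the same layout but differently colored elements can end up with identical hashes.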

Our duplicate finder works best with images. Since it scans thumbnails, it might generate imprecise results for PDF and PPT files. For example, if two PDFs use the same front page as their thumbnail but are in different languages, they will still be considered duplicates. Sometimes, images that differ only in the color of an element are also shown as duplicates.

Working with results

You can scroll down the results page to see the full list of duplicate sets. The sets are ordered in the same way your Asset Bank is ordered. If you have the "by date added" arrangement applied, you will find the latest duplicates on top of the results list.

In each results line you see a set of images considered duplicates.


You can select which image you want to keep in the Asset Bank and delete the rest. Clicking Make original keeps the selected image and deletes the other images from the set.

Currently, metadata is not merged, so if you decide to delete a duplicate, its metadata gets deleted as well.
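To make the consequence concrete, here is a minimal Python sketch of what choosing an original means for metadata. It is a model of the behavior described above, not the product's API: the Asset class, the make_original function, and the sample field names are all hypothetical.

```python
from dataclasses import dataclass, field
from typing import Dict, List, Tuple


@dataclass
class Asset:
    # Hypothetical stand-in for an asset in a duplicate set.
    asset_id: int
    filename: str
    metadata: Dict[str, str] = field(default_factory=dict)


def make_original(duplicate_set: List[Asset],
                  keep_id: int) -> Tuple[Asset, List[Asset]]:
    """Split a duplicate set into the kept asset and the assets to delete."""
    kept = [a for a in duplicate_set if a.asset_id == keep_id]
    if len(kept) != 1:
        raise ValueError(f"asset {keep_id} must appear exactly once in the set")
    deleted = [a for a in duplicate_set if a.asset_id != keep_id]
    # No merge step: the kept asset's metadata is left exactly as it was.
    return kept[0], deleted


duplicate_set = [
    Asset(1, "logo.png", {"title": "Logo"}),
    Asset(2, "logo_copy.png", {"title": "Logo", "copyright": "Example Ltd"}),
]

original, removed = make_original(duplicate_set, keep_id=1)

# Fields that exist only on the deleted duplicates are lost.
lost_fields = sorted({k for a in removed for k in a.metadata}
                     - set(original.metadata))
print(lost_fields)  # ['copyright']
```

If a duplicate carries metadata you want to keep, copy it to the file you intend to keep before you delete the duplicate.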

Limit the finder scope

To limit the scope of the finder, or to find results in a specific category, you can apply filters and then run the finder. Alternatively, you can run the finder first, and then apply filters to find specific duplicates.

For example, you can use the Advanced > Added by filter to see if there might be any duplicates in the assets you have added to the Asset Bank.
