TreeSize Professional Duplicate Search

Find and remove duplicate files

Professional project management with our TreeSize Duplicate Search

Disk Space Management Top Features Tips & Tricks
03.08.2021

Working together in the same file system is challenging. Especially when several people are involved within a project, the work documents must be well organized to avoid any mess or even chaos. That's why it's important to use the right tools to support your project work professionally. With TreeSize, it is possible to react to file duplications with ease and speed.

 

Duplicate files waste hard disk space

No matter whether you work on projects together via SharePoint, a network drive or alone on your local drive, files are often temporarily moved or copied and then forgotten. If you don't take regular action against this, more and more file duplicates are created over time - i.e. duplicate or identical files that are stored in different folders. Depending on the depth of the structure, these duplicate files can disappear into the depths of your own folder paths and waste large amounts of storage space.

A customer of ours had this happen to him as well: A construction company organizes its many projects into subdirectories in which it stores the respective planning and work files. In the process, each project receives multiple PDF files with plenty of specifications and standards that are essential for the respective construction project. Many of these PDF files are also needed in other projects and are accordingly stored in the working directories. As a result, the construction company constantly accumulates true duplicates of these source documents over time. Thus, in the course of time, hundreds of gigabytes of storage space would be wasted on the file duplicates alone. Manually searching for these files is unmanageable and uneconomical due to the sheer volume of files.

Thanks to TreeSize, our allrounder for file & disk space management, it is possible to effectively search for duplicate files on local file systems and servers and deduplicate them. In doing so, the file duplicates are replaced by NTFS hardlinks - so that each file duplicate no longer consumes its own space, but points to the same location on the hard disk.

Thus, TreeSize prevents losses of important files and frees up new storage space at the same time.

Our customer now regularly runs a scan of their file systems at the end of the month, looking for duplicate files as well. A simple click is all it takes to deduplicate duplicate files with TreeSize. 

But how does it in practice?

 

Find duplicate files with TreeSize and remove duplicates safely

The TreeSize file search is a very powerful tool, with which you can professionally support the organization of your project work. To do so, open TreeSize and select the entry "Duplicate files" under "Open TreeSize file search". Here you can initially select the drives that are to be included in the search for file duplicates.

For example, would you like to select a server as a scan target to check for duplicate files on your SharePoint? Click on the plus icon, select "SharePoint“, and enter your login credentials. Besides SharePoint, TreeSize can also scan Amazon S3 cloud storage, Linux and Unix servers using SSH and WebDav.

Before starting the duplicate search, you can also set filter rules to exclude files that are smaller than a certain size from the results list, for example. This will increase the clarity of your results.

Once all settings have been entered, the duplicate search can begin. Our tool offers various check methods, such as the MD5 or SHA256 checksum. TreeSize lists the results concisely, so that you can see exactly which files have duplicates at which locations. Sort the results list by size, for example, to identify the biggest space squanderers.

Finally, you can select either all or just individual result entries for deduplication. In the deduplication window you can again get an overview of the storage space you can potentially regain with TreeSize.

Done! Next, schedule the search for duplicate files with TreeSize as part of your project management process.

 

For the big chunks: Search for duplicate folder structures with TreeSize

Not every use case is covered by a simple file search. Another customer told us some time ago that his use case is even more specific. A state archive faces the problem that not only individual files, but entire folder structures are present on its servers multiple times. In total, more than 10,000 file duplicates can be identified with the TreeSize file search. To check them individually on file level would take far too much time, which is why our customer would like to have a duplicate search on folder level.

We are pleased to announce that in the upcoming version 8.2 we will next develop a search for duplicate folder structures for TreeSize. This will allow us to offer our customers an even more sophisticated search for duplicate files.

TreeSize is a must-have for neatly organizing projects and a lifesaver for overfilled hard drives. TreeSize Duplicate Search can detect duplicate files in a quick and easy way, both on servers and on local drives. This allows you to resolve file duplicates without having to worry about losing important content. 

 

 

Do you like what you've just read, have new ideas or feedback? Visit our contact form and let us know your thoughts!

Blog author Joey

Joey

File & Disk Space Management, Banking Software