Top > System administration > Configuration > Dupseek

Dupseek - Finds and removes duplicate files

Dupseek groups files by size, then reads and compares small chunks of the files of the same size. It creates smaller groups depending on these comparisons. It proceeds with bigger and bigger chunks (of size up to a hard-coded limit). It stops reading from files as soon as they form a single-element group or they are read completely (which only happens when they have a very high probability of having duplicates). The program does remove files, but it asks first.

Dupseek aims for maximum efficiency by keeping file reads to a minimum and is much better than other similar programs when dealing with groups of large files of the same size. It can be interrupted at any moment. The user is then presented with partial results and can either intervene manually or go on with the reading and computation, on a group-by-group basis.

Obtaining

Web pagehttp://www.beautylabs.net/software/dupseek.html
Source tarballhttp://www.beautylabs.it/software/dupseek-1.1.tgz
Version 1.1 (beta) released on 2003-06-27
Licensed under The GNU General Public License, Version 2.
This is not a GNU package.

Support contacts

Help List<antonio@beautylabs.net>
Developer List<antonio@beautylabs.net>
Bug List<antonio@beautylabs.net>

Project contacts

Maintainers
Developers

Related information

Interfacescommand line
Source languagesPerl
Use requirementsPerl, File::Find

Entry information

License verified byJanet Casey <jcasey@gnu.org> on 2003-06-02
Entry compiled byJanet Casey <jcasey@gnu.org>

Categories



The copyright licensing notice below applies to this text. The software described in this text has its own copyright notice and license, which can usually be found in the distribution itself.

Copyright © 2000, 2001, 2002, 2003 Free Software Foundation, Inc.

Permission is granted to copy, distribute, and/or modify this document under the terms of the GNU Free Documentation License, Version 1.1 or any later version published by the Free Software Foundation; with no Invariant Sections, with no Front-Cover Texts, and with no Back-Cover Texts. A copy of this license is included in the file COPYING.DOC.

Please report any problems in this page to bug-directory@gnu.org, or find out how you can help fix them.

The FSF provides this directory as a service to the free software community. Please consider donating to the FSF to help support this project.