I have an "images" directory with several subdirectories (and
sub-sub-directories), one of them called "products".
Pretty much every image in the "products" directory is a duplicate of an
image in the parent "images" directory or one of its other
subdirectories, although with a different filename (binary contents will
be the same, and there's a good chance that the timestamp will also be
the same although this is not guaranteed).
I want to locate the duplicates, and remove them - the ones I must keep
are the ones in the "products" subdirectory. Ideally I'd like to move
the duplicates into a temporary directory (retaining their sub-directory
paths) rather than delete them in case I need to restore them, but
deletion wouldn't be too bad (I can take backups first, although it's on
a hosted server and the "images" directory is 1.1G, with free space on
the server a little less than that, so just duplicating the directory
won't be as easy as it might sound - the lack of free space is one of
the main reasons I need to do this!).
Oh, and there are some sub-directories I'd like to exclude.
So: Which tools should I look at? fslint springs to mind but I've never
really used it in anger and the documentation seems a bit vague. The GUI
option is out because there's no X on the server. Are there other,
better (more appropriate) tools?
--
Mark Rogers // More Solutions Ltd (Peterborough Office) // 0844 251 1450
Registered in England (0456 0902) @ 13 Clarke Rd, Milton Keynes, MK1 1LG