![]() you will have to find specific handlers for each and you may not be happy with the results even after a lot of work. Now for the really bad news: if you want to look for all sorts of files that are "similar" but not identical such as text files, PDFs, ZIP files, etc. Some image processing AI will do well with these, but I'm assuming you don't want to build one of those. In fact, stenographically altered images will look identical to you but not to a file-comparing algorithm. dupeGuru above has an image similarity search which some think is OK, but it will miss some that you might think are nearly identical (or identical). If you are looking for similar photos/images, check out How can I find duplicate photos in a very large pool of data (tens to hundreds of gigs)?. GNU/Linux: duff (often available with sudo apt install duffįor "Similar" files in general, it gets tougher - a lot tougher. ![]() GNU/Linux: fdupes (often available with sudo apt install fdupes).! -empty -type f -exec md5sum + | sort | uniq -w32 -dD Determining files that are identical is easy, any duplication finder will do the trick: I am going to assume you care about the file "content".
0 Comments
Leave a Reply. |
AuthorWrite something about yourself. No need to be fancy, just an overview. ArchivesCategories |