Pdf diff ?
May 13th, 2008 by Melaneum
Situation
Your exchanging pdf files with an editor for a journal publication. Each time you send a list of modifications to be applied and the modified pdf file comes back to you, usually with some additional modifications that you didn’t ask for. How to spot these change easily and to make sure they are relevant?
Problem
The solution that come to mind is to use a pdf diff tool, similar to what you usually use for text files or source code: diff, diff3, kdiff3, meld are such examples. However, a quick search on the internet couldn’t reveal any tool fulfilling the conditions (being free and open source are mandatory conditions).
Solution
The easy trick is to first convert these pdf to text file and then use the usual text comparison programs:
pdftotext file1.pdf pdftotext file2.pdf kdiff3 file1.txt file2.txt
and that’s it, you are now able to see what has been modified between the two files. Don’t count on that to be able to make a merge though!
This entry was posted on Tuesday, May 13th, 2008 at 01:56 CEST and is filed under Linux. You can follow any responses to this entry through the RSS 2.0 feed. You can leave a response, or trackback from your own site.
June 16th, 2008 at 09:11 CEST
Or even better in your .vimrc:
autocmd BufReadPre *.pdf set ro
autocmd BufReadPre *.pdf set hlsearch!
autocmd BufReadPost *.pdf silent %!pdftotext -layout -nopgbrk “%” - |fmt -csw78
It saves you from the two extra steps (manual pdftotext).