Last Active: Aug 06, 2024
Threads:
0
Posts:
3
Reputation:
0
Jul 04, 2020, 22:32 pm
(This post was last modified: Jul 04, 2020, 22:36 pm by graatch. Edited 1 time in total.
Edit Reason: additional info
)
One of the more difficult conversions to get right, simply due to how variously (and badly) PDFs can be put together. You're usually better off converting to docx from PDF using Acrobat (arr) if the thing has a good text overlay built into it or an acceptable OCR. I'm not an academic, I'm a starving autodidact, but I gather that scholars who read a lot of pdfs end up getting devices that can handle even really badly constructed pdfs. Someday.
Though I hope someday someone does something like this: create a set of categorizations for different sorts of PDF. i.e. Does this pdf have a lot of images? How does this pdf break up text from line to line? And then you could have filters which we'd all refer to for a conversion, to make the epub turn out better, since a lot of the problems you see again and again. This will still never get everything right. But then, we don't even have epub/mobi standards that get too widely followed.
Sometimes I still prefer PDF to epub, due to better options for annotations WITH A REASONABLY SIMPLE WAY TO GO OVER THEM LATER if you have Acrobat (arr), or one of the other programs and apps which can put highlights and notes into the pdf. I'll recommend sumnotes as well for that, they have a very good service.