Written by

Question Surendiran S · Dec 20, 2016

Reading PDF File in Zen

Hi Frnds, I am beginner of Cache. I know how to copy from a text file to text file. But i'm facing problem while reading PDF directly using Files concepts. Please provide some solution for copying content from PDF to TXT file.

Thanks

Comments

Dmitry Maslennikov · Dec 20, 2016

Before doing such task, you should know a bit more about technologies which you going to use. PDF - it is a binary format, so, you can't just copy text from PDF. Some of the files may contain a text, some just a picture with text. And to get plain text from PDF, you have to use some third party OCR software. One of the leaders is ABBYY FineReader it is proprietary, but if you need you can find some products for free.

0
Jonathan Ebbers · Dec 21, 2016

I have a similar question - I've been asked to annotate a PDF file (in Intersystems Cache).  That is, take in an existing PDF (printable form with areas for people to write in) and use Cache to update it (automatically fill the fields in).

Does Zen have the ability to set a starting/initial PDF?  I see the ability to define an XML format to write in, but that's not exactly what I need, because that will generate my "answers".

I'd like to overlay my "answers" on top of an original PDF.  Do you know how to do that?

0
Dmitry Maslennikov  Dec 22, 2016 to Jonathan Ebbers

It is a very different question, please create as a separate one. 

0