Sidebar

How to parse PDF file and convert the data to JSON?

0 votes
347 views
asked Apr 22, 2021 by deepthi-a-5184 (240 points)

1 Answer

0 votes
Discreet data in xml, HL7, or some other format is really required. This is because PDFs are not consistent in the layout. Text can be organized by single characters, words, sentences, or even text is an image in the pdf. There is some third-party jar files that will open PDFs. However, due to the many different ways a PDF can be constructed, it is very difficult to do this.
answered Apr 22, 2021 by brandon-w-8204 (33,270 points)
...