I have 5 PDFs containing around 38,000 objective (multiple-choice) questions. I want to build an application that imports these questions, saves them to a database, and then gives the user an interface for choosing questions, each with its four options. I used iTextSharp to read the PDFs, both as a chunk and line by line. The extracted content is scattered, and I cannot figure out a pattern by which I can split or differentiate between each question and its four options. Is there a better way to import data from PDFs? The content in the PDFs is in tabular format.
Please check the PDF file (s8.postimage.org/owm0hsej9/Qbank.jpg)
and the resulting string in the window (s13.postimage.org/4toy70lqf/Resulting_String.jpg).