0
With LEADTOOLS OCR features you can recognize text from almost all common image formats along with PDF files.
You can achieve that using LEADTOOLS with .NET code similar to this:
- IOcrEngine _ocrEngine;
- _ocrEngine = OcrEngineManager.CreateEngine(OcrEngineType.Advantage, false);
- _ocrEngine.Startup(null, null, null, null);
- IOcrDocument ocrDoc = _ocrEngine.DocumentManager.CreateDocument();
- ocrDoc.Pages.AddPage(inputFileName, null);
- ocrDoc.Pages[0].Recognize(null);
-
- string outputFileName = "output.txt";
- ocrDoc.Save(outputFileName, Leadtools.Forms.DocumentWriters.DocumentFormat.Text, null);
- _ocrEngine.Shutdown();
To see how it works, check this link:
http://demo.leadtools.com/JavaScript/OCR/index.html
0
Hi Tushar,
Refer below links
https://www.e-iceblue.com/Tutorials/Spire.PDF/Spire.PDF-Program-Guide/Read-PDF-Read-PDF-Images-and-Text-in-C-VB.NET.html
http://www.nullskull.com/q/10465415/read-image-text-from-pdf-file-to-itextsharp-in-aspnet-c.aspx