Answers

How to read text on PDF file and Image File using C# ?

tushar gawande

278

HI,

We have an application which Gets a Scanned paper PDF files,

Our requirement is to read the text present on those files and Save that text while user Uploads a PDF at that time.

Is it possible to do with.Net or please tell me if any other approaches are available.

Answers (2)

With LEADTOOLS OCR features you can recognize text from almost all common image formats along with PDF files.
You can achieve that using LEADTOOLS with .NET code similar to this:

IOcrEngine _ocrEngine;
_ocrEngine = OcrEngineManager.CreateEngine(OcrEngineType.Advantage, false);
_ocrEngine.Startup(null, null, null, null);
IOcrDocument ocrDoc = _ocrEngine.DocumentManager.CreateDocument();
ocrDoc.Pages.AddPage(inputFileName, null);
ocrDoc.Pages[0].Recognize(null);
// Save the result to a disk as plain text
string outputFileName = "output.txt";
ocrDoc.Save(outputFileName, Leadtools.Forms.DocumentWriters.DocumentFormat.Text, null);
_ocrEngine.Shutdown();

To see how it works, check this link:
http://demo.leadtools.com/JavaScript/OCR/index.html

Hi Tushar,

Refer below links

https://www.e-iceblue.com/Tutorials/Spire.PDF/Spire.PDF-Program-Guide/Read-PDF-Read-PDF-Images-and-Text-in-C-VB.NET.html

http://www.nullskull.com/q/10465415/read-image-text-from-pdf-file-to-itextsharp-in-aspnet-c.aspx

Next Recommended Forum

Handling JSON arrays returned from ASP.NET web services with

Export gridview data to excel sheet

Upcoming events

View all

Forum Statistics

Please welcome our newest member .
users have contributed to threads and
In the past 24 hours, we have new threads, new posts, and new users.
In last week, the most popular thread is .

How to read text on PDF file and Image File using C# ?

Mohamed Abedallah

Nilesh Patil