
Convert image to text in .NET

2014-10-20 11:24
http://www.dotnetspider.com/resources/45094-Convert-image-text-net.aspx

This article describes an image-to-text converter built with MODI (Microsoft Office Document Imaging), which adds programming features to the Office document scanning and viewing tools. With it, we can easily convert a scanned document to text.




[b]What is MODI?

MODI (Microsoft Office Document Imaging) is a library included in the Microsoft Office package. It is used to extract text and layout information from scanned pages by using OCR (optical character recognition).

Adding MODI to the project

Add a reference to your project from the COM tab: Microsoft Office Document Imaging 11.0 Type Library.

Note: If you have installed Microsoft Office 2003, MODI is installed by default. For Office 2007, you have to add it from the installation package: run the package and add MODI. Unfortunately, Microsoft removed MODI in Office 2010, but you can still get it by installing Microsoft Office SharePoint Designer 2007.

For installation steps, refer to the link in the original article.
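
Because MODI may or may not be present depending on the Office version, it can help to check for it before running any OCR code. The following is a minimal sketch, not part of the original article. It assumes the library registers the COM ProgID "MODI.Document", and ModiCheck is a hypothetical helper name. A process whose bitness differs from the installed Office may also fail to see the registration.

```csharp
using System;

// Hypothetical helper: checks whether the MODI COM class is registered.
static class ModiCheck
{
    // Assumption: MODI registers the ProgID "MODI.Document".
    // Type.GetTypeFromProgID returns null when the ProgID cannot be resolved.
    public static bool IsModiInstalled()
    {
        return Type.GetTypeFromProgID("MODI.Document") != null;
    }

    static void Main()
    {
        Console.WriteLine(IsModiInstalled()
            ? "MODI appears to be registered."
            : "MODI was not found; install it before running the OCR code.");
    }
}
```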

Once the reference is added, more than half of your work is done.

1) Create a form and add a text box and a button to upload the image.

2) Design the form however you like; here I designed it according to my requirements.

3) Add a picture box to display the image and a text box to display the text.

4) Set the Multiline property of the output text box to true.

5) In the Click event handler of the upload button, write the following code.

The code is here:
[/b]

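// Requires "using System.Text;" (for StringBuilder) and the MODI COM reference.
private void btnUpload_Click(object sender, EventArgs e)
{
    OpenFileDialog op1 = new OpenFileDialog();

    // Stop if the user cancels the dialog; FileName would be empty.
    if (op1.ShowDialog() != DialogResult.OK)
        return;

    txtUpload.Text = op1.FileName;
    string filename = txtUpload.Text;

    // Show the selected image next to the recognized text.
    pictureBox1.ImageLocation = filename;

    MODI.Document DOC = new MODI.Document();
    StringBuilder str = new StringBuilder();
    MODI.Word word = null;
    MODI.Image image = null;
    MODI.Layout layout = null;

    // Load the image file and run OCR on it.
    DOC.Create(filename);
    DOC.OCR(MODI.MiLANGUAGES.miLANG_ENGLISH, true, true);

    // Walk every page (image) and collect its recognized words.
    for (int i = 0; i < DOC.Images.Count; i++)
    {
        image = (MODI.Image)DOC.Images[i];
        layout = image.Layout;
        for (int j = 0; j < layout.Words.Count; j++)
        {
            word = (MODI.Word)layout.Words[j];
            str.Append(" " + word.Text);
        }
    }

    // Trim the leading space added before the first word.
    textBox1.Text = str.ToString().Trim();
    DOC.Close(false); // close without saving changes
}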
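
The handler above mixes UI code with the OCR calls and does not handle failures. As a hedged sketch (not the original author's code), the OCR part could be moved into a separate method that always closes the document and lets the caller deal with errors. OcrHelper and ExtractText are hypothetical names; the MODI calls are the same ones used in the article, and the project still needs the MODI COM reference.

```csharp
using System;
using System.Runtime.InteropServices;
using System.Text;

// Hypothetical helper, not part of the original article.
static class OcrHelper
{
    // Returns the recognized words of all pages, separated by single spaces.
    // COM errors raised by MODI propagate to the caller as COMException.
    public static string ExtractText(string imagePath)
    {
        MODI.Document doc = new MODI.Document();
        bool opened = false;
        try
        {
            doc.Create(imagePath);
            opened = true;
            doc.OCR(MODI.MiLANGUAGES.miLANG_ENGLISH, true, true);

            StringBuilder text = new StringBuilder();
            for (int i = 0; i < doc.Images.Count; i++)
            {
                MODI.Layout layout = ((MODI.Image)doc.Images[i]).Layout;
                for (int j = 0; j < layout.Words.Count; j++)
                {
                    if (text.Length > 0) text.Append(' ');
                    text.Append(((MODI.Word)layout.Words[j]).Text);
                }
            }
            return text.ToString();
        }
        finally
        {
            // Close only a document that was opened, then release the COM object.
            if (opened) doc.Close(false);
            Marshal.ReleaseComObject(doc);
        }
    }
}
```

Inside the button handler, the call could then be wrapped so that a failed recognition shows a message instead of crashing the form:

```csharp
try
{
    textBox1.Text = OcrHelper.ExtractText(op1.FileName);
}
catch (COMException ex)
{
    MessageBox.Show("OCR failed: " + ex.Message);
}
```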


Now you can run your project and check the result.

Or you can check the attachment: I have attached the project.