azzhu/document-image-word-spotting

INPUT and OUTPUT

input: an image (for example, a photo of a handwritten English composition)

output: a list of images, one for each line of the English composition in the input image
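
The input/output contract above can be illustrated with a minimal sketch. It assumes the row positions are already known; the function name `crop_lines`, the `pad` parameter, and the `(top, bottom)` band format are hypothetical and are not taken from this project's code.

```python
import numpy as np


def crop_lines(image, bands, pad=0):
    """Cut an image into one sub-image per text line.

    image: 2-D (grayscale) or 3-D (color) array, rows first.
    bands: list of (top, bottom) row indices, bottom exclusive (assumed format).
    pad:   extra rows kept above and below each band, clipped to the image.
    """
    image = np.asarray(image)
    height = image.shape[0]
    lines = []
    for top, bottom in bands:
        # Widen the band by the padding, but stay inside the image.
        top = max(0, top - pad)
        bottom = min(height, bottom + pad)
        lines.append(image[top:bottom].copy())
    return lines


# Usage: a 6x10 dummy "page" split into two hand-picked bands.
page = np.arange(60).reshape(6, 10)
line_images = crop_lines(page, [(0, 2), (3, 6)], pad=1)
```

Each element of `line_images` is an independent array that could be passed to a recognition model on its own.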

The effect is as follows: input

Brief Description

This project targets the image preprocessing stage of English handwriting recognition. Recognition is much easier once the image has been cut into lines as shown above, but the hardest part is often not designing the recognition network. It is the preprocessing that comes before it: how to cut an image into lines across all the cases that occur in practice. That is what this project is about.

It is not limited to English: handwritten Chinese compositions can be cut into lines as well, and printed text is easier to cut than handwriting.

Algorithm Principle

The algorithm has two main steps. The first step computes a heatmap of the text area. The second step uses the heatmap to plan the rows.

Core idea: an accumulation of rules.

For the specifics, read the code. It is quite simple: essentially a collection of rules.

The heatmap in the first step is computed with a U-Net (deep learning). The results are quite good, but it is somewhat slow.
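
To give a feel for how a text-area heatmap can be turned into rows, here is a minimal sketch of one common rule, a horizontal projection profile. This is an assumption for illustration only, not the rule set this project actually uses; the function name `plan_rows` and the parameters `threshold` and `min_height` are hypothetical.

```python
import numpy as np


def plan_rows(heatmap, threshold=0.5, min_height=3):
    """Find text-line bands in a heatmap (illustrative rule, not the project's).

    heatmap:    2-D array, higher values mean "more likely text".
    threshold:  fraction of the peak row score a row must exceed.
    min_height: bands with fewer rows than this are discarded as noise.
    Returns a list of (top, bottom) row indices, bottom exclusive.
    """
    heatmap = np.asarray(heatmap, dtype=float)
    # Average text score of every image row.
    profile = heatmap.mean(axis=1)
    peak = profile.max()
    if peak <= 0:
        return []
    is_text = profile > threshold * peak
    # Pad with False so runs touching the image border are still closed.
    padded = np.concatenate(([False], is_text, [False]))
    changes = np.flatnonzero(padded[1:] != padded[:-1])
    starts, ends = changes[0::2], changes[1::2]
    return [(int(s), int(e)) for s, e in zip(starts, ends) if e - s >= min_height]


# Usage: a synthetic heatmap with two text lines and one row of noise.
demo = np.zeros((20, 10))
demo[2:6] = 1.0
demo[10:15] = 1.0
demo[18] = 1.0
rows = plan_rows(demo)
```

A single rule like this only works when lines are roughly horizontal and well separated; skewed or touching lines need additional rules, which is consistent with the "accumulation of rules" idea described above.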

This code is written mainly in Python to meet the needs of other projects. C++ would certainly be faster; in fact, I originally wrote it in C++ (C++ code included) and later converted it to Python. The code was written early on, when I had just learned Python, so it is messy and has a lot of C++ style (facepalm). Please overlook that and just look at the results.

  • img
  • img
  • img
  • img
  • img
  • img

More Experimental Results

  • img

  • img

  • img

  • img

Please leave a message if you have any questions

If you run into any problems while using it, feel free to send me feedback. You can also reach me through the following contact information.

About

Segments a handwritten English composition line by line.
