最新消息:雨落星辰是一个专注网站SEO优化、网站SEO诊断、搜索引擎研究、网络营销推广、网站策划运营及站长类的自媒体原创博客

python - drawing from OpenCV fillConvexPoly() does not match the input polygon - Stack Overflow

programmeradmin0浏览0评论

I'm trying to follow the solution detailed at this question to prepare a dataset to train a CRNN for HTR (Handwritten Text Recognition). I'm using eScriptorium to adjust text segmentation and transcription, exporting in ALTO format (one XML with text region coordinates for each image) and parsing the ALTO XML to grab the text image regions and export them individually to create a dataset.

The problem I'm finding is that I have the region defined at eScriptorium, like this:

But when I apply this code from the selected solution for the above linked question:

# Initialize mask
mask = np.zeros((img.shape[0], img.shape[1]))

# Create mask that defines the polygon of points
cv2.fillConvexPoly(mask, pts, 1)
mask = mask > 0 # To convert to Boolean

# Create output image (untranslated)
out = np.zeros_like(img)
out[mask] = img[mask]

and display the image I get some parts of the text region filled:

As you can see, some areas that should be inside the mask are filled and, therefore, the image pixels in them are not copied. I've made sure the pixels that make the polygon are correctly parsed and handed to OpenCV to build the mask. I can't find the reason why those areas are filled and I wonder if anyone got into a similar problem and managed to find out the reason or how to avoid it.

TIA

I'm trying to follow the solution detailed at this question to prepare a dataset to train a CRNN for HTR (Handwritten Text Recognition). I'm using eScriptorium to adjust text segmentation and transcription, exporting in ALTO format (one XML with text region coordinates for each image) and parsing the ALTO XML to grab the text image regions and export them individually to create a dataset.

The problem I'm finding is that I have the region defined at eScriptorium, like this:

But when I apply this code from the selected solution for the above linked question:

# Initialize mask
mask = np.zeros((img.shape[0], img.shape[1]))

# Create mask that defines the polygon of points
cv2.fillConvexPoly(mask, pts, 1)
mask = mask > 0 # To convert to Boolean

# Create output image (untranslated)
out = np.zeros_like(img)
out[mask] = img[mask]

and display the image I get some parts of the text region filled:

As you can see, some areas that should be inside the mask are filled and, therefore, the image pixels in them are not copied. I've made sure the pixels that make the polygon are correctly parsed and handed to OpenCV to build the mask. I can't find the reason why those areas are filled and I wonder if anyone got into a similar problem and managed to find out the reason or how to avoid it.

TIA

Share Improve this question edited yesterday Christoph Rackwitz 15.7k5 gold badges39 silver badges51 bronze badges asked 2 days ago Ricardo Palomares MartínezRicardo Palomares Martínez 1,1779 silver badges13 bronze badges 5
  • Not near a machine to test, but maybe you need to specify a dtype of np.uint8 when you create the mask, e.g. mask = np.zeros((h,w), dtype=np.uint8) – Mark Setchell Commented 2 days ago
  • Please show your mask – fmw42 Commented 2 days ago
  • Thanks, @Mark-setchell, for your suggestion. Apparently, that was not the reason, though (but it didn't harm, either). – Ricardo Palomares Martínez Commented yesterday
  • Thanks for your comment, @fmw42. I presume you mean the array of points defining the mask, right? It is too long to fit this comment. The head of it is this: [[2690, 152], [2647, 152], [2645, 152], [2582, 154], [2580, 154], [2578, 154], [2542, 179], [2488, 163] It started in the right top side, although not in the corner, but the most elevated point over the "n" in "note" and moves through the left. In the end, Christoph's suggestion of using fillPoly did the trick. – Ricardo Palomares Martínez Commented yesterday
  • Actually I meant your created mask image. But Christoph Rackwitz has identified the issue. I agree with him that your mask is not convex. – fmw42 Commented yesterday
Add a comment  | 

1 Answer 1

Reset to default 1

You called cv.fillConvexPoly(). Your polygon is not convex. The algorithm assumed it to be convex and took some shortcuts to simplify the drawing code, so it came out wrong.

Use cv.fillPoly() instead. That will draw non-convex polygons correctly.

As you point out, the function signatures are not drop-in compatible. fillPoly() works on a list of polygons, while fillComplexPoly() just takes a single polygon.

cv.fillConvexPoly(img, points, color)
# would be replaced with
cv.fillPoly(img, [points], color) # list of one polygon

Each polygon should be a numpy array of shape (N, 1, 2) and it probably needs to be of an integer dtype too, although I'm not sure about that now and it might support floating point dtype in the future.

发布评论

评论列表(0)

  1. 暂无评论