python - 无法在Python中获得从扫描图像提取的文本的准确输出（将扫描的pdf /图像转换为文本） - Thinbug

无法在Python中获得从扫描图像提取的文本的准确输出（将扫描的pdf /图像转换为文本）

时间：2019-04-05 13:26:16

标签： python python-imaging-library ocr python-tesseract

“在Python中无法准确输出从扫描图像提取的文本（将扫描的pdf /图像转换为文本）”

这是我尝试过的无法提供准确结果的代码：

img = PIL.Image.open（“ r.jpeg”） arr = numpy.array（img）打印（arr）

cv2.imshow（“输出”，arr） cv2.imwrite（'Output Image.jpg'，arr） cv2.waitKey（0）

output = pytesseract.image_to_string（PIL.Image.open（'Output Image.PNG'）。convert（“ RGB”），lang ='eng'）打印（输出）[在此处输入图像说明]

这是'r.jpg'扫描图像1

0 个答案:

没有答案