如何使用Google Cloud Vision API阅读一栏文字

编程入门行业动态更新时间:2024-10-27 05:37:08

本文介绍了如何使用Google Cloud Vision API阅读一栏文字的处理方法，对大家解决问题具有一定的参考价值，需要的朋友们下面随着小编来一起学习吧！问题描述

我有下一个文档图像

当我尝试将图像转换为文本时，结果是下一个:

When I try to convert the image to text, the result is the next:

热门文字

参考:Rad:Dte:Ddo:

Ejecutivo 76520400300 Bogo deBogotáLuz Adriana银行

按钮文本

问题是Google API像两列一样将其重新识别，因此，如何配置Google API以获取一列文字?

The problem is Google API recongnize it like two columns so, How can I config the Google API in order to obtain one column text?

我的目标是获得:

热门文字

参考号:Ejecutivo Rad:76520400300 Dte:Banco deBogotáDdo:Luz Adriana

按钮文本

推荐答案

Cloud Vision API 没有特定的请求属性来指定用于读取或排序文件数据的格式.相反，我认为可用的解决方法是使用 BoundingPoly 和顶点响应属性，显示与图像中包含的每个单词相关的坐标，以便处理代码逻辑中的顶点数据并定义需要按列和行.您可以查看此链接，其中包括一些包含以下内容的响应示例属性.

Cloud Vision API doesn't have a specific request property to specify the format used to read or sort the file's data. Instead, I think that the available workaround is to use the BoundingPoly and Vertex response properties, that display the coordinates related to each word contained in the image, in order to process the vertices data within your code logic and define the text that need to be grouped by columns and rows. You can take a look on this link which includes some response examples that include these properties.

如果此功能不能满足您的当前需求，则可以使用发送反馈按钮，该按钮位于服务公共文档，并查看问题跟踪器工具，以便提出Vision API功能请求，并将此所需功能通知Google.

In case this feature doesn't cover your current needs, you can use the Send Feedback button, located at the lower left and upper right corners of the service public documentation, as well as take a look the Issue Tracker tool in order to raise a Vision API feature request and notify to Google about this desired functionality.

更多推荐

如何使用Google Cloud Vision API阅读一栏文字

本文发布于:2023-10-14 19:06:56，感谢您对本站的认可！

本文链接:https://www.elefans.com/category/jswz/34/1491943.html