Computing system for extraction of textual elements from a document

Patent number
US11176364B2
Publication date
2021-11-16
Applicant
Hyland Software, Inc. (US OH Westlake)
Inventors
Ralph Meier; Thorsten Wanschura; Johannes Hausmann; Harry Urbschat
IPC classification
G06K9/00; G06K9/20; G06T7/70; G06K9/72; G06T7/50; G06K9/62
Technical field
textual, document, text, computer, readable, in, extraction, element, computing, documents
Region: OH Westlake

Abstract

Described herein are various technologies pertaining to text extraction from a document. A computing device receives the document. The document comprises computer-readable text and a layout, wherein the layout defines positions of the computer-readable text within a two-dimensional area represented by the document. Responsive to receiving the document, the computing device identifies at least one textual element in the computer-readable text based upon spatial factors between portions of the computer-readable text and contextual relationships between the portions of the computer-readable text. The computing device then outputs the at least one textual element.

Description

In operation, a computing device that executes the textual extraction application receives a document comprising computer-readable text and a layout. The computer-readable text may include letters, numbers, punctuation, and/or mathematical symbols. The layout defines positions of the computer-readable text within a two-dimensional area represented by the document. The document may have a defined type, wherein the defined type is indicative of a purpose of the document. In an example, a defined type of a document may be an educational transcript, and as such, computer-readable text of the educational transcript may be indicative of classes taken by a student, credit hours received by the student for the classes, and grades that the student received in the classes. In a further example, portions of the computer-readable text and/or the layout of the document may not have been encountered previously by the textual extraction application.
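As an illustration only, a document of this kind could be represented as text portions paired with their positions in a two-dimensional area. The sketch below is not taken from the patent: the names `TextPortion` and `Document`, the bounding-box fields, and the top-left coordinate origin are all assumptions made for the example.

```python
from dataclasses import dataclass, field
from typing import List, Optional


@dataclass(frozen=True)
class TextPortion:
    """A run of computer-readable text plus its position in the layout.

    Hypothetical structure: coordinates are assumed to be in page units
    with the origin at the top-left corner of the document.
    """
    text: str
    x: float       # left edge of the bounding box
    y: float       # top edge of the bounding box
    width: float
    height: float


@dataclass
class Document:
    """Computer-readable text together with its layout.

    doc_type is the optional "defined type" (for example, an educational
    transcript); it may be None for a previously unseen kind of document.
    """
    doc_type: Optional[str] = None
    portions: List[TextPortion] = field(default_factory=list)


# Example: two portions from a hypothetical educational transcript.
transcript = Document(
    doc_type="educational transcript",
    portions=[
        TextPortion("Grade:", x=10, y=20, width=40, height=10),
        TextPortion("A", x=60, y=20, width=10, height=10),
    ],
)
```

Keeping the position on each portion, rather than in a separate layout table, is simply one convenient choice for the later spatial calculations; other representations would serve equally well.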

Responsive to receiving the document, the textual extraction application identifies at least one textual element in the computer-readable text based upon spatial factors between portions of the computer-readable text in the document and contextual relationships between the portions of the computer-readable text. The spatial factors may include distances between the portions of the computer-readable text, angles between the portions of the computer-readable text and an axis of the document, and/or orderings between the portions of the computer-readable text. The textual extraction application may calculate the spatial factors based upon the positions of the computer-readable text within the document. The contextual relationships are determined via at least one computer-implemented model. Exemplary contextual relationships include source to object, object to use, person to location, whole to part, and/or type to subtype.
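The three spatial factors named above (distance, angle relative to a document axis, and ordering) can be sketched from bounding-box positions as follows. This is a minimal illustration under stated assumptions, not the patented method: the `TextPortion` type, the use of box centers, the choice of the horizontal axis, and the top-to-bottom, left-to-right ordering rule are all hypothetical. The computer-implemented model that determines contextual relationships is not covered here.

```python
import math
from typing import List, NamedTuple, Tuple


class TextPortion(NamedTuple):
    """Hypothetical text portion with a bounding box (origin at top-left)."""
    text: str
    x: float
    y: float
    width: float
    height: float


def center(p: TextPortion) -> Tuple[float, float]:
    """Center point of the portion's bounding box."""
    return (p.x + p.width / 2, p.y + p.height / 2)


def distance(a: TextPortion, b: TextPortion) -> float:
    """Euclidean distance between the centers of two portions."""
    (ax, ay), (bx, by) = center(a), center(b)
    return math.hypot(bx - ax, by - ay)


def angle_to_horizontal_axis(a: TextPortion, b: TextPortion) -> float:
    """Angle, in degrees, of the line from a to b relative to the
    document's horizontal axis (an assumed choice of axis)."""
    (ax, ay), (bx, by) = center(a), center(b)
    return math.degrees(math.atan2(by - ay, bx - ax))


def ordering(portions: List[TextPortion]) -> List[TextPortion]:
    """Order portions top-to-bottom, then left-to-right (assumed rule)."""
    return sorted(portions, key=lambda p: (p.y, p.x))


label = TextPortion("Grade:", x=10, y=20, width=40, height=10)
value = TextPortion("A", x=60, y=20, width=10, height=10)
header = TextPortion("Course", x=10, y=5, width=50, height=10)

print(distance(label, value))                  # 35.0
print(angle_to_horizontal_axis(label, value))  # 0.0
print([p.text for p in ordering([value, label, header])])
# ['Course', 'Grade:', 'A']
```

A short distance and a near-zero angle between "Grade:" and "A" suggest the two sit on the same line, which is the kind of spatial evidence that could be combined with a contextual relationship when deciding that "A" is the value of a grade field.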

Claims

1