Handwriting detector, extractor, and language classifier

專利號(hào)

US11176361B2

公開日期

2021-11-16

申請(qǐng)人

Raytheon Company（US MA Waltham）

發(fā)明人

Darrell L. Young; Kevin C. Holley

IPC分類

G06F40/171; G06F40/263; G06K9/00; G06K9/34; G06K9/38; G06K9/62; G06K9/68; G06K9/72

技術(shù)領(lǐng)域

language,may,or,in,bounding,be,hardware,features,geometric,image

地域： MA MA Waltham

摘要

Disclosed are methods for handwriting recognition. In some aspects, an image representing a page of a sample document is analyzed to identify a region having indications of handwriting. The region is analyzed to determine frequencies of a plurality of geometric features within the region. The frequencies may be compared to profiles or histograms of known language types, to determine if there are similarities between the frequencies in the sample document relative to those of the known language types. In some aspects, machine learning may be used to characterize the document as a particular language type based on the frequencies of the geometric features.

說明書

1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52

FIG. 9 lists feature tokens detected in handwriting and used to build n-gram feature vector histograms. In various embodiments, to classify the language, almost 100 individual features were used to capture the uniqueness of each language. For example, the French phrase, “Où dans la forêt le gar?on étudiant na?f?” illustrates all five French accent marks: 1) grave; 2) circumflex; 3) cedilla; 4) acute; and 5) umlaut. The appearance of these marks are detected and encoded as features. Each feature was assigned a number. Detectors were designated for each of features. There were other features such as a unique arrangement of circles and lines found in Korean, “ custom character ”, (“beauty is in the eye of the beholder”), the curves of Arabic, “”, (“be patient”), the multiple orthogonal intersections of Chinese, “” (“l(fā)ove at first sight”), and so on for Japanese, Urdu, Persian, Bengali, Hindu, Portuguese, Russian, Swahili, Tamil, Telugu, and Turkish.

白丝美女被狂躁免费视频网站,500av导航大全精品,yw.193.cnc爆乳尤物未满,97se亚洲综合色区,аⅴ天堂中文在线网官网

Handwriting detector, extractor, and language classifier

摘要

說明書

權(quán)利要求

白丝美女被狂躁免费视频网站,500av导航大全精品,yw.193.cnc爆乳尤物未满,97se亚洲综合色区,аⅴ天堂中文在线网官网

Handwriting detector, extractor, and language classifier

摘要

說明書

權(quán)利要求

該功能需要專業(yè)版企業(yè)版VIP權(quán)限，您可以：

該功能需要專業(yè)版企業(yè)版VIP權(quán)限，您可以：