
Handwriting detector, extractor, and language classifier

Patent number
US11176361B2
Publication date
2021-11-16
Applicant
Raytheon Company (US MA Waltham)
Inventors
Darrell L. Young; Kevin C. Holley
IPC classification
G06F40/171; G06F40/263; G06K9/00; G06K9/34; G06K9/38; G06K9/62; G06K9/68; G06K9/72
Technical field
language, may, or, in, bounding, be, hardware, features, geometric, image
Region: Waltham, MA

Abstract

Disclosed are methods for handwriting recognition. In some aspects, an image representing a page of a sample document is analyzed to identify a region having indications of handwriting. The region is analyzed to determine frequencies of a plurality of geometric features within the region. The frequencies may be compared to profiles or histograms of known language types, to determine if there are similarities between the frequencies in the sample document relative to those of the known language types. In some aspects, machine learning may be used to characterize the document as a particular language type based on the frequencies of the geometric features.
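The comparison step described above can be illustrated with a minimal sketch. It assumes, purely for illustration, that geometric features are counted by name and that each known language type is stored as a profile of relative frequencies. The feature names, the profile values, and the choice of cosine similarity as the comparison measure are all hypothetical; the source does not specify them.

```python
import math
from collections import Counter


def normalize(counts):
    """Convert raw feature counts into relative frequencies."""
    total = sum(counts.values())
    if total == 0:
        return {}
    return {feature: n / total for feature, n in counts.items()}


def cosine_similarity(p, q):
    """Cosine similarity between two sparse frequency dictionaries."""
    dot = sum(value * q.get(key, 0.0) for key, value in p.items())
    norm_p = math.sqrt(sum(v * v for v in p.values()))
    norm_q = math.sqrt(sum(v * v for v in q.values()))
    if norm_p == 0 or norm_q == 0:
        return 0.0
    return dot / (norm_p * norm_q)


def closest_language(region_counts, profiles):
    """Return the best-matching language and the score for every profile."""
    freqs = normalize(region_counts)
    scores = {lang: cosine_similarity(freqs, prof) for lang, prof in profiles.items()}
    best = max(scores, key=scores.get)
    return best, scores


# Hypothetical profiles of known language types (illustrative values only).
profiles = {
    "french": {"accent_mark": 0.3, "curve": 0.5, "circle": 0.2},
    "chinese": {"orthogonal_intersection": 0.7, "curve": 0.2, "circle": 0.1},
}

# Hypothetical feature counts measured in one handwriting region.
region = Counter({"orthogonal_intersection": 12, "curve": 4, "circle": 1})

best, scores = closest_language(region, profiles)
print(best, scores)
```

A machine learning classifier, as the abstract also mentions, could take the same normalized frequencies as its input vector instead of using a fixed similarity measure.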

Description

FIG. 9 lists feature tokens detected in handwriting and used to build n-gram feature vector histograms. In various embodiments, almost 100 individual features were used to classify the language by capturing what is unique to each language. For example, the French phrase “Où dans la forêt le garçon étudiant naïf?” illustrates all five French accent marks: 1) grave; 2) circumflex; 3) cedilla; 4) acute; and 5) umlaut. The appearance of each of these marks is detected and encoded as a feature. Each feature was assigned a number, and a detector was designated for each feature. Other features included the unique arrangement of circles and lines found in Korean, “custom character” (“beauty is in the eye of the beholder”), the curves of Arabic, “custom character” (“be patient”), the multiple orthogonal intersections of Chinese, “custom character” (“love at first sight”), and so on for Japanese, Urdu, Persian, Bengali, Hindi, Portuguese, Russian, Swahili, Tamil, Telugu, and Turkish.
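As a rough illustration of how numbered feature tokens might be turned into an n-gram feature vector histogram, the sketch below counts runs of consecutive token numbers and normalizes the counts. The token numbers, the bigram length, and the function names `ngram_histogram` and `to_vector` are assumptions made for this example; the source does not give the actual numbering or n-gram size.

```python
from collections import Counter


def ngram_histogram(tokens, n=2):
    """Count every run of n consecutive feature tokens."""
    return Counter(tuple(tokens[i:i + n]) for i in range(len(tokens) - n + 1))


def to_vector(hist, vocabulary):
    """Project a histogram onto a fixed n-gram vocabulary as relative frequencies."""
    total = sum(hist.values())
    if total == 0:
        return [0.0 for _ in vocabulary]
    return [hist.get(gram, 0) / total for gram in vocabulary]


# Hypothetical feature numbers emitted by the detectors, in reading order.
tokens = [3, 7, 3, 7, 12]

hist = ngram_histogram(tokens, n=2)
vocabulary = sorted(hist)            # fixed ordering of the observed bigrams
vector = to_vector(hist, vocabulary)

print(hist)
print(vocabulary, vector)
```

In practice the vocabulary would be fixed in advance across all languages so that vectors from different documents are comparable; deriving it from a single sample, as here, only keeps the example short.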

Claims

1