白丝美女被狂躁免费视频网站,500av导航大全精品,yw.193.cnc爆乳尤物未满,97se亚洲综合色区,аⅴ天堂中文在线网官网

Method for generating text string dictionary, method for searching text string dictionary, and system for processing text string dictionary

專利號
US10867134B2
公開日期
2020-12-15
申請人
HITACHI HIGH-TECHNOLOGIES CORPORATION(JP Tokyo)
發(fā)明人
Kouichi Kimura
IPC分類
G06F40/30; G06F16/00; H03M7/30; G16B30/00; G06F40/242
技術(shù)領(lǐng)域
string,text,multicore,in,block,process,link,cpu,dictionary,registered
地域: Tokyo

摘要

A multicore CPU of a text string data analyzing device: loads a plurality of blocks obtained by dividing a text string dictionary into a memory; executes, in parallel on block groups executable independently of each other, an entry registration process of registering, character by character, unregistered text strings of text string data as new entries in the blocks in order from last characters; and outputs, as BW transformed data of the text string dictionary in which the text string data is already registered, a text string obtained by coupling text strings registered in entries of the blocks in a state in which no unregistered text strings of the blocks exists.

說明書

As described with S301, when r=2 and alphabetical characters are of the two types of A and T, blocks ($ block, A$ block, T$ block, AA block, AT block, TA block, and TT block) of seven types are used. Specifically, the number of blocks can be calculated according to an equation “(the number of blocks)=((h(r+1)?1)/(h?1)” obtained by generalizing an equation “7=23?1”. Then, the multicore CPU 101 can calculate the number of blocks based on the number of CPU cores so that (the number of CPU cores)×K=(the number of blocks) (K is, for example, a constant number in a range of 10 to 90 indicating several tens of times). Then, the multicore CPU 101 can substitute the calculated number of blocks into the equation obtained by the generalization, thereby automatically calculating the parameter r.

When the positive integer parameter r is increased, the number of blocks is exponentially increased. Thus, the value of r can be determined so that the number of blocks is several tens of times larger than the number of CPU cores. In this case, even when dynamic load distribution is executed by multithreading, and calculation time in the processes P(w) and Q(w) varies, calculation loads of the cores can be equalized and the speed can be efficiently increased by effectively using all the cores.

For example, the number of processes Q(w) executable independently of each other in parallel is equal to the number of text strings w having a length of r?1. When the alphabet size is h, the number of processes Q(w) executable independently of each other in parallel is equal to h(r?1). Thus, even when h=4 like the case of DNA sequence data, r can be selected so that the number of processes Q(w) is several tens of times larger than the number of usable CPU cores.

權(quán)利要求

1
微信群二維碼
意見反饋