白丝美女被狂躁免费视频网站,500av导航大全精品,yw.193.cnc爆乳尤物未满,97se亚洲综合色区,аⅴ天堂中文在线网官网

Hardware accelerator for executing a computation task

專利號(hào)
US11175957B1
公開(kāi)日期
2021-11-16
申請(qǐng)人
International Business Machines Corporation(US NY Armonk)
發(fā)明人
Dionysios Diamantopoulos; Florian Michael Scheidegger; Adelmo Cristiano Innocenza Malossi; Christoph Hagleitner; Konstantinos Bekas
IPC分類
G06F9/30; G06F9/50; G06F9/38
技術(shù)領(lǐng)域
bit,may,unit,input,be,units,data,operands,tensor,hardware
地域: NY NY Armonk

摘要

The present disclosure relates to a hardware accelerator for executing a computation task composed of a set of operations. The hardware accelerator comprises a controller and a set of computation units. Each computation unit of the set of computation units is configured to receive input data of an operation of the set of operations and to perform the operation, wherein the input data is represented with a distinct bit length associated with each computation unit. The controller is configured to receive the input data represented with a certain bit length of the bit lengths and to select one of the set of computation units that can deliver a valid result and that is associated with a bit length smaller than or equal to the certain bit length.

說(shuō)明書

Each of the computation units 310.1 through 310.8 is configured to perform an operation (e.g., multiplication) of the two operands stored in the pair of input registers to which the computation unit is connected and to output the result of the operation into an output register to which the computation unit is connected. The computation unit 310.8 is connected to the 16-bit output register (named tensor register 16-bit). The computation unit 310.7 is connected to the 14-bit output register (named tensor register 14-bit). The computation unit 310.6 is connected to the 12-bit output register (named tensor register 12-bit). The computation unit 310.5 is connected to the 10-bit output register (named tensor register 10-bit). The computation unit 310.4 is connected to the 8-bit output register (named tensor register 8-bit). The computation unit 310.3 is connected to the 6-bit output register (named tensor register 6-bit). The computation unit 310.2 is connected to the 4-bit output register (named tensor register 4-bit). The computation unit 310.1 is connected to the 1-bit output register (named tensor register 1-bit).

The hardware accelerator 302 further comprises a controller 315 and a selector 317. The controller, also called the speculative precision controller, 315 is connected to the set of computation units 310.1 through 310.N and to the selector 317. The selector 317 is configured to receive data of the output registers. Although shown as separate components, the controller 315 and the selector 317 may, in another example, form an integral part.

權(quán)利要求

1
微信群二維碼
意見(jiàn)反饋