白丝美女被狂躁免费视频网站,500av导航大全精品,yw.193.cnc爆乳尤物未满,97se亚洲综合色区,аⅴ天堂中文在线网官网

Hardware accelerator for executing a computation task

專利號(hào)
US11175957B1
公開日期
2021-11-16
申請(qǐng)人
International Business Machines Corporation(US NY Armonk)
發(fā)明人
Dionysios Diamantopoulos; Florian Michael Scheidegger; Adelmo Cristiano Innocenza Malossi; Christoph Hagleitner; Konstantinos Bekas
IPC分類
G06F9/30; G06F9/50; G06F9/38
技術(shù)領(lǐng)域
bit,may,unit,input,be,units,data,operands,tensor,hardware
地域: NY NY Armonk

摘要

The present disclosure relates to a hardware accelerator for executing a computation task composed of a set of operations. The hardware accelerator comprises a controller and a set of computation units. Each computation unit of the set of computation units is configured to receive input data of an operation of the set of operations and to perform the operation, wherein the input data is represented with a distinct bit length associated with each computation unit. The controller is configured to receive the input data represented with a certain bit length of the bit lengths and to select one of the set of computation units that can deliver a valid result and that is associated with a bit length smaller than or equal to the certain bit length.

說明書

A neural network application may be provided in step 601. For example, the architecture and parameters of the neural network may be provided. The parameters may be weights, biases, and types of input/output data of the neural network. The code of the neural network application may be received in step 602. The code may, for example, be in a programming language such as Tensorflow, Keras, pyToch, etc. The received code may be adapted in step 603 in order to be executed by the hardware accelerator. For example, the code may be lowered to an intermediate representation (IR) level that can be executed by the hardware accelerator. Replication units may be created in step 604 from the first computation unit according to different precisions. The parameters of the neural network may be forwarded in step 605 to the replication units so that processing of the application may start at the first computation unit and the replication units. It may be determined in step 606 if the controller has found the replication unit that can deliver the result of the operation. If it is determined in step 606 that the controller has not found the replication unit that can deliver the result of the operation, the result of the first computation unit may be used in step 608. If it is determined in step 606 that the controller has found a replication unit that can deliver the result of the operation the result of the found replication unit may be used in step 607.

權(quán)利要求

1
微信群二維碼
意見反饋