白丝美女被狂躁免费视频网站,500av导航大全精品,yw.193.cnc爆乳尤物未满,97se亚洲综合色区,аⅴ天堂中文在线网官网

Hardware accelerator for executing a computation task

專利號
US11175957B1
公開日期
2021-11-16
申請人
International Business Machines Corporation(US NY Armonk)
發(fā)明人
Dionysios Diamantopoulos; Florian Michael Scheidegger; Adelmo Cristiano Innocenza Malossi; Christoph Hagleitner; Konstantinos Bekas
IPC分類
G06F9/30; G06F9/50; G06F9/38
技術(shù)領(lǐng)域
bit,may,unit,input,be,units,data,operands,tensor,hardware
地域: NY NY Armonk

摘要

The present disclosure relates to a hardware accelerator for executing a computation task composed of a set of operations. The hardware accelerator comprises a controller and a set of computation units. Each computation unit of the set of computation units is configured to receive input data of an operation of the set of operations and to perform the operation, wherein the input data is represented with a distinct bit length associated with each computation unit. The controller is configured to receive the input data represented with a certain bit length of the bit lengths and to select one of the set of computation units that can deliver a valid result and that is associated with a bit length smaller than or equal to the certain bit length.

說明書

The hardware acceleration system 100 may be configured to implement an application such as an inference of a trained neural network, e.g., the hardware acceleration system 100 may be a FPGA based neural network accelerator system.

The host computer 101 and the hardware accelerator 102 are adapted to communicate data. This data communication may be done through a connection such as a PCIe bus or Ethernet connection. In another example, the hardware accelerator 102 may be part of the host computer 101. The hardware accelerator 102 and the host processor 103 may share the same package or the same die. In this case, the communication link between the hardware accelerator 102 and the host processor 103 may be any of the commonly used in-package or on-chip communication buses, e.g., AXI, Wishbone, etc. The hardware accelerator 102 may read input data from a global memory and perform the computation. The input data may be received via a network interface as a stream of network phits (the network interface may stream in and out fixed size data). The outputs may be written back to the global memory and may be sent as a stream of network phits via the network interface.

權(quán)利要求

1
微信群二維碼
意見反饋