白丝美女被狂躁免费视频网站,500av导航大全精品,yw.193.cnc爆乳尤物未满,97se亚洲综合色区,аⅴ天堂中文在线网官网

Hardware accelerator for executing a computation task

專利號
US11175957B1
公開日期
2021-11-16
申請人
International Business Machines Corporation(US NY Armonk)
發(fā)明人
Dionysios Diamantopoulos; Florian Michael Scheidegger; Adelmo Cristiano Innocenza Malossi; Christoph Hagleitner; Konstantinos Bekas
IPC分類
G06F9/30; G06F9/50; G06F9/38
技術(shù)領(lǐng)域
bit,may,unit,input,be,units,data,operands,tensor,hardware
地域: NY NY Armonk

摘要

The present disclosure relates to a hardware accelerator for executing a computation task composed of a set of operations. The hardware accelerator comprises a controller and a set of computation units. Each computation unit of the set of computation units is configured to receive input data of an operation of the set of operations and to perform the operation, wherein the input data is represented with a distinct bit length associated with each computation unit. The controller is configured to receive the input data represented with a certain bit length of the bit lengths and to select one of the set of computation units that can deliver a valid result and that is associated with a bit length smaller than or equal to the certain bit length.

說明書

According to one embodiment, the set of computation units comprise a first computation unit associated with the highest bit length of the set of bit lengths, wherein each computation unit of the set of computation units, which is different from the first computation unit, is a replication unit of the first computation unit. Computation unit replication refers to determining one or more copies of the first computation unit. The replication unit is a copy of the first computation unit.

The first computation unit may be associated with the bit length n-bit. Each computation unit of the remaining N?1 computation units may be associated with a distinct bit length n?j-bit where j has a value varying from 1 to n?1.

According to one embodiment, the highest bit length of the set of bit lengths is n-bit, wherein each computation unit of the set of computation units that is associated with a bit length k-bit smaller than n-bit is configured to read the k least significant bits (LSB) of the received input data. This embodiment may enable to provide inputs of each of the computation units from a same loaded data. That is, instead of converting all parameter values from a high precision to low precision, the same input data may be loaded and further used as input to the set of computation units. This may enable a speculation of precision by speculating that the ignored (unread) MSB bits are 0.

According to one embodiment, the hardware accelerator comprises a FPGA, a GPU, an ASIC, a neuromorphic device, or a bit-addressable device.

權(quán)利要求

1
微信群二維碼
意見反饋