白丝美女被狂躁免费视频网站,500av导航大全精品,yw.193.cnc爆乳尤物未满,97se亚洲综合色区,аⅴ天堂中文在线网官网

Hardware accelerator for executing a computation task

專利號(hào)
US11175957B1
公開日期
2021-11-16
申請(qǐng)人
International Business Machines Corporation(US NY Armonk)
發(fā)明人
Dionysios Diamantopoulos; Florian Michael Scheidegger; Adelmo Cristiano Innocenza Malossi; Christoph Hagleitner; Konstantinos Bekas
IPC分類
G06F9/30; G06F9/50; G06F9/38
技術(shù)領(lǐng)域
bit,may,unit,input,be,units,data,operands,tensor,hardware
地域: NY NY Armonk

摘要

The present disclosure relates to a hardware accelerator for executing a computation task composed of a set of operations. The hardware accelerator comprises a controller and a set of computation units. Each computation unit of the set of computation units is configured to receive input data of an operation of the set of operations and to perform the operation, wherein the input data is represented with a distinct bit length associated with each computation unit. The controller is configured to receive the input data represented with a certain bit length of the bit lengths and to select one of the set of computation units that can deliver a valid result and that is associated with a bit length smaller than or equal to the certain bit length.

說(shuō)明書

The input data may, for example, comprise operands of the data operation. In case the data operation is a multiplication operation, the operands of the input data may be two operands. The controller may be configured to select the computation unit by, for example, calculating the maximum number of leading zeros that is present in both operands of the input data. The calculated number of leading zeros is indicative of a highest bit length that can be used to perform the operation faster without losing in accuracy. For example, if the number of calculated leading zeros is L and the highest bit length that is processed by the controller is n-bit, the selected computation unit may be associated with a bit length k-bit, wherein k≥n?L and k≤n. This may be advantageous as it may be implemented using a simplistic design of comparators (e.g., only logic gates may be used in the controller). Thus, the area and latency overhead of the controller may be small. The controller may be able to quickly decide which of the computation units can deliver valid results without overfloats.

According to one embodiment, the controller is configured to stop execution of the operation by the non-selected computation unit(s). The controller may be configured to generate control signals or commands to the set of computation units and other units of the hardware accelerator. For example, the set of computation units comprises N computation units, where N≥2. The controller is configured to stop execution of the operation by the non-selected N?1 computation units. For example, as soon as the controller makes a decision, the results of the selected computation unit are forwarded, and all the rest speculated execution may be forced to cancel the calculation.

權(quán)利要求

1
微信群二維碼
意見反饋