www亚洲精品久久久乳,chinese国产xxxx实拍,…久久精品99久久香蕉国产

摘要

The present disclosure relates to a hardware accelerator for executing a computation task composed of a set of operations. The hardware accelerator comprises a controller and a set of computation units. Each computation unit of the set of computation units is configured to receive input data of an operation of the set of operations and to perform the operation, wherein the input data is represented with a distinct bit length associated with each computation unit. The controller is configured to receive the input data represented with a certain bit length of the bit lengths and to select one of the set of computation units that can deliver a valid result and that is associated with a bit length smaller than or equal to the certain bit length.

說明書

1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38

According to one embodiment, the set of computation units comprises a minimum number of computation units such that a logic utilization of the FPGA is higher than a predefined threshold. For the example, the logic utilization of the FPGA may be higher than 90%. Therefore, the silicon area of the FPGA may be used optimally.

According to one embodiment, the number of computation units of the set of computation units is the number of bits of the highest bit length of the set of bit lengths. For example, if the highest bit length is 8-bit, the set of computation units comprises 8 computation units. In one example, the set computation units are associated with bit lengths 8-bit, 7-bit, 6-bit, 5-bit, 4-bit, 3-bit, 2-bit, and 1-bit respectively. In this case, the controller has a step size of one. This may be advantageous as it covers all possible representations. In another example, the controller may have a step size of two such that the set computation units are associated with bit lengths 8-bit, 6-bit, 4-bit and 2-bit. This may be advantageous as it may lower the area occupied by the controller. In one example, the step size of the controller may be defined by a user. In another example, the step size may be determined by profiling the input data, and based on their statistic distributions, the controller's step size may be determined. For example, if 80% of the calculations involve numbers in the form of XXX0000X, then the step size may be defined accordingly.

According to one embodiment, the computation task is one of: training a deep neural network, inference of a deep neural network, matrix-vector multiplication, and matrix-matrix multiplication.

權(quán)利要求

1

白丝美女被狂躁免费视频网站,500av导航大全精品,yw.193.cnc爆乳尤物未满,97se亚洲综合色区,аⅴ天堂中文在线网官网

Hardware accelerator for executing a computation task

摘要

說明書

權(quán)利要求

白丝美女被狂躁免费视频网站,500av导航大全精品,yw.193.cnc爆乳尤物未满,97se亚洲综合色区,аⅴ天堂中文在线网官网

Hardware accelerator for executing a computation task

摘要

說明書

權(quán)利要求

該功能需要專業(yè)版企業(yè)版VIP權(quán)限，您可以：

該功能需要專業(yè)版企業(yè)版VIP權(quán)限，您可以：