The diagram of FIG. 5 shows the data paths associated with the components involved in the computation of the data operation. The data path 501 indicates the time at which the input operands are ready in the DMA loads 321 and 322. The data operation starts on the 8-bit GEMM and the 3-bit GEMM at the same time t0, as indicated by data paths 502 and 504 respectively. The data paths 503 and 505 indicate the times at which the results of the 8-bit GEMM and the 3-bit GEMM are ready, respectively. The data path 506 indicates that the selector 317 selects the 3-bit GEMM at time t1, before either the 8-bit GEMM result or the 3-bit GEMM result was ready. Thus, the controller 315 may be ready at the same time the selected 3-bit GEMM result is ready. This is indicated in data path 507. The result of the data operation may be provided at time t2 as the output of the controller 315, as shown in data path 508. As shown in FIG. 5, the present invention may enable a time gain equal to the difference between time t2 and the time t3 at which the 8-bit GEMM result was ready.
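The timing behavior described above can be illustrated with a minimal software simulation. The sketch below is purely hypothetical: the latencies, the quantization scheme, and the function names are assumptions used to model the idea of launching the full-precision and reduced-precision GEMM units at the same time t0 and having the selector forward the chosen (here, 3-bit) result without waiting for the slower unit to finish.

```python
import concurrent.futures
import time

def gemm(a, b, bits, latency_s):
    """Toy stand-in for a GEMM unit: quantizes the operands to `bits` bits
    and computes a dot product; sleep models the unit's (hypothetical) latency."""
    time.sleep(latency_s)
    scale = (1 << (bits - 1)) - 1
    qa = [round(x * scale) / scale for x in a]
    qb = [round(x * scale) / scale for x in b]
    return sum(x * y for x, y in zip(qa, qb))

def run_operation(a, b, select_low_precision=True):
    """Launch both GEMMs at the same time t0; the selector's choice determines
    which result the controller waits for, so selecting the faster 3-bit unit
    yields the output at t2 instead of the later t3 of the 8-bit unit."""
    with concurrent.futures.ThreadPoolExecutor(max_workers=2) as pool:
        t0 = time.monotonic()
        f8 = pool.submit(gemm, a, b, 8, 0.05)  # slower, full precision
        f3 = pool.submit(gemm, a, b, 3, 0.01)  # faster, reduced precision
        chosen = f3 if select_low_precision else f8
        result = chosen.result()  # controller is ready when the selected result is
        elapsed = time.monotonic() - t0
    return result, elapsed
```

Running `run_operation` once with the 3-bit selection and once with the 8-bit selection shows the elapsed time difference corresponding to the gain between t3 and t2.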
FIG. 6 is a flowchart of a method for performing a computation task using a hardware accelerator in accordance with an embodiment of the present invention. The computation task may be an inference of a neural network. The hardware accelerator may be an FPGA tensor accelerator. The hardware accelerator may comprise a first computation unit that is configured to perform operations of the computation task at full precision, e.g., 8-bit precision. For the purpose of explanation, the method may be implemented in the hardware acceleration system 100 illustrated in FIGS. 1-2, but is not limited to this implementation.