Ray tracing needs re-accessing to too many geometry data so a device should have its own memory(such as with Nvidia or Amd), to let CPU do its work. 768 Gflops is more than a low-end discrete gpu such as R7-240 of AMD.(As of, AMD's low-end is RX550 with 1200 GFlops, faster than Intel's Iris Plus 650 which is nearly 900 GFlops). This means a maximum of 48*8*2(1 add+1multiply)*1G = 768 Giga floating point operations per second but only if each ALU is capable of concurrently doing 1 addition and 1 multiplication. Your integrated gpu has 48 execution units each having 8 ALU units that can do add,multiply and many more operations. Scratch-a-pixel-raytracing-tutorial ( I read it then wrote its teraflops gpu version) Some overview of hardware, benchmark and parallel programming subjects But there are some sites:Īmd's parallel programming guide for opencl Maybe even Raspberry pi-x can support in future.ĭocumentation for opencl in is under development. Amd, Nvidia, Intel, Xilinx, Altera, Qualcomm, MediaTek, Marvell, Texas Instruments. Opencl works everywhere as long as both hardware and os supports. Cuda works only on nvidia hardware but there may be some libraries converting it to run on cpu cores(not igpu).ĪMD is working on "hipify"ing old cuda kernels to translate them to opencl or similar codes so they can become more general.
0 Comments
Leave a Reply. |