Developed a CUDA version of the FDTD method and achieved a speedup 40x. Implemented on a NVIDIA Quadro FX 3800 GPU, which has 192 SPs, 1GB global memory, and a memory bandwidth of 51.2 GB/s.
This paper presents a simple but effective and efficient approach to improve the accuracy and stability of the least-squares Monte Carlo method. The key idea is to construct an ansatz for the ...
Results that may be inaccessible to you are currently showing.
Hide inaccessible results