全部 标题 作者
关键词 摘要

OALib Journal期刊
ISSN: 2333-9721
费用:99美元

查看量下载量

相关文章

更多...

FDTD Acceleration for Cylindrical Resonator Design Based on the Hybrid of Single and Double Precision Floating-Point Computation

DOI: 10.1155/2014/634269

Full-Text   Cite this paper   Add to My Lib

Abstract:

Acceleration of FDTD (finite-difference time-domain) is very important for the fields such as computational electromagnetic simulation. We consider the FDTD simulation model of cylindrical resonator design that requires double precision floating-point and cannot be done using single precision. Conventional FDTD acceleration methods have a common problem of memory-bandwidth limitation due to the large amount of parallel data access. To overcome this problem, we propose a hybrid of single and double precision floating-point computation method that reduces the data-transfer amount. We analyze the characteristics of the FDTD simulation to find out when we can use single precision instead of double precision. According to the experimental results, we achieved over 15 times of speed-up compared to the CPU single-core implementation and over 1.52 times of speed-up compared to the conventional GPU-based implementation. 1. Introduction Computational electromagnetic simulation shows a rapid development recently due to the introduction of processors that have parallel processing capability such as multicore CPUs and GPUs (graphic processing units). The FDTD (finite-difference time-domain) algorithm [1, 2] is one of the most popular methods of computational electromagnetic simulations due to its simplicity and very high computational efficiency. It has been widely used in many applications such as coil modeling [3] and resonance characteristics analysis of a cylindrical cavity [4, 5]. Many of these applications require double precision floating-point computation to satisfy the stability condition [6]. The FDTD simulation requires a large amount of data. When more processor cores are used in parallel, more data transfers occur between the memory and the processor cores. Therefore, the memory-bandwidth limitation is a major problem in the FDTD simulation using computers. To overcome this problem, we have to reduce the data transfer amount, so that we can use more cores in parallel. To do this, we propose a hybrid precision computation method that uses both single and double precision. Single precision data use 4 bytes compared to 8 bytes used in double precision data. Therefore, using single precision reduces the data amount and increases the processing speed. However, using single precision could bring inaccurate results. In some cases, the FDTD simulation does not converge when it is executed for a large number of iterations. In this paper, we consider the FDTD simulation of a cylindrical resonator [5]. It is one of the most fundamental types of resonant cavities

References

[1]  H. S. Yee, “Numerical solution of initial boundary value problems involving Maxwell's equations in isotropic media,” IEEE Transactions on Antennas and Propagation, vol. 14, no. 3, pp. 302–307, 1966.
[2]  A. Taflove and S. C. Hagness, Advances in Computational Electrodynamics: The Finite-Difference Time-Domain Method, Artech House, ', 3rd edition, 2005.
[3]  T. S. Ibrahim, R. Lee, B. A. Baertlein, Y. Yu, and P.-M. L. Robitaille, “Computational analysis of the high pass birdcage resonator: finite difference time domain simulations for high-field MRI,” Magnetic Resonance Imaging, vol. 18, no. 7, pp. 835–843, 2000.
[4]  E. Kuramochi, H. Taniyama, T. Tanabe, K. Kawasaki, Y.-G. Roh, and M. Notomi, “Ultrahigh-Q one-dimensional photonic crystal nanocavities with modulated mode-gap barriers on SiO2 claddings and on air claddings,” Optics Express, vol. 18, no. 15, pp. 15859–15869, 2010.
[5]  Y. Ohtera, S. Iijima, and H. Yamada, “Cylindrical resonator utilizing a curved resonant grating as a cavity wall,” Micromachines, vol. 3, no. 1, pp. 101–113, 2012.
[6]  A. Taflove and M. E. Brodwin, “Numerical solution of steady-state electromagnetic scattering problems using the time-dependent maxwell's equations,” IEEE Transactions on Microwave Theory and Techniques, vol. 23, no. 8, pp. 623–630, 1975.
[7]  R. E. Collin, “Electromagnetic resonators,” in Foundations for Microwave Engineering, pp. 496–517, Wiley, Hoboken, NJ, USA, 2nd edition, 2001.
[8]  S. B. Cohn, “Microwave bandpass filters containing high-Q dielectric resonators',” IEEE Transactions on Microwave Theory and Techniques, vol. MTT-16, no. 4, pp. 218–227, 1968.
[9]  N. G. Alexopoulos and R. K. Stephen, “Coupled power theorem and orthogonality relations for optical disk waveguides,” Journal of the Optical Society of America, vol. 67, no. 12, pp. 1634–1638, 1977.
[10]  K. J. Vahala, “Optical microcavities,” Nature, vol. 424, no. 6950, pp. 839–846, 2003.
[11]  V. S. Ilchenko and A. B. Matsko, “Optical resonators with whispering-gallery modes—part II: applications,” IEEE Journal on Selected Topics in Quantum Electronics, vol. 12, no. 1, pp. 15–32, 2006.
[12]  B. Zhang, Z.-H. Xue, W. Ren, W.-M. Li, and X.-Q. Sheng, “Accelerating FDTD algorithm using GPU computing,” in Proceedings of the 2nd IEEE International Conference on Microwave Technology and Computational Electromagnetics (ICMTCE '11), pp. 410–413, May 2011.
[13]  T. Nagaoka and S. Watanabe, “A GPU-based calculation using the three-dimensional FDTD method for electromagnetic field analysis,” in Proceedings of the 32nd Annual International Conference of the IEEE Engineering in Medicine and Biology Society (EMBC '10), pp. 327–330, Buenos Aires, Argentina, September 2010.
[14]  P. Micikevicius, “3D finite difference computation on GPUs using CUDA,” in Proceedings of the 2nd Workshop on General Purpose Processing on Graphics Processing Units, pp. 79–84, March 2009.
[15]  T. Nagaoka and S. Watanabe, “Multi-GPU accelerated three-dimensional FDTD method for electromagnetic simulation,” in Proceedings of the 33rd Annual International Conference of the IEEE Engineering in Medicine and Biology Society (EMBS '11), pp. 401–404, Boston, Mass, USA, September 2011.
[16]  L. Mattes and S. Kofuji, “Overcoming the GPU memory limitation on FDTD through the use of overlapping subgrids,” in Proceedings of the International Conference on Microwave and Millimeter Wave Technology (ICMMT '10), pp. 1536–1539, May 2010.
[17]  N. Kawada, K. Okubo, and N. Tagawa, “Multi-GPU numerical simulation of electromagnetic field with high-speed visualization using CUDA and OpenGL,” IEICE Transactions on Communications, vol. 95, no. 2, pp. 375–380, 2012.
[18]  “NVIDIA CUDA C Programming Guide Ver.4.1,” 2011.
[19]  http://www.nvidia.com/docs/IO/43395/NV-DS-Tesla-C2075.pdf.
[20]  Y. Inoue, T. Sekine, and H. Asai, “Fast transient simulation of plane distribution network by using GPGPU-LIM,” IEICE Transactions on Electronics, vol. 93, no. 11, pp. 406–413, 2010 (Japanese).
[21]  J. Nickolls, I. Buck, M. Garland, and K. Skadron, “Scalable parallel programming with CUDA,” Queue—GPU Computing, vol. 6, no. 2, pp. 40–53, 2008.

Full-Text

Contact Us

service@oalib.com

QQ:3279437679

WhatsApp +8615387084133