A Stacking Scheme to Improve the Efficiency of Finite-Difference Time-Domain Solutions on Graphics Processing Units
关键词:
A Stacking Scheme to Improve the Efficiency of Finite-Difference Time-Domain Solutions on Graphics Processing Units摘要
Advances in computer hardware technologies accompanied by easy-to-use parallel programming software platforms have led to the wide spread use of parallel processing architectures, such as multi-core central processor units (CPUs) and graphic processing units (GPUs), in technical and scientific computing. Among electromagnetic numerical analysis methods, the finite-difference time-domain (FDTD) method is very well suited for parallel programming, and several implementations of FDTD have been developed and reported to solve electromagnetics problems orders of magnitude faster. Examination of performances of these implementations reveals that, in general, it is more efficient to solve larger FDTD domains than smaller domains. In this paper it is demonstrated that one can exploit the higher efficiency inherent to the solution of larger problem sizes to solve parameter sweep and optimization problems faster: instead of solving multiple smaller FDTD domains separately, these domains can be combined or stacked to form a larger problem and the large problem can be solved more efficiently. It has been shown that up to 40% faster solution can be achieved on GPUs with this method. Index Terms—FDTD methods,
##plugins.generic.usageStats.downloads##
参考
K. S. Yee, “Numerical Solution of Initial
Boundary Value Problems Involving
Maxwell's Equations in Isotropic Media,”
IEEE Transactions on Antennas and
Propagation, vol. 14, pp. 302–307, May
A. Taflove and S. C. Hagness, Computational
Electrodynamics: The Finite-Difference
Time-Domain Method, 3 rd edition, Artech
House, 2005.
A. Elsherbeni and V . Demir, The Finite
Difference Time Domain Method for
Electromagnetics: With MATLAB
Simulations, SciTech Publishing, 2009.
S. E. Krakiwsky, L. E. Turner, and M. M.
Okoniewski, “Graphics Processor Unit (GPU)
Acceleration of Finite-Difference Time-
Domain (FDTD) Algorithm,” Proc. 2004
International Symposium on Circuits and
Systems, vol. 5, pp. V-265–V-268, May 2004.
S. E. Krakiwsky, L. E. Turner, and M. M.
Okoniewski, “Acceleration of Finite-
Difference Time-Domain (FDTD) Using
Graphics Processor Units (GPU),” 2004 IEEE
MTT-S International Microwave Symposium
Digest, vol. 2, pp. 1033–1036, Jun. 2004.
R. Schneider, S. Krakiwsky, L. Turner, and
M. Okoniewski, “Advances in Hardware
Acceleration for FDTD,” Chapter 20 in
Computational Electrodynamics: The Finite-
Difference Time-Domain Method, 3 rd edition,
Artech House, 2005.
S. Adams, J. Payne, and R. Boppana, “Finite
Difference Time Domain (FDTD)
Simulations Using Graphics Processors,”
Proceedings of the 2007 DoD High
Performance Computing Modernization
Program Users Group (HPCMP) Conference,
pp. 334–338, 2007.
M. J. Inman, A. Z. Elsherbeni, and C. E.
Smith, “GPU Programming for FDTD
Calculations,” The Applied Computational
ACES JOURNAL, VOL. 25, NO. 4, APRIL 2010
Electromagnetics Society (ACES) Conference,
M. J. Inman and A. Z. Elsherbeni,
“Programming Video Cards for
Computational Electromagnetics
Applications,” IEEE Antennas and
Propagation Magazine, vol. 47, no. 6, pp. 71–
, December 2005.
M. J. Inman and A. Z. Elsherbeni,
“Acceleration of Field Computations Using
Graphical Processing Units,” The Twelfth
Biennial IEEE Conference on
Electromagnetic Field Computation CEFC
, April 30 - May 3, 2006.
M. J. Inman, A. Z. Elsherbeni, J. G. Maloney,
and B. N. Baker, “Practical Implementation of
a CPML Absorbing Boundary for GPU
Accelerated FDTD Technique,” The 23rd
Annual Review of Progress in Applied
Computational Electromagnetics Society, 19-
March 2007.
M. Inman, A. Elsherbeni, J. Maloney, and B.
Baker, “Practical Implementation of a CPML
Absorbing Boundary for GPU Accelerated
FDTD Technique,” Applied Computational
Electromagnetics Society Journal, vol. 23, no.
, pp. 16–22, 2008.
M. J. Inman and A. Z. Elsherbeni,
“Optimization and parameter exploration
using GPU based FDTD solvers,” IEEE MTT-
S International Microwave Symposium
Digest, pp. 149–152, June 2008.
M. J. Inman, A. Elsherbeni, and V. Demir,
“Graphics Processing Unit Acceleration of
Finite Difference Time Domain”, Chapter 12
in The Finite Difference Time Domain
Method for Electromagnetics (with MATLAB
Simulations), SciTech Publishing, 2009.
N. Takada, N. Masuda, T. Tanaka, Y. Abe,
and T. Ito, “A GPU Implementation of the 2-
D Finite-Difference Time-Domain Code
Using High Level Shader Language,” Applied
Computational Electromagnetics Society
Journal, vol. 23, no. 4, pp. 309–316, 2008.
A. Valcarce, G. de la Roche, and J. Zhang, “A
GPU Approach to FDTD for Radio Coverage
Prediction,” Proceedings of the 11 th IEEE
Singapore International Conference on
Communication Systems (ICCS '08), pp.
–1590, November 2008.
P. Sypek and M. Michal, “Optimization of an
FDTD Code for Graphical Processing Units,”
th International Conference on Microwaves,
Radar and Wireless Communications,
MIKON 2008, pp. 1–3, 19-21 May 2008.
P. Sypek, A. Dziekonski, and M. Mrozowski,
“How to Render FDTD Computations More
Effective Using a Graphics Accelerator,”
IEEE Transactions on Magnetics, vol. 45, no.
, pp. 1324–1327, 2009.
N. Takada, T. Shimobaba, N. Masuda, and T.
Ito, “High-speed FDTD Simulation Algorithm
for GPU with Compute Unified Device
Architecture,” IEEE International Symposium
on Antennas & Propagation & USNC/URSI
National Radio Science Meeting, p. 4, 2009.
A. Valcarce, G. De La Roche, A. Jüttner, D.
López-Pérez, and J. Zhang,“Applying FDTD
to the coverage prediction of WiMAX
femtocells,” EURASIP Journal on Wireless
Communications and Networking, February
D. K. Price, J. R. Humphrey, and E. J.
Kelmelis, “GPU-based Accelerated 2D and
D FDTD Solvers,” in Physics and
Simulation of Optoelectronic Devices XV,
Proceedings of SPIE, vol. 6468, 2007.
D. K. Price, J. R. Humphrey, and E. J.
Kelmelis, “Accelerated Simulators for Nano-
Photonic Devices,” International Conference
on Numerical Simulation of Optoelectronic
Devices 2007, NUSOD '07, pp. 103–104,
September 2007.
A. Balevic, L. Rockstroh, A. Tausendfreund,
S. Patzelt, G. Goch, and S. Simon,
“Accelerating Simulations of Light Scattering
Based on Finite-Difference Time-Domain
Method with General Purpose GPUs,”
Proceedings of the 2008 11 th IEEE
International Conference on Computational
Science and Engineering, pp. 327–334, 2008.
C. Ong, M. Weldon, D. Cyca, and M.
Okoniewski, “Acceleration of Large-Scale
FDTD Simulations on High Performance
GPU Clusters,” 2009 IEEE International
Symposium on Antennas & Propagation &
USNC/URSI National Radio Science Meeting,
NVIDIA CUDA ZONE:
www.nvidia.com/object/cuda_home.html.
DEMIR: STACKING SCHEME TO IMPROVE EFFICIENCY OF FDTD SOLUTIONS ON GPUS
Acceleware: www.acceleware.com.
CUDA_Getting_Started_2.3_Windows.pdf:
http://www.nvidia.com/object/cuda_develop.
html.
http://en.wikipedia.org/wiki/CUDA.
V. Demir and A . Z. Elsherbeni, “Compute
Unified Device Architecture (CUDA) Based
Finite-Difference Time-Domain (FDTD)
Implementation,” Journal of the Applied
Computational Electromagnetics Society
(ACES), vol. 25, no. 4, 2010