A High Performance Parallel FDTD Based on Winsock and Multi-Threading on a PC-Cluster
Keywords:
A High Performance Parallel FDTD Based on Winsock and Multi-Threading on a PC-ClusterAbstract
Parallel technology is a powerful tool to provide the necessary computing power and memory resources for the FDTD method to simulate electrically-large and complex structures. In this paper, a high performance parallel FDTD is developed for multi-core cluster systems. It employs Winsock to achieve efficient inter-process communication as well as multi-threading to make full use of the hardware resources of multi-core processors on a PC-cluster. Key steps for parallel FDTD such as synchronization, data exchange, load balancing, etc., are investigated. An experiment simulating the scattering of an incident electromagnetic wave form of a computer case is presented which shows that the proposed parallel FDTD achieved speedup of 25.1 and parallel efficiency of 83.7% when 10 processors with 30 cores are utilized, and outperforms traditional parallel FDTD based on MPI or MPI-OpenMP, which gained speedup of 22.9, 24.9 and parallel efficiency of 76.3%, 83.1% respectively under the same circumstances.
Downloads
References
K. Yee, “Numerical Solution of Initial
Boundary Value Problems Involving
Maxwell's Equations in Isotropic Media,”
IEEE Transactions on Antennas and
Propagation, vol. 14, no.3, pp. 302-307,
A. Taflove, Computational Electromagnetics:
The Finite-Difference Time-Domain Method,
Artech House, Norwood, 2000.
C. Guiffaut and K. Mahdjoubi, “A Parallel
FDTD Algorithm using the MPI Library,”
IEEE Antennas and Propagation Magazine,
vol. 43, no. 2, pp. 94-103, 2001.
G. A. Schiavone, I. Codreanu, R. Palaniappan,
and P. Wahid, “FDTD Speedups Obtained in
Distributed Computing on a Linux
Workstation Cluster,” Antennas and
Propagation Society International
Symposium, vol. 3, pp. 1336-1339, 2000.
V. Varadarajan and R. Mittra,
“Finite-Difference Time-Domain (FDTD)
Analysis using Distributed Computing,”
IEEE Microwave and Guided Wave Letters,
vol. 4, no.5, pp. 144-145, 1994.
W. Yu, R. Mittra, T. Su, Y. Liu, and X. Yang,
Parallel Finite-Difference Time-Domain
Method, Artech House, 2006.
W. Yu, Y. Liu, T. Su, N.-T. Hunag, and R.
Mittra, “A Robust Parallel Conformal
Finite-Difference Time-Domain Processing
Package using the MPI Library,” IEEE
Antennas and Propagation Magazine, vol. 47,
no. 3, pp. 39-59, 2005.
W. Yu, M. R. Hashemi, R. Mittra, D. N. de
Araujo, M. Cases, N. Pham, E. Matoglu, P.
Patel, and B. Herrman, “Massively Parallel
Conformal FDTD on a BlueGene
Supercomputer,” IEEE Transactions on
Advanced Packaging, vol. 30, no. 2, pp.
-341, 2007.
DUAN, CHEN, HUANG, ZHOU: HIGH PERFORMANCE PARALLEL FDTD BASED ON WINSOCK AND MULTI-THREADING
M. Snir, S. Otto, S. Huss-Lederman, D.
Walker, and J. Dongarra, MPI: The Complete
Reference, The MIT Press, 1996.
W. Gropp, E. Lusk, and A. Skjellum, Using
MPI: Portable Parallel Programming with
the Message-Passing Interface, second
edition, The MIT Press, 1999.
D. Buntinas, G. Mercier, and W. Gropp,
"Implementation and Shared-Memory
Evaluation of MPICH2 over the Nemesis
Communication Subsystem," Proc. of the
th European PVM/MPI Users' Group
Meeting (Euro PVM/MPI 2006), September
D. Buntinas, G. Mercier, and W. Gropp,
“Design and Evaluation of Nemesis, A
Scalable Low-Latency Message-Passing
Communication Subsystem,” Proceedings of
International Symposium on Cluster
Computing and the Grid 2006 (CCGRID ’06),
S. Akhter and J. Roberts, Multi-Core
Programming: Increasing Performance
through Software Multi-threading, Intel Press,
F. Cappello and D. Etiemble, “MPI Versus
MPI+OpenMP on the IBM SP for the NAS
Benchmarks,” Supercomputing ACM/IEEE
Conference, 2000.
M. F. Su, I. El-Kady, D. A. Bader, and S.-Y.
Lin, “A Novel FDTD Application Featuring
OpenMP-MPI Hybrid Parallelization,”
Parallel Processing, 2004 International
Conference, pp. 373-379, 2004.
R. Rosenberg, G. Norton, J. C. Novarini, W.
Aderson, and M. Lanzagorta, “Modeling
Pulse Propagation and Scattering in a
Dispersive Medium: Performance of
MPI/OpenMP Hybrid Code,” SC 2006
Conference, Proceeding of the 2006
ACM/IEEE, pp. 47-47, 2006.
B. Chapman, G. Jost, and R. Van Der Pas,
Using OpenMP: Portable Shared Memory
Parallel Programming, The MIT Press, 2008.
R. Chandra, L. Dagum, D. Kohr, D. Maydan,
J. McDonald, and R. Menon, Parallel
Programming in OpenMP, Academic Press,
A. Rane and D. Stanzione, “Experiences in
Tuning Performance of Hybrid MPI/OpenMP
Applications on Quad-Core Systems,” Proc.
of 10th LCI Int’l Conference on
High-Performance Cl ustered Computing,
J. Berenger, “A Perfectly Matched Layer
Medium for the Absorption of
Electromagnetic Waves,” J. Comput., vol.
, 1994, pp. 185-200.
T. Rauber and G. Rünger, Parallel
Programming for Multicore and Cluster
Systems, Springer, 2010.
A. Jones and J. Ohlund, Network
Programming for Microsoft Windows,
Microsoft Press, 2002.
B. Quinn and D. Shute, Windows Sockets
Network Programming, Addison-Wesley
Professional, 2009.
J. Watts and S. Taylor, “A Practical Approach
to Dynamic Load Balancing,” IEEE
Transactions on Parallel and Distributed
Systems, vol. 9, no. 3, pp. 235-248, 1998


