GPU Based TLM Algorithms in CUDA and OpenCL
Abstract
Recent advances in graphics computing technology have brought highly parallel processing power to desktop computers. As multi-core, multi-processor computing technology matures, a new front in parallel computing based on graphics processing units has emerged. This paper reports a highly parallel symmetrical condensed node TLM procedure for NVIDIA graphics processing units. The algorithm has been tested on three NVIDIA processors, ranging from a low-end laptop graphics card to high-end workstation graphics processors.
ACES JOURNAL, VOL. 25, NO. 4, APRIL 2010