GPU Based TLM Algorithms in CUDA and OpenCL

Filippo  Rossi; Colter  McQuay; Poman  So

Authors

Filippo Rossi Computational Electromagnetics Research Laboratory Department of Electrical and Computer Engineering University of Victoria, Victoria, BC, V8W 3P6, Canada
Colter McQuay Computational Electromagnetics Research Laboratory Department of Electrical and Computer Engineering University of Victoria, Victoria, BC, V8W 3P6, Canada
Poman So Computational Electromagnetics Research Laboratory Department of Electrical and Computer Engineering University of Victoria, Victoria, BC, V8W 3P6, Canada

Keywords:

GPU Based TLM Algorithms in CUDA and OpenCL

Abstract

Recent advancements in graphics computing technology has brought highly parallel processing power to desktop computers. As multi-core multi-processor computing technology becomes mature, a new front in parallel computing technology based on graphics processing units has emerged. This paper reports a highly parallel symmetrical condensed node TLM procedure for the NVIDIA graphics processing units. The algorithm has been tested on three NVIDIA processors, from low-end laptop graphics card to highend workstation graphics processors.

Downloads

Download data is not yet available.

References

M. Macedonia, "The GPU Enters Computing's

Mainstream", IEEE Computer , vol. 36, no. 10, pp.

–108, October 2003.

G. Shen, G. P. Gao, S. Li, H. Y. Shum and

Y. Q. Zhang, "Accelerating Video Decoding Using

GPU", IEEE Transactions on Circuits and Systems

for Video Technology , vol. 15, no. 5, pp. 685–693,

May 2005.

J. Y. Hong and M. D. Wang, "High speed

processing of biomedical images using

programmable GPU", International Conference on

Image Processing, vol. 4, pp. 2455–2458, October

Y. Heng and L. Gu, "GPU-based Volume

Rendering for Medical Image Visualization", 27th

Annual International Conference on Engineering in

Medicine and Biology, pp. 5145–5148, 2005.

O. Fialka and M. Cadik, "FFT and Convolution

Performance in Image Filtering on GPU", IEEE

Proceedings of the Information Visualization ,

pp. 609–614, July 2006.

J. S. Meredith, S. R. Alam and J. S. Vetter,

"Analysis of a Computational Biology Simulation

Technique on Emerging Processing Architectures",

IEEE International Symposium on Parallel and

Distributed Processing, pp. 1–8, March 2007.

S. E. Krakiwsky, L. E. Turner and M.

M. Okoniewski, "Graphics Processor Unit

Acceleration of Finite-Difference Time-Domain

Algorithm", Proceedings of IEEE International

Symposium on Circuits and Systems , vol. 5,

pp. V265 – V268, May 2004.

M. J. Inman, and A. Z. Elsherbeni, “Programming

video cards for computational electromagnetics

applications”, IEEE Antennas and Propagation

Magazine, vol. 47, no. 6, pp. 71–78, December

H. Takizawa, N. Yamada, S. Sakai, and

H. Kobayashi, "Radiative Heat Transfer Simulation

Using Programmable Graphics Hardware", 5th

IEEE/ACIS International Conference on Computer

and Information Science, pp. 29–37, July 2006.

Z. Luo, H. Liu, and X. Wu, "Artificial Neural

Network Computation on Graphic Process Unit",

Proceedings of IEEE International Joint

Conference on Neural Networks , vol. 1, pp. 622–

, August 2005.

S. Harding, W. Banzhaf, "Fast Genetic

Programming and Artificial Developmental

Systems on GPUs", 21st International Symposium

on High Performance Computing Systems and

Applications, p. 2, May 2007.

F. Zhe, Q. Feng, A. Kaufman and S. Yoakum-

Stover, "GPU Cluster for High Performance

Computing", Proceedings of the ACM/IEEE

Conference on Supercomputing, pp. 47, 2004.

ACES JOURNAL, VOL. 25, NO. 4, APRIL 2010

folding.stanford.edu/FAQ-ATI.html

F. V. Rossi, “Massively Parallel Two-Dimensional

TLM Algorithm on Graphics Processing Units,”

IEEE International Microwave Symposium, June

F. Rossi and P. P. M. So, “Parallelized three-

dimensional TLM algorithms on a graphics

processing unit”, 25th International Review of

Progress in Applied Computational

Electromagnetics Symposium, pp. 110–114, March

W. J. R. Hoefer, “The Transmission-Line Matrix

Method – Theory and Applications”, IEEE

Transactions on Microwave Theory and

Techniques, vol. MTT-33. No. 10, pp.882-893,

October 1995.

P. B. Johns, “A symmetrical condensed node for the

TLM method,” IEEE Transactions on Microwave

Theory and Technique, vol-35, no. 4, pp. 370–377,

April 1987.

http://www.nvidia.com/object/cuda_home_new.htm

l, April 2010.

http://www.khronos.org/opencl/

http://www.nvidia.com

ClusterInABox Quad (Q30) Product Info,

http://www.acceleware.com/default/index.cfm/our-

products/clusterinabox-quad, November 2008.

http://OpenMP.org/wp/

V. Trenkic, C. Christopoulos, and T. M. Benson,

“Development of a general symmetrical condensed

node for the TLM method”, IEEE Trans. on

Microwave Theory and Techniques, vol. MTT-44,

no. 12, pp. 2129–2135, December 1996.

V. Trenkic, C. Christopoulos, and T. M. Benson,

“Advanced node formulations in TLM — the

adaptable symmetrical condensed node”, IEEE

Trans. on Microwave Theory and Techniques, vol.

MTT-44, no. 12, pp. 2473–2478, December 1996.

GPU Based TLM Algorithms in CUDA and OpenCL

Authors

Keywords:

Abstract

Downloads

References

Downloads

Published

How to Cite

Issue

Section

ImpactScore

issn

coverimage

CallForPaper

index

archivesite

Language

Information