SPARSE CANONICAL CORRELATION ANALYSIS FOR MOBILE MEDIA RECOGNITION ON THE CLOUD
Keywords:
Canonical correlation analysis (CCA), sparse canonical correlation analysis (SCCA), power method, mobile mediaAbstract
With the rapid development of the Internet technology and smartphone, people can easily capture and upload media information including text, audio, photos, and video. And then it becomes one critical demand to effectively and efficiently manage these personal multimedia that are often presented in multiple modalities. Canonical correlation analysis (CCA) has been widely employed for multi-modal data in many applications because of its promising performance in feature extraction and subspace learning for multivariate vectors. However, the traditional CCA may be difficult to interpret especially when the original variables are expected to involve only a few components. In this paper, we develop a mobile media recognition method on the cloud. Particularly, we propose sparse canonical correlation analysis (SCCA) on the cloud. SCCA can find a reasonable trade-off between statistical fidelity and interpretability. Furthermore, we employ a generalised power method to optimise the SCCA algorithm. Finally, we conduct extensive experiments for recognition on several popular databases including UCI datasets and USAA dataset. Experimental results demonstrate that the proposed SCCA algorithm outperforms the traditional CCA algorithm.
Downloads
References
Akaho, S. (2007), A kernel method for canonical correlation analysis, arXiv preprint
cs/0609071.
d'Aspremont, A., Bach, F., Ghaoui, L.E. (2008), Optimal solutions for sparse principal
component analysis, Journal of Machine Learning Research, vol. 9, pp. 1269-1294.
d'Aspremont, A., El Ghaoui, L., Jordan, M.I., Lanckriet, G.R. (2007), A direct formulation for
sparse pca using semidefinite programming, SIAM review 49(3), 434-448.
Frey, P.W., Slate, D.J. (1991), Letter recognition using holland-style adaptive classifiers,
Machine learning, 6(2), 161-182.
Fyfe, C., Lai, P.L. (2000), Ica using kernel canonical correlation analysis, In: In Proc. Int.
Workshop on Independent Component Analysis and Blind Signal Separation (ICA2000).
Hardoon, D.R., Shawe-Taylor, J. (2011), Sparse canonical correlation analysis, Machine
Learning, 83(3), 331-353.
Hotelling, H. (1936), Relations between two sets of variates, Biometrika, vol. 28, pp. 321-377.
Jiang, Y.G., Ye, G., Chang, S.F., Ellis, D., Loui, A.C.(2011), Consumer video understanding:
A benchmark database and an evaluation of human and machine performance, In: Proceedings of the 1st
ACM International Conference on Multimedia Retrieval, p. 29. ACM.
Jolliffe, I. (2002), Principal component analysis, Wiley Online Library.
Journée, M., Nesterov, Y., Richtárik, P., Sepulchre, R. (2010), Generalized power method for
sparse principal component analysis, The Journal of Machine Learning Research, vol. 11, pp. 517-553.
Liu, W., Liu, H., Tao, D., Wang, Y., Lu, K. (2015), Multiview hessian regularized logistic
regression for action recognition, Signal Processing, vol. 110, pp. 101-107.
Liu, W., Tao, D., Cheng, J., Tang, Y. (2014), Multiview hessian discriminative sparse coding
for image annotation, Computer Vision and Image Understanding, vol. 118, pp. 50-60.
Liu, W., Zhang, H., Tao, D., Wang, Y., Lu, K. (2016), Large-scale paralleled sparse principal
component analysis, Multimedia Tools and Applications, 75(3), 1481-1493.
Luo, Y., Tao, D., Geng, B., Xu, C., Maybank, S.J. (2013), Manifold regularized multitask
learning for semi-supervised multilabel image classification, IEEE Transactions on Image Processing, 22(2),
-536.
Luo, Y., Tao, D., Ramamohanarao, K., Xu, C., Wen, Y. (2015), Tensor canonical correlation
analysis for multi-view dimension reduction, IEEE transactions on Knowledge and Data Engineering, 27(11),
-3124.
Luo, Y., Wen, Y., Tao, D., Gui, J., Xu, C. (2016), Large margin multi-modal multi-task
feature extraction for image classification, IEEE Transactions on Image Processing, 25(1), 414-427.
Melzer, T., Reiter, M., Bischof, H. (2003), Appearance models based on kernel canonical
correlation analysis, Pattern recognition, 36(9), 1961-1971.
Murtagh, F., Heck, A. (1987), Multivariate data analysis, Astrophysics and Space Science
Library, vol. 131.
Tao, D., Jin, L., Liu, W., Li, X. (2013), Hessian regularized support vector machines for
mobile image annotation on the cloud, IEEE Transactions on Multimedia, 15(4), 833-844.
Tao, D., Li, X., Wu, X., Maybank, S.J. (2007), General tensor discriminant analysis and
gabor features for gait recognition, IEEE Transactions on Pattern Analysis and Machine Intelligence, 29(10),
-1715.
Tao, D., Li, X., Wu, X., Maybank, S.J. (2009), Geometric mean for subspace selection, IEEE
Transactions on Pattern Analysis and Machine Intelligence, 31(2), 260-274.
Tao, D., Lin, X., Jin, L., Li, X. (2015), Principal component 2-d long short-term memory for
font recognition on single chinese characters, IEEE Transactions on Cybernetics, 46(3), 756-765.
Tao, D., Tang, X., Li, X., Wu, X. (2006), Asymmetric bagging and random subspace for
support vector machines-based relevance feedback in image retrieval, IEEE Transactions on Pattern Analysis
and Machine Intelligence, 28(7), 1088-1099.
Wang, M., Li, W., Liu, D., Ni, B., Shen, J., Yan, S. (2015), Facilitating image search with a
scalable and compact semantic mapping, IEEE Transactions on Cybernetics, 45(8), 1561-1574.
Wang, M., Ni, B., Hua, X.S., Chua, T.S. (2012), Assistive tagging: A survey of multimedia
tagging with human-computer joint exploration, ACM Computing Surveys (CSUR), 44(4), 25.
Yu, J., Wang, M., Tao, D. (2012), Semisupervised multiview distance metric learning for
cartoon synthesis, IEEE Transactions on Image Processing, 21(11), 4636-4648.
Zhang, H., Liu, W., Zha, Z.J.(2015), Sparse canonical correlation analysis for recognition, In:
Proceedings of the 7th International Conference on Internet Multimedia Computing and Service, p. 17. ACM.
Zheng, H., Wang, M., Li, Z. (2010), Audio-visual speaker identification with multi-view
distance metric learning, In: Image Processing (ICIP), 2010 17th IEEE International Conference on, pp.
-4564.
Zou, H., Hastie, T., Tibshirani, R. (2006), Sparse principal component analysis, Journal of
computational and graphical statistics, 15(2), 265-286.