An Analytical Framework for Employee Promotion Clustering
DOI:
https://doi.org/10.13052/jmm1550-4646.2167Keywords:
Employee Promotion, Clustering Performance, Feature Engineering, feature augmentation, feature extraction, principal component analysisAbstract
Choosing which employees to promote is a complex task that demands both fairness and effectiveness. Machine learning has significantly improved promotion decisions by providing data-driven insights and automation. Clustering-based promotion models are popular, but their performance is often hindered by deficiencies in the input data, which are typically noisy, class imbalanced, and high dimensional. These problems can be alleviated through data preprocessing. Accordingly, this study introduces an analytical framework for employee promotion clustering that incorporates feature engineering – both feature extraction and feature augmentation – to enhance clustering performance and generalizability. The principal contribution is the development of the generative performance feature (GPF), an augmented representation that amplifies the influence of performance-oriented features extracted via principal component analysis (PCA). The GPF captures the intrinsic structure of the original dataset and is formulated as an additive composite feature. The PCA-transformed dataset combined with the GPF is then used to construct a clustering model. The proposed framework was evaluated on two public datasets with both K-means and fuzzy C-means (FCM) clustering models. The GPF led to significant improvements across key evaluation metrics, namely, the Rand index, the mutual information score, the V-measure, and the Fowlkes–Mallows index. K-means clustering demonstrated superior performance to FCM clustering across all evaluation metrics for both datasets.
Downloads
References
Yildirmaz, A., C. Ryan, and J. Nezaj. 2019. 2019 State of the Workforce Report: Pay, Promotions and Retention. p. B4–B7.
Jay, S. 2023. What is a (good) promotion rate and how to calculate it. Retrieved from https://www.aihr.com/blog/promotion-rate/#what
Yiğit, F. 2023. A three-stage fuzzy neutrosophic decision support system for human resources decisions in organizations. Decis. Anal. J. 7: 100259. doi: 10.1016/j.dajour.2023.100259.
Ligare, B. S., K. W. Wanyama, and V. L. Aliata. 2020. Job promotion and employee performance among the Administration Police in Bungoma County, Kenya. Cross Curr. Int. J. Econ. Manag. Media Stud. 2: 34–41.
Noori, R. 2023. The state of promotions at work: How companies can fuel employee growth in 2023. Retrieved from https://nectarhr.com/blog/workplace-promotion-statistics.
Liu, J., T. Wang, J. Li, J. Huang, F. Yao, and R. He. 2019. A data-driven analysis of employee promotion: The role of the position of organization. 2019 IEEE International Conference on Systems, Man and Cybernetics (SMC). p. 4056–4062.
Alqahtani, F. A., and A. Almaleh. 2022. Analysis and prediction of employee promotions using machine learning. 2022 5th International Conference on Data Science and Information Technology (DSIT). p. 1–9.
Long, Y., J. Liu, M. Fang, T. Wang, and W. Jiang. 2018. Prediction of employee promotion based on personal basic features and post features. International Conference on Data Processing and Applications. p. 5–10.
Pahmi, S., S. Saepudin, N. Maesarah, U. I. Solehudin, and Wulandari. 2018. Implementation of CART (classification and regression trees) algorithm for determining factors affecting employee performance. 2018 International Conference on Computing, Engineering, and Design (ICCED). p. 57–62.
Priyanka, R., K. Ravindran, B. Sankaranarayanan, and S. M. Ali. 2023. A fuzzy DEMATEL decision modelling framework for identifying key human resources challenges in start-up companies: Implications for sustainable development. Decis. Anal. J. 6: 100192. doi: 10.1016/j.dajour.2023.100192.
Tang, A., T. Lu, Z. S. Lynch, O. Schaer, and S. Adams. 2020. Enhancing promotion decisions using classification and network-based methods. 2020 Systems and Information Engineering Design Symposium (SIEDS). p. 1–6.
Kaewwiset, T., and P. Temdee. 2022. Promotion classification using decision tree and principal component analysis. 2022 Joint International Conference on Digital Arts, Media and Technology with ECTI Northern Section Conference on Electrical, Electronics, Computer and Telecommunications Engineering (ECTI DAMT & NCON). p. 489–492.
Ilwani, M., G. Nassreddine, and J. A. Younis. 2023. Machine learning application on employee promotion. Mesop. J. Comput. Sci. 2023: 100–114.
Sahinbas, K. 2022. Employee promotion prediction by using machine learning algorithms for imbalanced dataset. 2022 2nd International Conference on Computing and Machine Intelligence (ICMI). p. 1–5.
Dang, Q., M. Truong, and M. Huynh. 2021. Studying the fuzzy clustering methods to understand employee performance. 2021 4th International Conference on Artificial Intelligence and Big Data (ICAIBD). p. 541–544.
Mousavian, S.A., A. Haeri, and F. Moslehi. 2021. Providing a hybrid clustering method as an auxiliary system in automatic labeling to divide employee into different levels of productivity and their retention. Iran. J. Manag. Stud. 15: 207–226.
Todeschini, B. V., C. M. Rodrigues, M. J. Anzanello, and G. L. Tortorella. 2016. Clustering tool usage to align a company strategy to its talent management needs. Eur. J. Appl. Bus. Manag. 2.
Sun, Q., T. Wu, and J. Hua. 2022. Design of distributed human resource management system of Spark framework based on fuzzy clustering. J. Sens. 2022: 1–9.
Liu, Y., C. Xie, X. Chen, Q. Zhu, and Y. Sun. 2023. Intelligent collection system of human resource information based on clustering algorithm. International Conference on Signal Processing and Communication Technology (SPCT 2022). p. 126151Z.
Sun, H., and Q. Li. 2019. Research on application of PCA and K-means clustering in enterprise human resources. Proceeding of the 24th International Conference on Industrial Engineering and Engineering Management 2018. p. 695–705.
Zhao, Y. 2020. Application of K-means clustering algorithm in human resource data informatization. Proceedings of the 2020 International Conference on Cyberspace Innovation of Advanced Technologies. p. 12–16.
Sarker, A., S. M. Shamim, M. S. Zama, and M. A. Rahman. 2018. Employee’s performance analysis and prediction using K-means clustering & decision tree algorithm. Glob. J. Comput. Sci. Technol. 18: 1–5.
Thakur, D., A. Guzzo, and G. Fortino. 2021. t-SNE and PCA in ensemble learning based human activity recognition with smartwatch. 2021 IEEE 2nd International Conference on Human-Machine Systems (ICHMS). p. 1–6.
Petkov, P. N., H. Helgason, and W. Kleijn. 2012. Feature set augmentation for enhancing the performance of a non-intrusive quality predictor. 2012 Fourth International Workshop on Quality of Multimedia Experience. p. 121–126.
Duan, Y., X. Niu, and G. Nie. 2018. Data augmentation based on interest points of feature. International Conference on Digital Image Processing. p. 108060B.
Bagdadli, S., Q. M. Roberson, and F. Paoletti. 2006. The mediating role of procedural justice in responses to promotion decisions. J. Bus. Psychol. 21: 83–102.
Gathungu, E., M. Iravo, and S. Namusonge. 2015. Effect of promotion strategies on the organizational commitment of banking sector employees in Kenya. IOSR J. Humanit. Soc. Sci. 20: 36–45. doi: 10.9790/0837-201013645.
Klaauw, B. V., and A. Dias da Silva. 2006. Wage dynamics and promotions inside and between firms. J. Popul. Econ. 24: 1513–1548.
Huang, Y. B. 2009. Study of college human resources data mining based on the SOM algorithm. 2009 Asia-Pacific Conference on Information Processing. p. 324–327.
Isaac, R. G., W. J. Zerbe, and D. C. Pitt. 2001. Leadership and motivation: The effective application of expectancy theory. J. Manag. Issues 13: 212.
Silva, I. E., and R. A. Krohling. 2018. A fuzzy sociometric approach to human resource allocation. 2018 IEEE International Conference on Fuzzy Systems (FUZZ-IEEE). p. 1–8.
Liu, Y. 2021. Analysis of human resource management mode and its selection factors based on clustering algorithm. 2021 IEEE International Conference on Advances in Electrical Engineering and Computer Applications (AEECA). p. 535–538.
Ouyang, J., and H. Ge. 2020. Data analysis framework of human resource estimation system based on MySQL-SAAS and fuzzy clustering. 2020 International Conference on Smart Electronics and Communication (ICOSEC). p. 417–420.
Wang, Z. 2021. Research on digital economy and human resources based on fuzzy clustering and edge computing. Secur. Commun. Netw. 2021: 5583967.
Peng, F., M. Guo, C. Zheng, S. Wang, X. Wang, and M. Xu. 2023. An assessment model of digital literacy for the students in vocational education based on principal component analysis in machine learning. 2023 IEEE 6th Information Technology, Networking, Electronic and Automation Control Conference (ITNEC). p. 1382–1386.
Guo, Z., and Y. Zhang. 2010. The third-party logistics performance evaluation based on the AHP-PCA model. 2010 International Conference on E-Product E-Service and E-Entertainment. p. 1–4.
Kumar, S. 2020. HR analysis case study. Retrieved October 1, 2022, from https://www.kaggle.com/shivan118/hranalysis.
Sulaiman, M. 2019. Data science staff promotion prediction. Retrieved October 1, 2022, from https://www.kaggle.com/datasets/behordeun/data-science-staff-promotion-prediction
Liu, L., Q. Wang, M. Dong, Z. Zhang, Y. Li, Z. Wang, and S. Wang. 2020. Application of K-means ++ algorithm based on t-SNE dimension reduction in transformer district clustering. 2020 Asia Energy and Electrical Engineering Symposium (AEEES). p. 74–78. doi: 10.1109/aeees48850.2020.9121438.
Pal, K., and M. Sharma. 2020. Performance evaluation of non-linear techniques UMAP and t-SNE for data in higher dimensional topological space. 2020 Fourth International Conference on I-SMAC (IoT in Social, Mobile, Analytics and Cloud) (I-SMAC). p. 1106–1110. doi: 10.1109/I-SMAC49090.2020.9243502.



