Data Profiling and Machine Learning to Identify Influencers from Social Media Platforms


  • Bahaa Eddine Elbaghazaoui, Laboratory of Computer Sciences, Faculty of Sciences Kenitra, Ibn Tofail University, Morocco
  • Mohamed Amnai, Laboratory of Computer Sciences, Faculty of Sciences Kenitra, Ibn Tofail University, Morocco
  • Youssef Fakhri, Laboratory of Computer Sciences, Faculty of Sciences Kenitra, Ibn Tofail University, Morocco



Data profiling, machine learning, PageRank, influencer, social media


Social media networks are used across numerous application domains, and the huge volume of data uploaded to them is attracting significant interest. Posts allow consumers to express their opinions on products and services, and some of this feedback can influence how other users view those products. Identifying influencers on social media networks, and profiling their product perceptions and preferences, is therefore critical for marketers who want to run efficient viral marketing and recommendation strategies. Our main goal in this research is to find the best machine learning model for characterizing influencers on social media networks. To achieve this objective, our strategy applies the PageRank algorithm to profile influential nodes in the social media network graph. Our experimental results show that the correlation changes each time a new parameter is added to the machine learning models, which in turn helps determine the most suitable model for our needs. These outcomes are significant for profiling influencers on social media platforms.
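
As an illustration of the PageRank step mentioned in the abstract, the following is a minimal sketch only, not the authors' implementation. It assumes a directed follower graph in which an edge (a, b) means "a follows b", a damping factor of 0.85, and a plain power iteration. The function name, the toy accounts, and all parameter values are hypothetical.

```python
from collections import defaultdict


def pagerank(edges, damping=0.85, tol=1e-9, max_iter=200):
    """Power-iteration PageRank over (follower, followed) pairs."""
    nodes = sorted({node for edge in edges for node in edge})
    out_links = defaultdict(list)
    for src, dst in edges:
        out_links[src].append(dst)

    n = len(nodes)
    rank = {node: 1.0 / n for node in nodes}
    for _ in range(max_iter):
        # Rank held by nodes without outgoing edges is spread uniformly.
        dangling = sum(rank[node] for node in nodes if not out_links[node])
        base = (1.0 - damping) / n + damping * dangling / n
        new_rank = {node: base for node in nodes}
        for src in nodes:
            targets = out_links[src]
            if targets:
                # Each node passes an equal share of its rank to those it follows.
                share = damping * rank[src] / len(targets)
                for dst in targets:
                    new_rank[dst] += share
        change = sum(abs(new_rank[node] - rank[node]) for node in nodes)
        rank = new_rank
        if change < tol:
            break
    return rank


# Hypothetical toy follower graph (assumption, not data from the study).
edges = [
    ("ana", "dan"),
    ("bob", "dan"),
    ("cat", "dan"),
    ("dan", "cat"),
    ("eve", "cat"),
]
scores = pagerank(edges)
ranking = sorted(scores, key=scores.get, reverse=True)
print(ranking[:2])
```

In this toy graph the two accounts with the most incoming follows end up at the top of the ranking. Scores of this kind could then serve as a target or as a feature for the machine learning models the abstract refers to, although the exact way the study combines them is not stated here.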



Author Biographies

Bahaa Eddine Elbaghazaoui, Laboratory of Computer Sciences, Faculty of Sciences Kenitra, Ibn Tofail University, Morocco

Bahaa Eddine Elbaghazaoui began his studies with a scientific baccalaureate in mathematical sciences. In 2013, he entered the National School of Applied Sciences in Khouribga, where he completed the integrated preparatory classes. He then joined the computer engineering track and obtained a software engineering diploma in 2019. As of 2022, Bahaa Eddine is a third-year doctoral student conducting his research in the Laboratory of Computer Science in Kenitra, Morocco.

Mohamed Amnai, Laboratory of Computer Sciences, Faculty of Sciences Kenitra, Ibn Tofail University, Morocco

Mohamed Amnai received his bachelor’s degree in IEEA (Computers, Electronics, Electrical and Automation) in 2000 from Moulay Ismail University, Errachidia. He obtained his master’s degree in 2007 from Ibn Tofail University, Kenitra. In 2011, he received his Ph.D. in Telecommunication and Computer Science from Ibn Tofail University, Kenitra, Morocco. Since March 2014, he has been an Assistant at the National School of Applied Sciences Khouribga, Settat University, Morocco. He joined the Faculty of Sciences of Kénitra, Department of Computer Science and Mathematics, Ibn Tofail University, Morocco, as an Associate Professor in 2018. He is also an associate member of the Research Laboratory in Computer Science and Telecommunications (LaRIT), Team Networks and Telecommunications, Faculty of Science, Kenitra, Morocco, and an associate member of the IPOSI laboratory, National School of Applied Sciences, Sultan Moulay Slimane University, Khouribga, Morocco.

Youssef Fakhri, Laboratory of Computer Sciences, Faculty of Sciences Kenitra, Ibn Tofail University, Morocco

Youssef Fakhri received his Bachelor’s Degree (B.S) in Electronic Physics in 2001 and his Master’s Degree (DESA) in Computer and Telecommunication from the Faculty of Sciences, University Mohammed V, Rabat, Morocco, in 2003, where he developed his Master’s Project at the ICI Company, Morocco. He received a Ph.D. in 2007 from the University Mohammed V – Agdal, Rabat, Morocco, in collaboration with the Polytechnic University of Catalonia (UPC), Spain. He joined the Faculty of Sciences of Kénitra, Department of Computer Science and Mathematics, Ibn Tofail University, Morocco, as an Associate Professor in March 2009. He is the Laboratory head at LaRIT, an Associate Researcher at the Laboratory for Research in Computing and Telecommunications (LaRIT) in the Faculty of Sciences of Rabat, and a member of the Pole of Competences STIC Morocco.








Intelligent Systems for Smart Applications