GitHubNet: Understanding the Characteristics of GitHub Network

Authors

DOI:

https://doi.org/10.13052/jwe1540-9589.19561

Keywords:

GitHub, social coding, social network analysis, link analysis, graph database

Abstract

Web 2.0 technologies have not only raised microblogs, but also social software development and collaboration platforms. GitHub is the most popular software development platform that provides social collaboration. Within the scope of this study, a novel graph-based analysis model is proposed which targets to reveal (1) the characteristics of the GitHub in order to shed light on social software development in general, and (2) the most popular programming languages, repositories, and developers in order to shed light on the trending software development technologies. To this end, a subset of the GitHub network, which contains 84, 737 developers and 209, 100 repositories, was collected through the GitHub API and stored on a graph database namely neo4j to be later analyzed. The result of the analysis shows that (1) the connections in GitHub are not mutually linked, (2) JavaScript, Python, and Java are currently the most popular three programming languages, (3) You-Dont-Know-JS, oh-my-zsh, and public-apis are the most popular three repositories, and (4) TarrySingh (Tarry Singh), indrajithban-dara (Indrajith Bandara), and rootsongjc (Jimmy Song) are the most popular three developers. Furthermore, the proposed novel analysis model can be easily applied to other social networks.

Downloads

Download data is not yet available.

Author Biography

Abdullah Talha Kabakus, Department of Computer Engineering, Faculty of Engineering, Duzce University, Turkey

Abdullah Talha Kabakus received the bachelor’s degree in computer engineering from Cankaya University in 2010, the master’s degree in computer engineering from Gazi University in 2014, and the philosophy of doctorate degree in Electrical-Electronics & Computer Engineering from Duzce University in 2017, respectively. He is currently working as an Assistant Professor at the Department of Computer Engineering, Faculty of Engineering, Duzce University. His research areas include mobile security, deep learning, and social network analysis. He has been serving as a reviewer for many highly-respected journals.

References

Z. Luo, X. Mao, and A. Li, “An Exploratory Research of GitHub Based on Graph Model,” in Proceedings - 2015 9th International Conference on Frontier of Computer Science and Technology, FCST 2015, 2015, pp. 96–103.

H. Borges and M. Tulio Valente, “What’s in a GitHub Star? Understanding Repository Starring Practices in a Social Coding Platform,” J. Syst. Softw., vol. 146, pp. 112–129, 2018.

L. Dabbish, C. Stuart, J. Tsay, and J. Herbsleb, “Social Coding in GitHub: Transparency and Collaboration in an Open Software Repository,” in Proceedings of the ACM 2012 conference on Computer Supported Cooperative Work (CSCW ’12), 2012, pp. 1277–1286.

“Pricing - Plans for every developer,” GitHub, 2019. [Online]. Available: https://github.com/pricing. [Accessed: 06-Dec-2019].

K. Blincoe, J. Sheoran, S. Goggins, E. Petakovic, and D. Damian, “Understanding the popular users: Following, affiliation influence and leadership on GitHub,” Inf. Softw. Technol., vol. 70, pp. 30–39, 2016.

J. Jiang, D. Lo, Y. Yang, J. Li, and L. Zhang, “A first look at unfollowing behavior on GitHub,” Inf. Softw. Technol., vol. 105, pp. 150–160, 2019.

“The State of the Octoverse,” GitHub, 2019. [Online]. Available: https://octoverse.github.com. [Accessed: 06-Dec-2019].

“GitHub API v3 | GitHub Developer Guide,” GitHub, 2019. [Online]. Available: https://developer.github.com/v3/. [Accessed: 06-Dec-2019].

Y. Hu, J. Zhang, X. Bai, S. Yu, and Z. Yang, “Influence analysis of Github repositories,” Springerplus, vol. 5, no. 1, pp. 1–19, 2016.

F. Thung, T. F. Bissyandé, D. Lo, and L. Jiang, “Network structure of social coding in GitHub,” in Proceedings of the 17th European Conference on Software Maintenance and Reengineering (CSMR 2013), 2013, pp. 323–326.

J. Jiang, L. Zhang, and L. Li, “Understanding Project Dissemination on a Social Coding Site,” in Proceedings of 20th Working Conference on Reverse Engineering (WCRE 2013), 2013, pp. 132–141.

B. Ray, D. Posnett, P. Devanbu, and V. Filkov, “A large-scale study of programming languages and code quality in GitHub,” Commun. ACM, vol. 60, no. 10, pp. 91–100, 2017.

D. Rusk and Y. Coady, “Location-based analysis of developers and technologies on GitHub,” in Proceedings of 2014 28th International Conference on Advanced Information Networking and Applications Workshops, 2014, pp. 681–685.

B. Vasilescu, A. Serebrenik, and V. Filkov, “A Data Set for Social Diversity Studies of GitHub Teams,” in Proceedings of 2015 IEEE/ACM 12th Working Conference on Mining Software Repositories, 2015, pp. 514–517.

Y. Yu, G. Yin, H. Wang, and T. Wang, “Exploring the patterns of social behavior in GitHub,” in Proceedings of the 1st International Workshop on Crowd-based Software Development Methods and Technologies (CrowdSoft 2014), 2014, pp. 31–36.

G. Bougie, J. Starke, M.-A. Storey, and D. M. German, “Towards understanding twitter use in software engineering: Preliminary findings, ongoing challenges and future questions,” in Proceedings of the 2nd International Workshop on Web 2.0 for Software Engineering (Web2SE ’11), 2011, pp. 31–36.

Y. Tian, P. Achananuparp, I. N. Lubis, D. Lo, and E. P. Lim, “What does software engineering community microblog about?,” in IEEE International Working Conference on Mining Software Repositories, 2012, pp. 247–250.

C. Treude, O. Barzilay, and M.-A. Storey, “How do programmers ask and answer questions on the web?: NIER track,” in Proceeding of the 33rd international conference on Software engineering (ICSE ’11), 2011, pp. 804–807.

D. Surian, D. Lo, and E. P. Lim, “Mining Collaboration Patterns from a Large Developer Network,” in Proceedings of 2010 17th Working Conference on Reverse Engineering, 2010, pp. 269–273.

S. Brin and L. Page, “The anatomy of a large scale hypertextual Web search engine,” Comput. Networks ISDN Syst., vol. 30, pp. 107–117, 1998.

E. Agirre and A. Soroa, “Personalizing PageRank for word sense disambiguation,” in Proceedings of the 12th Conference of the European Chapter of the ACL (EACL 2009), 2009, pp. 33–41.

R. Bana and A. Arora, “Influence Indexing of Developers, Repositories, Technologies and Programming languages on Social Coding Community GitHub,” in 2018 Eleventh International Conference on Contemporary Computing (IC3), 2018, pp. 1–6.

W. Leibzon, “Social network of software development at GitHub,” in Proceedings of the 2016 IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining (ASONAM 2016), 2016, pp. 1374–1376.

“Stack Overflow Developer Survey 2019,” Stack Overflow, 2019. [Online]. Available: https://insights.stackoverflow.com/survey/2019. [Accessed: 06-Dec-2019].

Published

2020-09-22

How to Cite

Kabakus, A. T. (2020). GitHubNet: Understanding the Characteristics of GitHub Network. Journal of Web Engineering, 19(5-6), 557–574. https://doi.org/10.13052/jwe1540-9589.19561

Issue

Section

Articles