AN EMPIRICAL COMPARISON OF NEO4J AND TIGERGRAPH DATABASES FOR NETWORK CENTRALITY

Authors

  • Bahzad Chicho Duhok Polytechnic University, Technical College of Informatics, Information Technology Department, Kurdistan Region, Iraq - bahzad.taha@dpu.edu.krd
  • Abdulhakeem Othman Mohammed Duhok Polytechnic University, Kurdistan Region, Iraq

DOI:

https://doi.org/10.25271/sjuoz.2023.11.2.1068

Keywords:

Graph Database, Relational Database, Database Model, Neo4j, TigerGraph

Abstract

Graph databases have recently gained a lot of attention in areas where the relationships between data and the data itself are equally important, like the semantic web, social networks, and biological networks. A graph database is simply a database designed to store, query, and modify graphs. Recently, several graph database models have been developed. The goal of this research is to evaluate the performance of the two most popular graph databases, Neo4j and TigerGraph, for network centrality metrics including degree centrality, betweenness centrality, closeness centrality, eigenvector centrality, and PageRank. We applied those metrics to a set of real-world networks in both graph databases to see their performance. Experimental results show Neo4j outperforms TigerGraph for computing the centrality metrics used in this study, but TigerGraph performs better during the data loading phase.

References

D. Fernandes and J. Bernardino, “Graph Databases Comparison: AllegroGraph, ArangoDB, InfiniteGraph, Neo4J, and OrientDB.,” in Data, 2018, pp. 373–380.

F. Chen, Y.-C. Wang, B. Wang, and C.-C. J. Kuo, “Graph representation learning: a survey,” APSIPA Trans. Signal Inf. Process., vol. 9, 2020.

N. Shadbolt, T. Berners-Lee, and W. Hall, “The semantic web revisited,” IEEE Intell. Syst., vol. 21, no. 3, pp. 96–101, 2006.

X.-M. Xu, J. Zhan, and H. Zhu, “Using social networks to organize researcher community,” in International Conference on Intelligence and Security Informatics, 2008, pp. 421–427.

R. Soussi, M.-A. Aufaure, and H. Baazaoui, “Graph database for collaborative communities,” in Community-Built Databases, Springer, 2011, pp. 205–234.

R. Kumar Kaliyar, “Graph databases: A survey,” in International Conference on Computing, Communication & Automation, 2015, pp. 785–790.

B. Ristevski, “Using Graph Databases for Querying and Network Analysing,” 2019.

A. G. Baset, “Graphical Database Architecture For Clinical Trials,” 2015.

R. Angles and C. Gutierrez, “An introduction to graph data management,” in Graph Data Management, Springer, 2018, pp. 1–32.

H. Lu, Z. Hong, and M. Shi, “Analysis of film data based on Neo4j,” in 2017 IEEE/ACIS 16th International Conference on Computer and Information Science (ICIS), 2017, pp. 675–677.

S. Almabdy, “Comparative analysis of relational and graph databases for social networks,” in 2018 1st International Conference on Computer Applications & Information Security (ICCAIS), 2018, pp. 1–4.

A. Lysenko, I. A. Roznovăţ, M. Saqi, A. Mazein, C. J. Rawlings, and C. Auffray, “Representing and querying disease networks using graph databases,” BioData Min., vol. 9, no. 1, pp. 1–19, 2016.

M. Šestak, M. Heričko, T. W. Družovec, and M. Turkanović, “Applying k-vertex cardinality constraints on a Neo4j graph database,” Future Gener. Comput. Syst., vol. 115, pp. 459–474, 2021.

M. Macak, M. Stovcik, B. Buhnova, and M. Merjavy, “How well a multi-model database performs against its single-model variants: Benchmarking OrientDB with Neo4j and MongoDB,” in 2020 15th Conference on Computer Science and Information Systems (FedCSIS), 2020, pp. 463–470.

M. Desai, R. G. Mehta, and D. P. Rana, “An empirical analysis to identify the effect of indexing on influence detection using graph databases,” Int J Innov Technol Explor. Eng, vol. 8, pp. 414–421, 2019.

S. Beis, S. Papadopoulos, and Y. Kompatsiaris, “Benchmarking graph databases on the problem of community detection,” in New Trends in Database and Information Systems II, Springer, 2015, pp. 3–14.

R. Wang, Z. Yang, W. Zhang, and X. Lin, “An empirical study on recent graph database systems,” in International Conference on Knowledge Science, Engineering and Management, 2020, pp. 328–340.

C. Liu and H. Duan, “A Graph Database Storage Engine for Provenance Graphs,” DBKDA 2020, p. 8.

T. Chen, C. Yuan, G. Liu, and R. Dai, “Graph based platform for electricity market study, education and training,” in 2018 IEEE Power & Energy Society General Meeting (PESGM), 2018, pp. 1–5.

F. Rusu and Z. Huang, “In-depth benchmarking of graph database systems with the Linked Data Benchmark Council (LDBC) Social Network Benchmark (SNB),” ArXiv Prepr. ArXiv190707405, 2019.

A. Deutsch, Y. Xu, M. Wu, and V. E. Lee, “Aggregation support for modern graph analytics in tigergraph,” in Proceedings of the 2020 ACM SIGMOD International Conference on Management of Data, 2020, pp. 377–392.

R. Angles and C. Gutierrez, “Survey of graph database models,” ACM Comput. Surv. CSUR, vol. 40, no. 1, pp. 1–39, 2008.

N. S. Patil, P. Kiran, N. P. Kiran, and N. P. KM, “A survey on graph database management techniques for huge unstructured data,” Int. J. Electr. Comput. Eng., vol. 8, no. 2, p. 1140, 2018.

R. Angles, “The Property Graph Database Model.,” in AMW, 2018.

A. Das, A. Mitra, S. N. Bhagat, and S. Paul, “Issues and Concepts of Graph Database and a Comparative Analysis on list of Graph Database tools,” in 2020 International Conference on Computer Communication and Informatics (ICCCI), 2020, pp. 1–6.

R. H. Güting, “GraphDB: Modeling and querying graphs in databases,” in VLDB, 1994, vol. 94, pp. 12–15.

M. Consens and A. Mendelzon, “Hy+ A hygraph-based query and visualization system,” ACM SIGMOD Rec., vol. 22, no. 2, pp. 511–516, 1993.

U. Brandes, Network analysis: methodological foundations, vol. 3418. Springer Science & Business Media, 2005.

W. Nejdl, W. Siberski, and M. Sintek, “Design issues and challenges for RDF-and schema-based peer-to-peer systems,” ACM SIGMOD Rec., vol. 32, no. 3, pp. 41–46, 2003.

H. R. Vyawahare, P. P. Karde, and V. M. Thakare, “A hybrid database approach using graph and relational database,” in 2018 International Conference on Research in Intelligent and Computing in Engineering (RICE), 2018, pp. 1–4.

K. Islam, K. Ahsan, S. A. K. Bari, M. Saeed, and S. A. Ali, “Huge and Real-Time Database Systems: A Comparative Study and Review for SQL Server 2016, Oracle 12c & MySQL 5.7 for Personal Computer,” J. Basic Appl. Sci., vol. 13, pp. 481–490, 2017.

N. Jatana, S. Puri, M. Ahuja, I. Kathuria, and D. Gosain, “A survey and comparison of relational and non-relational database,” Int. J. Eng. Res. Technol., vol. 1, no. 6, pp. 1–5, 2012.

D. Dominguez-Sal, P. Urbón-Bayes, A. Giménez-Vanó, S. Gómez-Villamor, N. Martínez-Bazan, and J. L. Larriba-Pey, “Survey of graph database performance on the hpc scalable graph analysis benchmark,” in International Conference on Web-Age Information Management, 2010, pp. 37–48.

J. Guia, V. Gonçalves Soares, and J. Bernardino, “Graph Databases: Neo4j Analysis:,” in Proceedings of the 19th International Conference on Enterprise Information Systems, Porto, Portugal, 2017, pp. 351–356. doi: 10.5220/0006356003510356.

G. Li, H. T. Shen, Y. Yuan, X. Wang, H. Liu, and X. Zhao, Knowledge Science, Engineering and Management: 13th International Conference, KSEM 2020, Hangzhou, China, August 28-30, 2020, Proceedings, Part I, vol. 12274. Springer Nature, 2020.

S. Baker Effendi, B. van der Merwe, and W.-T. Balke, “Suitability of graph database technology for the analysis of spatio-temporal data,” Future Internet, vol. 12, no. 5, p. 78, 2020.

A. Bavelas, “A mathematical model for group structures,” Hum. Organ., vol. 7, no. 3, pp. 16–30, 1948.

V. Latora and M. Marchiori, “A measure of centrality based on network efficiency,” New J. Phys., vol. 9, no. 6, p. 188, 2007.

S. Adali, X. Lu, and M. Magdon-Ismail, “Deconstructing centrality: thinking locally and ranking globally in networks,” in Proceedings of the 2013 IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining, 2013, pp. 418–425.

Ö. Şimşek and A. Barto, “Skill characterization based on betweenness,” Adv. Neural Inf. Process. Syst., vol. 21, 2008.

F. A. Rodrigues, “Network centrality: an introduction,” in A mathematical modeling approach from nonlinear dynamics to complex systems, Springer, 2019, pp. 177–196.

A. Saxena and S. Iyengar, “Centrality measures in complex networks: A survey,” ArXiv Prepr. ArXiv201107190, 2020.

K. Das, S. Samanta, and M. Pal, “Study on centrality measures in social networks: a survey,” Soc. Netw. Anal. Min., vol. 8, no. 1, pp. 1–11, 2018.

P. Choudhary and U. Singh, “A survey on social network analysis for counter-terrorism,” Int. J. Comput. Appl., vol. 112, no. 9, pp. 24–29, 2015.

A. Bihari and M. K. Pandia, “Eigenvector centrality and its application in research professionals’ relationship network,” in 2015 International Conference on Futuristic Trends on Computational Analysis and Knowledge Management (ABLAZE), 2015, pp. 510–514.

A. Landherr, B. Friedl, and J. Heidemann, “A critical review of centrality measures in social networks,” Bus. Inf. Syst. Eng., vol. 2, no. 6, pp. 371–385, 2010.

K. Henni, N. Mezghani, and C. Gouin-Vallerand, “Unsupervised graph-based feature selection via subspace and pagerank centrality,” Expert Syst. Appl., vol. 114, pp. 46–53, 2018.

A. Hashemi, M. B. Dowlatshahi, and H. Nezamabadi-Pour, “MGFS: A multi-label graph-based feature selection algorithm via PageRank centrality,” Expert Syst. Appl., vol. 142, p. 113024, 2020.

“Stanford Large Network Dataset Collection (SNAP).” Accessed: Feb. 08, 2022. [Online]. Available: “Stanford large network dataset,” http: //snap.stanford.edu/data/index.html.

J. Leskovec and J. Mcauley, “Learning to discover social circles in ego networks,” Adv. Neural Inf. Process. Syst., vol. 25, 2012.

B. Rozemberczki, C. Allen, and R. Sarkar, “Multi-scale attributed node embedding (2019),” ArXiv Prepr. ArXiv190913021, 2019.

J. Leskovec, J. Kleinberg, and C. Faloutsos, “Graph evolution: Densification and shrinking diameters,” ACM Trans. Knowl. Discov. Data TKDD, vol. 1, no. 1, pp. 2-es, 2007.

Downloads

Published

2023-04-30

How to Cite

Chicho , B., & Abdulhakeem Othman Mohammed. (2023). AN EMPIRICAL COMPARISON OF NEO4J AND TIGERGRAPH DATABASES FOR NETWORK CENTRALITY. Science Journal of University of Zakho, 11(2), 190–201. https://doi.org/10.25271/sjuoz.2023.11.2.1068

Issue

Section

Science Journal of University of Zakho