AN EMPIRICAL COMPARISON OF NEO4J AND TIGERGRAPH DATABASES FOR NETWORK CENTRALITY
DOI:
https://doi.org/10.25271/sjuoz.2023.11.2.1068Keywords:
Graph Database, Relational Database, Database Model, Neo4j, TigerGraphAbstract
Graph databases have recently gained a lot of attention in areas where the relationships between data and the data itself are equally important, like the semantic web, social networks, and biological networks. A graph database is simply a database designed to store, query, and modify graphs. Recently, several graph database models have been developed. The goal of this research is to evaluate the performance of the two most popular graph databases, Neo4j and TigerGraph, for network centrality metrics including degree centrality, betweenness centrality, closeness centrality, eigenvector centrality, and PageRank. We applied those metrics to a set of real-world networks in both graph databases to see their performance. Experimental results show Neo4j outperforms TigerGraph for computing the centrality metrics used in this study, but TigerGraph performs better during the data loading phase.
References
D. Fernandes and J. Bernardino, “Graph Databases Comparison: AllegroGraph, ArangoDB, InfiniteGraph, Neo4J, and OrientDB.,” in Data, 2018, pp. 373–380.
F. Chen, Y.-C. Wang, B. Wang, and C.-C. J. Kuo, “Graph representation learning: a survey,” APSIPA Trans. Signal Inf. Process., vol. 9, 2020.
N. Shadbolt, T. Berners-Lee, and W. Hall, “The semantic web revisited,” IEEE Intell. Syst., vol. 21, no. 3, pp. 96–101, 2006.
X.-M. Xu, J. Zhan, and H. Zhu, “Using social networks to organize researcher community,” in International Conference on Intelligence and Security Informatics, 2008, pp. 421–427.
R. Soussi, M.-A. Aufaure, and H. Baazaoui, “Graph database for collaborative communities,” in Community-Built Databases, Springer, 2011, pp. 205–234.
R. Kumar Kaliyar, “Graph databases: A survey,” in International Conference on Computing, Communication & Automation, 2015, pp. 785–790.
B. Ristevski, “Using Graph Databases for Querying and Network Analysing,” 2019.
A. G. Baset, “Graphical Database Architecture For Clinical Trials,” 2015.
R. Angles and C. Gutierrez, “An introduction to graph data management,” in Graph Data Management, Springer, 2018, pp. 1–32.
H. Lu, Z. Hong, and M. Shi, “Analysis of film data based on Neo4j,” in 2017 IEEE/ACIS 16th International Conference on Computer and Information Science (ICIS), 2017, pp. 675–677.
S. Almabdy, “Comparative analysis of relational and graph databases for social networks,” in 2018 1st International Conference on Computer Applications & Information Security (ICCAIS), 2018, pp. 1–4.
A. Lysenko, I. A. Roznovăţ, M. Saqi, A. Mazein, C. J. Rawlings, and C. Auffray, “Representing and querying disease networks using graph databases,” BioData Min., vol. 9, no. 1, pp. 1–19, 2016.
M. Šestak, M. Heričko, T. W. Družovec, and M. Turkanović, “Applying k-vertex cardinality constraints on a Neo4j graph database,” Future Gener. Comput. Syst., vol. 115, pp. 459–474, 2021.
M. Macak, M. Stovcik, B. Buhnova, and M. Merjavy, “How well a multi-model database performs against its single-model variants: Benchmarking OrientDB with Neo4j and MongoDB,” in 2020 15th Conference on Computer Science and Information Systems (FedCSIS), 2020, pp. 463–470.
M. Desai, R. G. Mehta, and D. P. Rana, “An empirical analysis to identify the effect of indexing on influence detection using graph databases,” Int J Innov Technol Explor. Eng, vol. 8, pp. 414–421, 2019.
S. Beis, S. Papadopoulos, and Y. Kompatsiaris, “Benchmarking graph databases on the problem of community detection,” in New Trends in Database and Information Systems II, Springer, 2015, pp. 3–14.
R. Wang, Z. Yang, W. Zhang, and X. Lin, “An empirical study on recent graph database systems,” in International Conference on Knowledge Science, Engineering and Management, 2020, pp. 328–340.
C. Liu and H. Duan, “A Graph Database Storage Engine for Provenance Graphs,” DBKDA 2020, p. 8.
T. Chen, C. Yuan, G. Liu, and R. Dai, “Graph based platform for electricity market study, education and training,” in 2018 IEEE Power & Energy Society General Meeting (PESGM), 2018, pp. 1–5.
F. Rusu and Z. Huang, “In-depth benchmarking of graph database systems with the Linked Data Benchmark Council (LDBC) Social Network Benchmark (SNB),” ArXiv Prepr. ArXiv190707405, 2019.
A. Deutsch, Y. Xu, M. Wu, and V. E. Lee, “Aggregation support for modern graph analytics in tigergraph,” in Proceedings of the 2020 ACM SIGMOD International Conference on Management of Data, 2020, pp. 377–392.
R. Angles and C. Gutierrez, “Survey of graph database models,” ACM Comput. Surv. CSUR, vol. 40, no. 1, pp. 1–39, 2008.
N. S. Patil, P. Kiran, N. P. Kiran, and N. P. KM, “A survey on graph database management techniques for huge unstructured data,” Int. J. Electr. Comput. Eng., vol. 8, no. 2, p. 1140, 2018.
R. Angles, “The Property Graph Database Model.,” in AMW, 2018.
A. Das, A. Mitra, S. N. Bhagat, and S. Paul, “Issues and Concepts of Graph Database and a Comparative Analysis on list of Graph Database tools,” in 2020 International Conference on Computer Communication and Informatics (ICCCI), 2020, pp. 1–6.
R. H. Güting, “GraphDB: Modeling and querying graphs in databases,” in VLDB, 1994, vol. 94, pp. 12–15.
M. Consens and A. Mendelzon, “Hy+ A hygraph-based query and visualization system,” ACM SIGMOD Rec., vol. 22, no. 2, pp. 511–516, 1993.
U. Brandes, Network analysis: methodological foundations, vol. 3418. Springer Science & Business Media, 2005.
W. Nejdl, W. Siberski, and M. Sintek, “Design issues and challenges for RDF-and schema-based peer-to-peer systems,” ACM SIGMOD Rec., vol. 32, no. 3, pp. 41–46, 2003.
H. R. Vyawahare, P. P. Karde, and V. M. Thakare, “A hybrid database approach using graph and relational database,” in 2018 International Conference on Research in Intelligent and Computing in Engineering (RICE), 2018, pp. 1–4.
K. Islam, K. Ahsan, S. A. K. Bari, M. Saeed, and S. A. Ali, “Huge and Real-Time Database Systems: A Comparative Study and Review for SQL Server 2016, Oracle 12c & MySQL 5.7 for Personal Computer,” J. Basic Appl. Sci., vol. 13, pp. 481–490, 2017.
N. Jatana, S. Puri, M. Ahuja, I. Kathuria, and D. Gosain, “A survey and comparison of relational and non-relational database,” Int. J. Eng. Res. Technol., vol. 1, no. 6, pp. 1–5, 2012.
D. Dominguez-Sal, P. Urbón-Bayes, A. Giménez-Vanó, S. Gómez-Villamor, N. Martínez-Bazan, and J. L. Larriba-Pey, “Survey of graph database performance on the hpc scalable graph analysis benchmark,” in International Conference on Web-Age Information Management, 2010, pp. 37–48.
J. Guia, V. Gonçalves Soares, and J. Bernardino, “Graph Databases: Neo4j Analysis:,” in Proceedings of the 19th International Conference on Enterprise Information Systems, Porto, Portugal, 2017, pp. 351–356. doi: 10.5220/0006356003510356.
G. Li, H. T. Shen, Y. Yuan, X. Wang, H. Liu, and X. Zhao, Knowledge Science, Engineering and Management: 13th International Conference, KSEM 2020, Hangzhou, China, August 28-30, 2020, Proceedings, Part I, vol. 12274. Springer Nature, 2020.
S. Baker Effendi, B. van der Merwe, and W.-T. Balke, “Suitability of graph database technology for the analysis of spatio-temporal data,” Future Internet, vol. 12, no. 5, p. 78, 2020.
A. Bavelas, “A mathematical model for group structures,” Hum. Organ., vol. 7, no. 3, pp. 16–30, 1948.
V. Latora and M. Marchiori, “A measure of centrality based on network efficiency,” New J. Phys., vol. 9, no. 6, p. 188, 2007.
S. Adali, X. Lu, and M. Magdon-Ismail, “Deconstructing centrality: thinking locally and ranking globally in networks,” in Proceedings of the 2013 IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining, 2013, pp. 418–425.
Ö. Şimşek and A. Barto, “Skill characterization based on betweenness,” Adv. Neural Inf. Process. Syst., vol. 21, 2008.
F. A. Rodrigues, “Network centrality: an introduction,” in A mathematical modeling approach from nonlinear dynamics to complex systems, Springer, 2019, pp. 177–196.
A. Saxena and S. Iyengar, “Centrality measures in complex networks: A survey,” ArXiv Prepr. ArXiv201107190, 2020.
K. Das, S. Samanta, and M. Pal, “Study on centrality measures in social networks: a survey,” Soc. Netw. Anal. Min., vol. 8, no. 1, pp. 1–11, 2018.
P. Choudhary and U. Singh, “A survey on social network analysis for counter-terrorism,” Int. J. Comput. Appl., vol. 112, no. 9, pp. 24–29, 2015.
A. Bihari and M. K. Pandia, “Eigenvector centrality and its application in research professionals’ relationship network,” in 2015 International Conference on Futuristic Trends on Computational Analysis and Knowledge Management (ABLAZE), 2015, pp. 510–514.
A. Landherr, B. Friedl, and J. Heidemann, “A critical review of centrality measures in social networks,” Bus. Inf. Syst. Eng., vol. 2, no. 6, pp. 371–385, 2010.
K. Henni, N. Mezghani, and C. Gouin-Vallerand, “Unsupervised graph-based feature selection via subspace and pagerank centrality,” Expert Syst. Appl., vol. 114, pp. 46–53, 2018.
A. Hashemi, M. B. Dowlatshahi, and H. Nezamabadi-Pour, “MGFS: A multi-label graph-based feature selection algorithm via PageRank centrality,” Expert Syst. Appl., vol. 142, p. 113024, 2020.
“Stanford Large Network Dataset Collection (SNAP).” Accessed: Feb. 08, 2022. [Online]. Available: “Stanford large network dataset,” http: //snap.stanford.edu/data/index.html.
J. Leskovec and J. Mcauley, “Learning to discover social circles in ego networks,” Adv. Neural Inf. Process. Syst., vol. 25, 2012.
B. Rozemberczki, C. Allen, and R. Sarkar, “Multi-scale attributed node embedding (2019),” ArXiv Prepr. ArXiv190913021, 2019.
J. Leskovec, J. Kleinberg, and C. Faloutsos, “Graph evolution: Densification and shrinking diameters,” ACM Trans. Knowl. Discov. Data TKDD, vol. 1, no. 1, pp. 2-es, 2007.
Published
How to Cite
Issue
Section
License
Copyright (c) 2023 Bahzad charbuty, Abdulhakeem Othman Mohammed
This work is licensed under a Creative Commons Attribution 4.0 International License.
Authors who publish with this journal agree to the following terms:
- Authors retain copyright and grant the journal right of first publication with the work simultaneously licensed under a Creative Commons Attribution License [CC BY-NC-SA 4.0] that allows others to share the work with an acknowledgment of the work's authorship and initial publication in this journal.
- Authors are able to enter into separate, additional contractual arrangements for the non-exclusive distribution of the journal's published version of the work, with an acknowledgment of its initial publication in this journal.
- Authors are permitted and encouraged to post their work online.