Knowledge Graph Generation for Research Articles

Authors

  • Dhanalakshmi Teekaraman Jerusalem College of Engineering, Chennai, Tamil Nadu, India 600100
  • Tamizhselvi S P

DOI:

https://doi.org/10.12723/mjs.70.3

Keywords:

Query System, Triples , Natural Language Processing, SPARQL, RDF

Abstract

The increasing amount of web data has made accessing and processing information efficiently challenging. One solution is to transform the unstructured data into a machine-readable structured format like a knowledge graph. This paper addresses an information retrieval system that enables users to formulate queries in natural language and obtain pertinent information from a knowledge graph specific to a particular domain. Understanding any unstructured data is tougher than structured data. Inferring knowledge from any research article is difficult for naïve users.  To resolve this, we propose to create a knowledge graph for the same. Our system utilizes natural language processing techniques to analyze user queries, generating SPARQL queries to retrieve pertinent data from the knowledge graph. We leverage state-of-the-art knowledge graph models to assess the system's precision, recall, and F1 score. The result shows that the proposed model can effectively retrieve relevant information with an average precision of approximately 95%.

References

A faster way to build and share data apps, https://streamlit.io/

AllenNLP, https://allenai.org/allennlp

Bianchi, F., Rossiello, G., Costabello, L., Palmonari, M., Minervini, P.: Knowledge Graph Embeddings and Explainable AI. Knowledge Graphs for eXplainable Articial Intelligence, Computer Science. (2020)

Blazegraph Database, https://blazegraph.com/

Da Zheng, Xiang Song, Chao Ma, Zeyuan Tan, Zihao Ye, Jin Dong, Hao Xiong, Zheng Zhang, and George Karypis.: DGL-KE: Training Knowledge Graph Embeddings at Scale. In Proceedings of the 43rd International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR'20). Association for Computing Machinery, New York, NY, USA, 739748.https://doi.org/10.1145/3397271.3401172

Diefenbach, D., Lopez, V., Singh, K., Maret,P. : Core techniques of question answering systems over knowledge bases: a survey. Knowledge and Information Systems 55(2), 529569 (2018)

Gad-Elrab, M. H., Urbani, J., Stepanova, D., Weikum, G.: ExFaKT: A framework for explaining facts over knowledge graphs and text. In WSDM 2019- Proceedings of the 12th ACM International Conference on Web Search and Data Mining, pp.8795.

Association for Computing Machinery, Inc. Melbourne VIC Australia (2019)

H. Cai, V. Zheng and K. Chang: A Comprehensive Survey of Graph Embedding: Problems, Techniques, and Applications. IEEE Transactions on Knowledge & Data Engineering, 30(9), 616-1637 (2018)

Kuansan Wang, Zhihong Shen, Chiyuan Huang, Chieh-Han Wu, Yuxiao Dong, and Anshul Kanakia: Microsoft academic graph: When experts are not enough. Quantitative Science Studies 1(1), 396413 (2020)

Liang, Shiqi & Stockinger, Kurt & Mendes de Farias, Tarcisio & Anisimova, Maria & Gil, Manuel: Querying Knowledge Graphs in Natural Language. Journal of Big Data 8(1):3 (2021)

Liu, Fenglin & You, Chenyu & Wu, Xian & Ge, Shen & Wang, Sheng & Sun, Xu.: Auto-Encoding Knowledge Graph for Unsupervised Medical Report Generation. arXiv:2111.04318 [cs.LG]. (2021)

Rossi, Andrea & Barbosa, Denilson & Firmani, Donatella & Matinata, Antonio & Merialdo, Paolo.: Knowledge Graph Embedding for Link Prediction: A Comparative Analysis. ACM Transactions on Knowledge Discovery from Data. 2(15), 149 (2021)

S. Ji, S. Pan, E. Cambria, P. Marttinen and P. S. Yu: A Survey on Knowledge Graphs: Representation, Acquisition, and Applications. IEEE Transactions on Neural Networks and Learning Systems, 33(2), 494 514 (2022)

The Stanford NLP Group, https://nlp.stanford.edu/software/openie.html

Weihua Hu, Bowen Liu, Joseph Gomes, Marinka Zitnik, Percy Liang, Vijay Pande, Jure Leskovec: Strategies for pre-training graph neural networks. In International Conference on Learning Representations (ICLR) (2020)

Yu Rong, Yatao Bian, Tingyang Xu, Weiyang Xie, Ying Wei, Wenbing Huang, and Junzhou Huang: Self-supervised graph transformer on large-scale molecular data. In Proceedings of the 34th International Conference on Neural Information Processing Systems (NIPS'20), Article 1053, pp.1255912571. Curran Associates Inc., Red Hook, NY, USA (2020). https://doi.org/10.5555/3495724.3496777

Additional Files

Published

2024-10-09