Querying knowledge graphs in natural language.

Liang, S.; Stockinger, K.; de Farias, T.M.; Anisimova, M.; Gil, M.

doi:10.1186/s40537-020-00383-w

Querying knowledge graphs in natural language.

Details

Download: Liang_40537_2020_Article_383.pdf (2038.67 [Ko])
State: Public
Version: Final published version
License: CC BY 4.0

Serval ID

serval:BIB_673D6C6C570E

Type

Article: article from journal or magazin.

Collection

Publications

Institution

UNIL/CHUV

Title

Querying knowledge graphs in natural language.

Journal

Journal of big data

Author(s)

Liang S., Stockinger K., de Farias T.M., Anisimova M., Gil M.

ISSN

2196-1115 (Print)

ISSN-L

2196-1115

Publication state

Published

Issued date

2021

Peer-reviewed

Oui

Volume

Number

Pages

1-23

Language

english

Notes

Publication types: Journal Article
Publication Status: ppublish

Abstract

Knowledge graphs are a powerful concept for querying large amounts of data. These knowledge graphs are typically enormous and are often not easily accessible to end-users because they require specialized knowledge in query languages such as SPARQL. Moreover, end-users need a deep understanding of the structure of the underlying data models often based on the Resource Description Framework (RDF). This drawback has led to the development of Question-Answering (QA) systems that enable end-users to express their information needs in natural language. While existing systems simplify user access, there is still room for improvement in the accuracy of these systems. In this paper we propose a new QA system for translating natural language questions into SPARQL queries. The key idea is to break up the translation process into 5 smaller, more manageable sub-tasks and use ensemble machine learning methods as well as Tree-LSTM-based neural network models to automatically learn and translate a natural language question into a SPARQL query. The performance of our proposed QA system is empirically evaluated using the two renowned benchmarks-the 7th Question Answering over Linked Data Challenge (QALD-7) and the Large-Scale Complex Question Answering Dataset (LC-QuAD). Experimental results show that our QA system outperforms the state-of-art systems by 15% on the QALD-7 dataset and by 48% on the LC-QuAD dataset, respectively. In addition, we make our source code available.

Keywords

Knowledge graphs, Natural language processing, Query processing, SPARQL

URN

urn:nbn:ch:serval-BIB_673D6C6C570E1

OAI-PMH

oai:serval.unil.ch:BIB_673D6C6C570E

DOI

10.1186/s40537-020-00383-w

Pubmed

33489717

Web of science

000610410900005