Ве молиме користете го овој идентификатор да го цитирате или поврзете овој запис: http://hdl.handle.net/20.500.12188/31220
Наслов: RDFGraphGen: A Synthetic RDF Graph Generator based on SHACL Constraints
Authors: Marija Vecovska
Milos Jovanovik
Keywords: Data Generator
Synthetic Data
Knowledge Graphs
RDF
SHACL
Semantic Web
Issue Date: 25-јул-2024
Abstract: This paper introduces RDFGraphGen, a general-purpose, domain-independent generator of synthetic RDF graphs based on SHACL constraints. The Shapes Constraint Language (SHACL) is a W3C standard which specifies ways to validate data in RDF graphs, by defining constraining shapes. However, even though the main purpose of SHACL is validation of existing RDF data, in order to solve the problem with the lack of available RDF datasets in multiple RDF-based application development processes, we envisioned and implemented a reverse role for SHACL: we use SHACL shape definitions as a starting point to generate synthetic data for an RDF graph. The generation process involves extracting the constraints from the SHACL shapes, converting the specified constraints into rules, and then generating artificial data for a predefined number of RDF entities, based on these rules. The purpose of RDFGraphGen is the generation of small, medium or large RDF knowledge graphs for the purpose of benchmarking, testing, quality control, training and other similar purposes for applications from the RDF, Linked Data and Semantic Web domain. RDFGraphGen is open-source and is available as a ready-to-use Python package.
URI: http://hdl.handle.net/20.500.12188/31220
DOI: 10.48550/arXiv.2407.17941
Appears in Collections:Faculty of Computer Science and Engineering: Journal Articles

Files in This Item:
File Опис SizeFormat 
2407.17941v1.pdf179.78 kBAdobe PDFView/Open
Прикажи целосна запис

Page view(s)

25
checked on 7.11.2024

Download(s)

1
checked on 7.11.2024

Google ScholarTM

Проверете

Altmetric


Записите во DSpace се заштитени со авторски права, со сите права задржани, освен ако не е поинаку наведено.