Automating Feature Extraction from Entity-Relation Models: Experimental Evaluation of Machine Learning Methods for Relational Learning

Stanoev, Boris; Mitrov, Goran; Kulakov, Andrea; Mirceva, Georgina; Lameski, Petre; Zdravevski, Eftim

Please use this identifier to cite or link to this item: http://hdl.handle.net/20.500.12188/33562

DC Field	Value	Language
dc.contributor.author	Stanoev, Boris	en_US
dc.contributor.author	Mitrov, Goran	en_US
dc.contributor.author	Kulakov, Andrea	en_US
dc.contributor.author	Mirceva, Georgina	en_US
dc.contributor.author	Lameski, Petre	en_US
dc.contributor.author	Zdravevski, Eftim	en_US
dc.date.accessioned	2025-05-16T07:02:00Z	-
dc.date.available	2025-05-16T07:02:00Z	-
dc.date.issued	2024-04-01	-
dc.identifier.uri	http://hdl.handle.net/20.500.12188/33562	-
dc.description.abstract	With the exponential growth of data, extracting actionable insights becomes resource-intensive. In many organizations, normalized relational databases store a significant portion of this data, where tables are interconnected through some relations. This paper explores relational learning, which involves joining and merging database tables, often normalized in the third normal form. The subsequent processing includes extracting features and utilizing them in machine learning (ML) models. In this paper, we experiment with the propositionalization algorithm (i.e., Wordification) for feature engineering. Next, we compare the algorithms PropDRM and PropStar, which are designed explicitly for multi-relational data mining, to traditional machine learning algorithms. Based on the performed experiments, we concluded that Gradient Boost, compared to PropDRM, achieves similar performance (F1 score, accuracy, and AUC) on multiple datasets. PropStar consistently underperformed on some datasets while being comparable to the other algorithms on others. In summary, the propositionalization algorithm for feature extraction makes it feasible to apply traditional ML algorithms for relational learning directly. In contrast, approaches tailored specifically for relational learning still face challenges in scalability, interpretability, and efficiency. These findings have a practical impact that can help speed up the adoption of machine learning in business contexts where data is stored in relational format without requiring domain-specific feature extraction.	en_US
dc.publisher	MDPI AG	en_US
dc.relation.ispartof	Big Data and Cognitive Computing	en_US
dc.title	Automating Feature Extraction from Entity-Relation Models: Experimental Evaluation of Machine Learning Methods for Relational Learning	en_US
dc.identifier.doi	10.3390/bdcc8040039	-
dc.identifier.url	https://www.mdpi.com/2504-2289/8/4/39/pdf	-
dc.identifier.volume	8	-
dc.identifier.issue	4	-
item.grantfulltext	open	-
item.fulltext	With Fulltext	-
crisitem.author.dept	Faculty of Computer Science and Engineering	-
crisitem.author.dept	Faculty of Computer Science and Engineering	-
crisitem.author.dept	Faculty of Computer Science and Engineering	-
Appears in Collections:	Faculty of Computer Science and Engineering: Journal Articles
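
The abstract describes propositionalization (Wordification) as the step that makes joined relational tables usable by traditional ML algorithms. The general idea can be illustrated with a minimal sketch. It is not the authors' implementation, and the toy tables, column names, and the helper functions `wordify` and `tfidf` are all hypothetical. It assumes the commonly described form of the technique: each attribute value becomes a word of the form table_attribute_value, each row of the main table becomes a bag of such words collected from its related rows, and the words are then weighted with TF-IDF.

```python
import math
from collections import Counter

# Hypothetical toy schema (not from the paper): a main table and one related table.
customers = [
    {"id": 1, "segment": "retail"},
    {"id": 2, "segment": "business"},
]
orders = [
    {"customer_id": 1, "category": "books", "channel": "web"},
    {"customer_id": 1, "category": "toys", "channel": "store"},
    {"customer_id": 2, "category": "books", "channel": "web"},
]


def wordify(customers, orders):
    """Turn each main-table row into a list of 'table_attribute_value' words."""
    docs = {}
    for customer in customers:
        words = [f"customers_segment_{customer['segment']}"]
        # Follow the foreign key and add one word per attribute of each related row.
        for order in orders:
            if order["customer_id"] == customer["id"]:
                words.append(f"orders_category_{order['category']}")
                words.append(f"orders_channel_{order['channel']}")
        docs[customer["id"]] = words
    return docs


def tfidf(docs):
    """Weight each word by term frequency times log inverse document frequency."""
    n_docs = len(docs)
    doc_freq = Counter(w for words in docs.values() for w in set(words))
    weights = {}
    for key, words in docs.items():
        counts = Counter(words)
        weights[key] = {
            w: (c / len(words)) * math.log(n_docs / doc_freq[w])
            for w, c in counts.items()
        }
    return weights


docs = wordify(customers, orders)
features = tfidf(docs)
```

Each entry of `features` is a sparse feature vector for one customer. Such vectors could then be passed to a standard classifier, which is the kind of pipeline the abstract compares against the relational-learning-specific methods.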

Files in This Item:

File	Size	Format
BDCC-08-00039.pdf	680.42 kB	Adobe PDF	View/Open

Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.

UKIM Repository of Works
