Please use this identifier to cite or link to this item: http://hdl.handle.net/20.500.12188/25707
DC FieldValueLanguage
dc.contributor.authorTudjarski, Stojanchoen_US
dc.contributor.authorMadevska Bogdanova, Anaen_US
dc.date.accessioned2023-02-13T13:38:54Z-
dc.date.available2023-02-13T13:38:54Z-
dc.date.issued2022-
dc.identifier.urihttp://hdl.handle.net/20.500.12188/25707-
dc.description.abstractWe implement a system that allows providing human-like answers to human-like questions extracted from a considerable amount of data in a reasonable time measured in seconds. To prove that the volume of the data used as a knowledge base where the answers to the questions are searched for, we used a complete English Wikipedia dump running on a local laptop under Windows10 OS, exposed to a software that receives questions and provides the three most relevant solutions. The entire technology stack of the implementation is the subject of this research. The main conclusion of this research is that it is possible to implement semantic search over a vast amount of text data on a local computer with an average hardware specifications, which is of outermost importance in developing different NLP systems.en_US
dc.subjectsemantic search, deep learning, transformers, BERTen_US
dc.titleRunning Semantic Search Over Complete English Wikipedia on a Local Computeren_US
dc.typeProceedingsen_US
dc.relation.conferenceThe 19th International Conference on Informatics and Information Technologies – CIIT 2022en_US
item.grantfulltextopen-
item.fulltextWith Fulltext-
Appears in Collections:Faculty of Computer Science and Engineering: Conference papers
Files in This Item:
File Description SizeFormat 
CIIT_2022_31.pdf1.04 MBAdobe PDFView/Open
Show simple item record

Page view(s)

104
checked on May 4, 2025

Download(s)

428
checked on May 4, 2025

Google ScholarTM

Check


Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.