Please use this identifier to cite or link to this item: http://hdl.handle.net/20.500.12188/25352
DC FieldValueLanguage
dc.contributor.authorJofche, Nasien_US
dc.contributor.authorMishev, Kostadinen_US
dc.contributor.authorStojanov, Risteen_US
dc.contributor.authorJovanovik, Milosen_US
dc.contributor.authorZdravevski, Eftimen_US
dc.contributor.authorTrajanov, Dimitaren_US
dc.date.accessioned2023-01-10T09:03:21Z-
dc.date.available2023-01-10T09:03:21Z-
dc.date.issued2023-01-09-
dc.identifier.citationJofche N, Mishev K, Stojanov R, Jovanovik M, Zdravevski E, Trajanov D. PharmKE: Knowledge Extraction Platform for Pharmaceutical Texts Using Transfer Learning. Computers. 2023; 12(1):17. https://doi.org/10.3390/computers12010017en_US
dc.identifier.urihttp://hdl.handle.net/20.500.12188/25352-
dc.description.abstractEven though named entity recognition (NER) has seen tremendous development in recent years, some domain-specific use-cases still require tagging of unique entities, which is not well handled by pre-trained models. Solutions based on enhancing pre-trained models or creating new ones are efficient, but creating reliable labeled training for them to learn on is still challenging. In this paper, we introduce PharmKE, a text analysis platform tailored to the pharmaceutical industry that uses deep learning at several stages to perform an in-depth semantic analysis of relevant publications. The proposed methodology is used to produce reliably labeled datasets leveraging cutting-edge transfer learning, which are later used to train models for specific entity labeling tasks. By building models for the well-known text-processing libraries spaCy and AllenNLP, this technique is used to find Pharmaceutical Organizations and Drugs in texts from the pharmaceutical domain. The PharmKE platform also incorporates the NER findings to resolve co-references of entities and examine the semantic linkages in each phrase, creating a foundation for further text analysis tasks, such as fact extraction and question answering. Additionally, the knowledge graph created by DBpedia Spotlight for a specific pharmaceutical text is expanded using the identified entities. The obtained results with the proposed methodology result in about a 96% F1-score on the NER tasks, which is up to 2% better than those of the fine-tuned BERT and BioBERT models developed using the same dataset. The ultimate benefits of the platform are that pharmaceutical domain specialists may more easily identify the knowledge extracted from the input texts thanks to the platform’s visualization of the model findings. Likewise, the proposed techniques can be integrated into mobile and pervasive systems to give patients more relevant and comprehensive information from scanned medication guides. Similarly, it can provide preliminary insights to patients and even medical personnel on whether a drug from a different vendor is compatible with the patient’s prescription medication.en_US
dc.language.isoenen_US
dc.publisherMDPIen_US
dc.relation.ispartofComputersen_US
dc.subjectKnowledge extractionen_US
dc.subjectNatural Language Processingen_US
dc.subjectNamed entity recognitionen_US
dc.subjectKnowledge Graphsen_US
dc.subjectDrugsen_US
dc.subjectPharmacologyen_US
dc.titlePharmKE: Knowledge Extraction Platform for Pharmaceutical Texts Using Transfer Learningen_US
dc.typeArticleen_US
dc.typeJournal Articleen_US
dc.identifier.doi10.3390/computers12010017-
dc.identifier.urlhttps://www.mdpi.com/2073-431X/12/1/17-
dc.identifier.volume12-
dc.identifier.issue1-
item.grantfulltextopen-
item.fulltextWith Fulltext-
crisitem.author.deptFaculty of Computer Science and Engineering-
crisitem.author.deptFaculty of Computer Science and Engineering-
crisitem.author.deptFaculty of Computer Science and Engineering-
crisitem.author.deptFaculty of Computer Science and Engineering-
Appears in Collections:Faculty of Computer Science and Engineering: Journal Articles
Files in This Item:
File Description SizeFormat 
computers-12-00017.pdfManuscript (PDF)924.36 kBAdobe PDFView/Open
Show simple item record

Page view(s)

59
checked on May 11, 2024

Download(s)

13
checked on May 11, 2024

Google ScholarTM

Check

Altmetric


Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.