Journal ArticleOpen Access
A Partial String Matching Approach for Named Entity Recognition in Unstructured Bengali Data
Authors
Author Affiliations
Bangladesh University of Engineering and Technology, University of Dhaka
Published InInternational Journal of Modern Education and Computer Science
Year2018
Citations8
Abstract
In today's data driven, automated and digitized world, a significant stage of information extraction is to look for special keywords, more formally known as 'Named Entity'. This has been an active research topic for more than two decades and significant progresses have been made. Today we have models powered by deep learning that, although not perfect, have near human level accuracy on certain occasions. Unfortunately these algorithms require a lot of annotated training data, which we hardly have for Bengali language. This paper proposes a partial string matching approach to identify a named entity from an unstructured text corpus in Bengali. The algorithm is a partial string matching technique, based on Breadth First Search (BFS) search on a Trie data…
View at Publisher
BORR does not host full-text PDFs. The button above takes you to the original publisher.
Fields & Keywords
Physical SciencesComputer ScienceArtificial IntelligenceTopic ModelingNatural Language Processing TechniquesText and Document Classification TechnologiesArtificial intelligenceNatural language processingInformation retrievalData miningProgramming languageManagementQuantum mechanicsStatisticsMathematical analysis