Analysis of Machine-Based Learning Algorithm Used in Named Entity Recognition

Francis Mithanga Kamau, Kennedy Ogada, Cheruiyot Wilson Kipruto
Informing Science: The International Journal of an Emerging Transdiscipline  •  Volume 26  •  2023  •  pp. 069-084

The amount of information published has increased dramatically due to the information explosion. The issue of managing information as it expands at this rate lies in the development of information extraction technology that can turn unstructured data into organized data that is understandable and controllable by computers

The primary goal of named entity recognition (NER) is to extract named entities from amorphous materials and place them in pre-defined semantic classes.

In our work, we analyze various machine learning algorithms and implement K-NN which has been widely used in machine learning and remains one of the most popular methods to classify data.

To the researchers’ best knowledge, no published study has presented Named entity recognition for the Kikuyu language using a machine learning algorithm. This research will fill this gap by recognizing entities in the Kikuyu language.

An evaluation was done by testing precision, recall, and F-measure. The experiment results demonstrate that using K-NN is effective in classification performance.

With enough training data, researchers could perform an experiment and check the learning curve with accuracy that compares to state of art NER.

Future studies may be done using unsupervised and semi-supervised learning algorithms for other resource-scarce languages.

named entity recognition, memory-based learning algorithms, semantic web problems, K-Nearest Neighbor (KNN), precision, recall, F-Score measures
218 total downloads
Share this
 Back

Back to Top ↑