Analysis of Machine-Based Learning Algorithm Used in Named Entity Recognition
The amount of information published has increased dramatically due to the information explosion. The issue of managing information as it expands at this rate lies in the development of information extraction technology that can turn unstructured data into organized data that is understandable and controllable by computers
The primary goal of named entity recognition (NER) is to extract named entities from amorphous materials and place them in pre-defined semantic classes.
In our work, we analyze various machine learning algorithms and implement K-NN which has been widely used in machine learning and remains one of the most popular methods to classify data.
To the researchers’ best knowledge, no published study has presented Named entity recognition for the Kikuyu language using a machine learning algorithm. This research will fill this gap by recognizing entities in the Kikuyu language.
An evaluation was done by testing precision, recall, and F-measure. The experiment results demonstrate that using K-NN is effective in classification performance.
With enough training data, researchers could perform an experiment and check the learning curve with accuracy that compares to state of art NER.
Future studies may be done using unsupervised and semi-supervised learning algorithms for other resource-scarce languages.