A Study of “Categorization of Telugu Text under Language Mode”

Research Article | Open Access

Volume 11, 2019

Dr. K. Rama Krishna

Pages: 257-264

Abstract

In this paper we propose language-dependent and language-independent models for the categorization of Telugu documents. India is a multilingual country, and each Indian state may choose its own official language for state-level communication for official purposes. The amount of textual data available in electronic form in the various Indian regional languages is growing rapidly, so classifying text documents by language is crucial. Telugu is the third most spoken language in India and one of the fifteen most spoken languages in the world. It is the official language of the states of Telangana and Andhra Pradesh. A variant of the k-nearest neighbors algorithm is used for the categorization process.
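The abstract does not say which variant of k-nearest neighbors is used, so the following is only a generic sketch of k-NN text categorization, not the author's method. It assumes character trigram counts as a language-independent representation, cosine similarity as the distance measure, and a similarity-weighted vote among the k nearest training documents. The names `char_ngrams`, `cosine`, `knn_classify`, and the toy `TRAIN` set with its category labels are hypothetical and exist only for illustration.

```python
import math
from collections import Counter


def char_ngrams(text, n=3):
    """Count overlapping character n-grams (assumed representation).

    Works on Unicode code points, so no Telugu-specific tokenizer is needed.
    """
    text = " ".join(text.split())  # normalize whitespace
    return Counter(text[i:i + n] for i in range(len(text) - n + 1))


def cosine(a, b):
    """Cosine similarity between two n-gram count vectors."""
    dot = sum(count * b[gram] for gram, count in a.items() if gram in b)
    norm_a = math.sqrt(sum(c * c for c in a.values()))
    norm_b = math.sqrt(sum(c * c for c in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


def knn_classify(document, training_set, k=3):
    """Return the category with the largest similarity-weighted vote
    among the k training documents most similar to `document`."""
    query = char_ngrams(document)
    scored = sorted(
        ((cosine(query, char_ngrams(text)), label) for text, label in training_set),
        reverse=True,
    )
    votes = Counter()
    for similarity, label in scored[:k]:
        votes[label] += similarity
    return votes.most_common(1)[0][0]


# Toy training set, made up for illustration only (not data from the paper).
TRAIN = [
    ("క్రికెట్ మ్యాచ్ జట్టు", "sports"),
    ("క్రికెట్ ఆట జట్టు", "sports"),
    ("సినిమా నటుడు పాట", "cinema"),
    ("సినిమా పాట దర్శకుడు", "cinema"),
]

if __name__ == "__main__":
    print(knn_classify("క్రికెట్ జట్టు", TRAIN))
    print(knn_classify("సినిమా పాట", TRAIN))
```

A language-dependent counterpart would replace `char_ngrams` with features that rely on Telugu-specific processing, such as word tokens after stemming, while leaving the neighbor search and voting unchanged.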

Keywords

categorization, classification, documents, process, purpose, regional language
