Non ti piace? Non importa! Puoi restituire gli articoli fino a 30 giorni
Non puoi sbagliarti con un buono regalo. Con il buono regalo, il destinatario può scegliere qualsiasi prodotto della nostra offerta.
Fino a 30 giorni per il reso
Text classification (TC) is the task of automatically categorizing text into pre-set categories by analyzing their contents. In this monograph, a framework of a new TC model based on hidden Markov model is proposed, and its implementation is demonstrated in an application of library and information science. Two primary objectives are: First, the development of a new TC model based on hidden Markov model (HMM) proposed as a new framework for TC task. HMM has been applied to a wide range of applications in text processing such as text segmentation and event tracking, information retrieval, and information extraction. Few, however, have applied HMM to TC. Second, the application of the Library of Congress Classification (LCC) as a classification scheme for automatically organizing digital resources. A general prototype for an HMM-based TC model is proposed and implemented, so as to classify a collection of dissertation abstracts from the ProQuest Digital Dissertations database into LCC. The proposed model is compared to a Naďve Bayesian model, which has been extensively used in TC applications. Lastly, current TC challenges and issues are discussed.
Ciao! Sono Libroamiko, il tuo consulente di libri.
Come posso aiutarti?