Jo, Taeho.

Text Mining Concepts, Implementation, and Big Data Challenge / [electronic resource] : by Taeho Jo. - 1st ed. 2019. - XIII, 373 p. 236 illus., 148 illus. in color. online resource. - Studies in Big Data, 45 2197-6511 ; . - Studies in Big Data, 45 .

Part I: Foundation -- Introduction -- Text Indexing -- Text Encoding -- Text Association -- Part II: Text Categorization -- Text Categorization: Conceptual View -- Text Categorization: Approaches -- Text Categorization: Implementation -- Text Categorization: Evaluation -- Part III: Text Clustering -- Text Clustering: Conceptual View -- Text Clustering: Approaches -- Text Clustering: Implementation -- Text Clustering: Evaluation -- Part IV: Advanced Topics -- Text Summarization -- Text Segmentation -- Taxonomy Generation -- Dynamic Document Organization -- References -- Index.

This book discusses text mining and different ways this type of data mining can be used to find implicit knowledge from text collections. The author provides the guidelines for implementing text mining systems in Java, as well as concepts and approaches. The book starts by providing detailed text preprocessing techniques and then goes on to provide concepts, the techniques, the implementation, and the evaluation of text categorization. It then goes into more advanced topics including text summarization, text segmentation, topic mapping, and automatic text management. Presents techniques of preprocessing texts into structured forms; Outlines concepts of text categorization and clustering, their algorithms, and implementation guides; Includes advanced topics such as text summarization, text segmentation, topic mapping, and automatic text management.

9783319918150

10.1007/978-3-319-91815-0 doi


Telecommunication.
Computational intelligence.
Data mining.
Information storage and retrieval systems.
Quantitative research.
Communications Engineering, Networks.
Computational Intelligence.
Data Mining and Knowledge Discovery.
Information Storage and Retrieval.
Data Analysis and Big Data.

TK5101-5105.9

621.382