An Efficient Approach to Machine Learning Based Text Classification Through Distributed Computing

Download or Read eBook An Efficient Approach to Machine Learning Based Text Classification Through Distributed Computing PDF written by Raghu Nandan Immaneni and published by . This book was released on 2015 with total page 75 pages. Available in PDF, EPUB and Kindle.

Author	: Raghu Nandan Immaneni
Publisher	:
Total Pages	: 75
Release	: 2015
ISBN-10	: 1339214954
ISBN-13	: 9781339214955
Rating	: 4/5 (54 Downloads)

DOWNLOAD EBOOK

Book Synopsis An Efficient Approach to Machine Learning Based Text Classification Through Distributed Computing by : Raghu Nandan Immaneni

Book excerpt: Abstract: Text classification is one of the classical problems in computer science, which is primarily used for categorizing data, spam detection, anonymization, information extraction, text summarization etc. Given the large amounts of data involved in the above applications, automated and accurate training models and approaches to classify data efficiently are needed. In this thesis, an extensive study of the interaction between natural language processing, information retrieval and text classification has been performed. A case study named "keyword extraction" that deals with 'identifying keywords and tags from millions of text questions' is used as a reference. Different classifiers are implemented using MapReduce paradigm on the case study and the experimental results are recorded using two newly built distributed computing Hadoop clusters. The main aim is to enhance the prediction accuracy, to examine the role of text pre-processing for noise elimination and to reduce the computation time and resource utilization on the clusters.

An Efficient Approach to Machine Learning Based Text Classification Through Distributed Computing Related Books

Language: en
Pages: 75

An Efficient Approach to Machine Learning Based Text Classification Through Distributed Computing

Authors: Raghu Nandan Immaneni

Categories: Electronic data processing

Type: BOOK - Published: 2015 - Publisher:

DOWNLOAD EBOOK

Abstract: Text classification is one of the classical problems in computer science, which is primarily used for categorizing data, spam detection, anonymization

Language: en
Pages: 218

Learning to Classify Text Using Support Vector Machines

Authors: Thorsten Joachims

Categories: Computers

Type: BOOK - Published: 2012-12-06 - Publisher: Springer Science & Business Media

DOWNLOAD EBOOK

Based on ideas from Support Vector Machines (SVMs), Learning To Classify Text Using Support Vector Machines presents a new approach to generating text classifie

Language: en
Pages: 352

Distributed Computing and Artificial Intelligence, 19th International Conference

Authors: Sigeru Omatu

Categories: Technology & Engineering

Type: BOOK - Published: 2022-12-12 - Publisher: Springer Nature

DOWNLOAD EBOOK

DCAI 2022 is a forum to present applications of innovative techniques for studying and solving complex problems in artificial intelligence and computing areas.

Language: en
Pages: 169

Inductive Inference for Large Scale Text Classification

Authors: Catarina Silva

Categories: Mathematics

Type: BOOK - Published: 2009-11-13 - Publisher: Springer Science & Business Media

DOWNLOAD EBOOK

Text classification is becoming a crucial task to analysts in different areas. In the last few decades, the production of textual documents in digital form has

Language: en
Pages: 332

Data Science and Big Data Computing

Authors: Zaigham Mahmood

Categories: Business & Economics

Type: BOOK - Published: 2016-07-05 - Publisher: Springer

DOWNLOAD EBOOK

This illuminating text/reference surveys the state of the art in data science, and provides practical guidance on big data analytics. Expert perspectives are pr