Documents Classification System Using RapidMiner Tool

Abdulhaq, Razan; Deriyieh, Rasha; Adawi, Shorouq

Documents Classification System Using RapidMiner Tool

Files

Arabic Abstract.docx (12.94 KB)

Project Documentation(1).docx (1.77 MB)

Documents Classification System final project.pptx (1.44 MB)

Date

2019

Authors

Abdulhaq, Razan

Deriyieh, Rasha

Adawi, Shorouq

Abstract

The exponential growth of the Internet has led to a great deal of interest especially by companies in developing useful and efficient Tools and Software’s to assist employees for doing their job and users for searching the web. However, the complexity of Natural Languages and the extremely High Dimensionality of the feature space of documents have made this Classification problem very difficult. We investigate four different Methods for Document Classification such as: the Naive Bayes classifier, the Nearest Neighbor Classifier, Decision Trees and a Support Vector Machine. These were applied to five classes of BBC and Reuters's news groups which is (Business, Entertainment, Politics, Sports and Technology) individually by using RapidMiner as a Tool. Our experimental results indicate that the Naive Bayes Classifier outperform the other classifiers on our data sets with a best accuracy of 85%. So we recommended companies to use RapidMiner as a Tool to classify their Documents and Naive Base as an algorithm to do this Classification.

URI

https://hdl.handle.net/20.500.11888/15012

Collections

Management Information Systems

Full item page

Documents Classification System Using RapidMiner Tool

Files

Date

Authors

Journal Title

Journal ISSN

Volume Title

Publisher

Abstract

Description

Keywords

Citation

URI

Collections

Endorsement

Review

Supplemented By

Referenced By