Documents Classification System Using RapidMiner Tool

dc.contributor.advisorMaher Abu Baker, Kamal Irshaid
dc.contributor.authorAbdulhaq, Razan
dc.contributor.authorDeriyieh, Rasha
dc.contributor.authorAdawi, Shorouq
dc.date.accessioned2020-06-08T07:37:08Z
dc.date.available2020-06-08T07:37:08Z
dc.date.issued2019
dc.description.abstractThe exponential growth of the Internet has led to a great deal of interest especially by companies in developing useful and efficient Tools and Software’s to assist employees for doing their job and users for searching the web. However, the complexity of Natural Languages and the extremely High Dimensionality of the feature space of documents have made this Classification problem very difficult. We investigate four different Methods for Document Classification such as: the Naive Bayes classifier, the Nearest Neighbor Classifier, Decision Trees and a Support Vector Machine. These were applied to five classes of BBC and Reuters's news groups which is (Business, Entertainment, Politics, Sports and Technology) individually by using RapidMiner as a Tool. Our experimental results indicate that the Naive Bayes Classifier outperform the other classifiers on our data sets with a best accuracy of 85%. So we recommended companies to use RapidMiner as a Tool to classify their Documents and Naive Base as an algorithm to do this Classification.en_US
dc.identifier.urihttps://hdl.handle.net/20.500.11888/15012
dc.titleDocuments Classification System Using RapidMiner Toolen_US
dc.typeGraduation Projecten_US
Files
Original bundle
Now showing 1 - 3 of 3
Loading...
Thumbnail Image
Name:
Arabic Abstract.docx
Size:
12.94 KB
Format:
Microsoft Word XML
Description:
Loading...
Thumbnail Image
Name:
Project Documentation(1).docx
Size:
1.77 MB
Format:
Microsoft Word XML
Description:
Loading...
Thumbnail Image
Name:
Documents Classification System final project.pptx
Size:
1.44 MB
Format:
Microsoft Powerpoint XML
Description:
License bundle
Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
license.txt
Size:
1.71 KB
Format:
Item-specific license agreed upon to submission
Description: