UUM ETD | Universiti Utara Malaysian Electronic Theses and Dissertation
FAQs | Feedback | Search Tips | Sitemap

Text Categorization Using Naive Bayes Algorithm

Wan Hazimah, Wan Ismail (2005) Text Categorization Using Naive Bayes Algorithm. Masters thesis, Universiti Utara Malaysia.

[img] PDF
WAN_HAZIMAH_BT._WAN_ISMAIL.pdf
Restricted to Registered users only

Download (6MB)
[img]
Preview
PDF
1.WAN_HAZIMAH_BT._WAN_ISMAIL.pdf

Download (1MB) | Preview

Abstract

As the volume of information available on the internet and corporate intranet continues to increase, there is a growing interest in helping people better find, filter, and manage all these resources. Text categorization is one of the techniques that can be applied in this situation. This paper presents text categorization system based on naive Bayes algorithm. This algorithm has long been used for text categorization tasks. Naive Bayes classifier is based on probability model that integrate strong independence assumptions which often have no bearing in reality. The aims of this project are to categorize the textual document using naive Bayes algorithm and to measure the correctness of the chosen technique for the categorization process. This paper also discusses the experiment in categorizing articles using naive Bayes. The result shows that the accuracy for training is 81.82% whereas the accuracy for testing is 47.62%.

Item Type: Thesis (Masters)
Uncontrolled Keywords: Text Categorization, Naive Bayes Algorithm, Textual Document, Articles, Internet Resources
Subjects: Q Science > QA Mathematics > QA71-90 Instruments and machines
Divisions: Faculty and School System > Faculty of Information Technology
Depositing User: Mrs Shamsiah Mohd Shariff
Date Deposited: 09 Feb 2010 07:42
Last Modified: 24 Jul 2013 12:11
URI: http://etd.uum.edu.my/id/eprint/1368

Actions (login required)

View Item View Item