UUM Electronic Theses and Dissertation

Indexing strategy for big data processing: A case study of PingER

Adamu, Fatima Binta (2015) Indexing strategy for big data processing: A case study of PingER. Masters thesis, Universiti Utara Malaysia.

Text: s817056_01.pdf (7MB)
Text: s817056_02.pdf (180kB)

Abstract

With the huge amount of data continuously accumulated and shared by individuals and organizations, it has become necessary to meet the emerging processing and retrieval requirements associated with these large volumes of complex data. This can be achieved by indexing the data sets and by reducing the heavy computational overhead that most current indexing strategies incur when processing very large data sets. This study proposes a novel indexing strategy called Big Data INDexing Strategy (BIND), based on the concept of high-performance parallel computing. BIND supports parallel distribution of data and performs processing in a MapReduce fashion. To develop the BIND strategy, Ian Foster's task-scheduling concept for parallel processing is applied. The proposed indexing strategy was first tested on a 2-node cluster environment, where data sets of varying sizes were used to observe whether performance improves or declines as the size of the data increases. Subsequently, it was tested on a 3-node cluster to observe the performance when the number of computation resources is increased. The results demonstrate that BIND reduces processing and query time compared to the current strategy. The findings have significant implications for efficiently managing Big Data and for facilitating data storage and information retrieval for users and organizations that manage Big Data.
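
To illustrate what index construction "in a MapReduce fashion" can look like, the following is a minimal sketch in Python. It is not the BIND algorithm from the thesis, whose details are not given in this abstract. It only shows the generic pattern of partitioning records, mapping each partition to (key, record id) pairs in parallel, grouping by key, and reducing each group to a posting list. The record layout (monitoring host, remote host, round-trip time), the sample values, and all function names are assumptions made for illustration, and worker threads stand in for cluster nodes.

```python
from collections import defaultdict
from concurrent.futures import ThreadPoolExecutor

# Hypothetical ping-style records: (record_id, monitoring_host, remote_host, rtt_ms).
# The field names and values are illustrative, not taken from the thesis.
RECORDS = [
    (0, "monitor-a", "remote-x", 120.5),
    (1, "monitor-a", "remote-y", 88.0),
    (2, "monitor-b", "remote-x", 131.2),
    (3, "monitor-b", "remote-y", 90.4),
    (4, "monitor-a", "remote-x", 118.9),
]


def partition(records, n_parts):
    """Split records round-robin into n_parts chunks (stand-in for distributing data to nodes)."""
    return [records[i::n_parts] for i in range(n_parts)]


def map_phase(chunk):
    """Map step: emit one (remote_host, record_id) pair per record in the chunk."""
    return [(remote, rec_id) for rec_id, _monitor, remote, _rtt in chunk]


def shuffle(mapped_chunks):
    """Shuffle step: group the emitted record ids by key across all chunks."""
    grouped = defaultdict(list)
    for pairs in mapped_chunks:
        for key, rec_id in pairs:
            grouped[key].append(rec_id)
    return grouped


def reduce_phase(grouped):
    """Reduce step: turn each group into a sorted posting list."""
    return {key: sorted(ids) for key, ids in grouped.items()}


def build_index(records, n_workers=2):
    """Build a key -> record-id index, running the map step on n_workers threads."""
    chunks = partition(records, n_workers)
    with ThreadPoolExecutor(max_workers=n_workers) as pool:
        mapped = list(pool.map(map_phase, chunks))
    return reduce_phase(shuffle(mapped))


def query(index, records_by_id, remote_host):
    """Answer a lookup through the index instead of scanning every record."""
    return [records_by_id[i] for i in index.get(remote_host, [])]


if __name__ == "__main__":
    index = build_index(RECORDS, n_workers=2)
    by_id = {rec[0]: rec for rec in RECORDS}
    print(query(index, by_id, "remote-x"))
```

Changing `n_workers` from 2 to 3 loosely mirrors the move from a 2-node to a 3-node cluster described above: the resulting index is the same, and only the way the map work is divided changes. A real deployment would run the phases on a framework such as Hadoop rather than on local threads.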

Item Type: Thesis (Masters)
Supervisor: Habbal, Adib M. Monzer
Item ID: 5604
Uncontrolled Keywords: Big Data; Processing; Indexing; MapReduce; Information Retrieval
Subjects: Q Science > QA Mathematics > QA299.6-433 Analysis
Divisions: Awang Had Salleh Graduate School of Arts & Sciences
Date Deposited: 16 May 2016 10:18
Last Modified: 18 Mar 2021 00:24
URI: https://etd.uum.edu.my/id/eprint/5604
