Tải bản đầy đủ (.pdf) (466 trang)

Handbook of research on advanced data mining techniques and applications for business intelligence

Bạn đang xem bản rút gọn của tài liệu. Xem và tải ngay bản đầy đủ của tài liệu tại đây (19.02 MB, 466 trang )


Handbook of Research
on Advanced Data
Mining Techniques and
Applications for Business
Intelligence
Shrawan Kumar Trivedi
BML Munjal University, India
Shubhamoy Dey
Indian Institute of Management Indore, India
Anil Kumar
BML Munjal University, India
Tapan Kumar Panda
Jindal Global Business School, India

A volume in the Advances in Business
Information Systems and Analytics (ABISA)
Book Series


Published in the United States of America by
IGI Global
Business Science Reference (an imprint of IGI Global)
701 E. Chocolate Avenue
Hershey PA, USA 17033
Tel: 717-533-8845
Fax: 717-533-8661
E-mail: cust@igi-global.com
Web site: http://www.igi-global.com
Copyright © 2017 by IGI Global. All rights reserved. No part of this publication may be reproduced, stored or distributed in
any form or by any means, electronic or mechanical, including photocopying, without written permission from the publisher.


Product or company names used in this set are for identification purposes only. Inclusion of the names of the products or
companies does not indicate a claim of ownership by IGI Global of the trademark or registered trademark.

Library of Congress Cataloging-in-Publication Data
CIP Data Pending
ISBN: 978-1-5225-2031-3
eISBN: 978-1-5225-2032-0
This book is published in the IGI Global book series Advances in Business Information Systems and Analytics (ABISA)
(ISSN: 2327-3275; eISSN: 2327-3283)
British Cataloguing in Publication Data
A Cataloguing in Publication record for this book is available from the British Library.
All work contributed to this book is new, previously-unpublished material. The views expressed in this book are those of the
authors, but not necessarily of the publisher.
For electronic access to this publication, please contact: eresources@igi-global.com.


Advances in Business
Information Systems and
Analytics (ABISA) Book Series
Madjid Tavana
La Salle University, USA

ISSN:2327-3275
EISSN:2327-3283
Mission

The successful development and management of information systems and business analytics is crucial
to the success of an organization. New technological developments and methods for data analysis have
allowed organizations to not only improve their processes and allow for greater productivity, but have
also provided businesses with a venue through which to cut costs, plan for the future, and maintain

competitive advantage in the information age.
The Advances in Business Information Systems and Analytics (ABISA) Book Series aims to present
diverse and timely research in the development, deployment, and management of business information
systems and business analytics for continued organizational development and improved business value.

Coverage

• Decision Support Systems
• Legal information systems
• Business Intelligence
• Data Analytics
• Business Process Management
• Business Information Security
• Management information systems
• Data Management
• Strategic Information Systems
• Statistics

IGI Global is currently accepting manuscripts
for publication within this series. To submit a proposal for a volume in this series, please contact our
Acquisition Editors at Acquisitions@igi-global.com
or visit: http://www.igi-global.com/publish/.

The Advances in Business Information Systems and Analytics (ABISA) Book Series (ISSN 2327-3275) is published by IGI Global, 701
E. Chocolate Avenue, Hershey, PA 17033-1240, USA, www.igi-global.com. This series is composed of titles available for purchase individually; each title is edited to be contextually exclusive from any other title within the series. For pricing and ordering information please visit
http://www.igi-global.com/book-series/advances-business-information-systems-analytics/37155. Postmaster: Send all address changes to above
address. Copyright © 2017 IGI Global. All rights, including translation in other languages reserved by the publisher. No part of this series
may be reproduced or used in any form or by any means – graphics, electronic, or mechanical, including photocopying, recording, taping,
or information and retrieval systems – without written permission from the publisher, except for non commercial, educational use, including
classroom teaching purposes. The views expressed in this series are those of the authors, but not necessarily of IGI Global.



Titles in this Series

For a list of additional titles in this series, please visit: www.igi-global.com

Business Analytics and Cyber Security Management in Organizations
Rajagopal (EGADE Business School, Tecnologico de Monterrey, Mexico City, Mexico & Boston University, USA)
and Ramesh Behl (International Management Institute, Bhubaneswar, India)
Business Science Reference • copyright 2017 • 346pp • H/C (ISBN: 9781522509028) • US $215.00 (our price)
Handbook of Research on Intelligent Techniques and Modeling Applications in Marketing Analytics
Anil Kumar (BML Munjal University, India) Manoj Kumar Dash (ABV-Indian Institute of Information Technology and Management, India) Shrawan Kumar Trivedi (BML Munjal University, India) and Tapan Kumar Panda
(BML Munjal University, India)
Business Science Reference • copyright 2017 • 428pp • H/C (ISBN: 9781522509974) • US $275.00 (our price)
Applied Big Data Analytics in Operations Management
Manish Kumar (Indian Institute of Information Technology, Allahabad, India)
Business Science Reference • copyright 2017 • 251pp • H/C (ISBN: 9781522508861) • US $160.00 (our price)
Eye-Tracking Technology Applications in Educational Research
Christopher Was (Kent State University, USA) Frank Sansosti (Kent State University, USA) and Bradley Morris
(Kent State University, USA)
Information Science Reference • copyright 2017 • 370pp • H/C (ISBN: 9781522510055) • US $205.00 (our price)
Strategic IT Governance and Alignment in Business Settings
Steven De Haes (Antwerp Management School, University of Antwerp, Belgium) and Wim Van Grembergen
(Antwerp Management School, University of Antwerp, Belgium)
Business Science Reference • copyright 2017 • 298pp • H/C (ISBN: 9781522508618) • US $195.00 (our price)
Organizational Productivity and Performance Measurements Using Predictive Modeling and Analytics
Madjid Tavana (La Salle University, USA) Kathryn Szabat (La Salle University, USA) and Kartikeya Puranam
(La Salle University, USA)
Business Science Reference • copyright 2017 • 400pp • H/C (ISBN: 9781522506546) • US $205.00 (our price)
Data Envelopment Analysis and Effective Performance Assessment

Farhad Hossein Zadeh Lotfi (Islamic Azad University, Iran) Seyed Esmaeil Najafi (Islamic Azad University, Iran)
and Hamed Nozari (Islamic Azad University, Iran)
Business Science Reference • copyright 2017 • 365pp • H/C (ISBN: 9781522505969) • US $160.00 (our price)

701 E. Chocolate Ave., Hershey, PA 17033
Order online at www.igi-global.com or call 717-533-8845 x100
To place a standing order for titles released in this series, contact: cust@igi-global.com
Mon-Fri 8:00 am - 5:00 pm (est) or fax 24 hours a day 717-533-8661


Editorial Advisory Board
Ankita Tripathi, Amity University Gurgaon – Haryana, India

List of Reviewers
A. Sheik Abdullah, Thiagarajar College of Engineering, India
A. M. Abirami, Thiagarajar College of Engineering, India
M. Afshar Alam, Jamia Hamdard University, India
Tamizh Arasi, VIT University, India
A. Askarunisa, KLN College of Information Technology, India
Balamurugan Balusamy, VIT University, India
Yi Chai, Chongqing University, China
A. A. Chari, Rayalaseema University, India
T. K. Das, VIT University, India
Hirak Dasgupta, Symbiosis International University, India
Sanjiva Shankar Dubey, SSD Consulting, India
G. R. Gangadharan, University of Hyderabad, India
Belay Gebremeskel, Chongqing University, China
Rashik Gupta, BML Munjal University, India
Zhongshi He, Chongqing University, China
Priya Jha, VIT University, India

Ponnuru Ramalinga Karteek, BML Munjal University, India
Kaushik Kumar, Birla Institute of Technology Mesra, India
Raghvendra Kumar, Lakshmi Narain College of Technology Jabalpur, India
C. Mahalakshmi, Thiagarajar College of Engineering, India
Amir Manzoor, Bahria University, Pakistan
Vinod Kumar Mishra, Bipin Tripathi Kumaon Institute of Technology, India
Priyanka Pandey, Lakshmi Narain College of Technology Jabalpur, India
Prasant Kumar Pattnaik, KIIT University, India
S. Rajaram, Thiagarajar College of Engineering, India
Vadlamani Ravi, University of Hyderabad, India
Supriyo Roy, Birla Institute of Technology Mesra, India





Hanna Sawicka, Poznan University of Technology, Poland
S. Selvakumar, G. K. M. College of Engineering and Technology, India
Nita H. Shah, Gujarat University, India
Pourya Shamsolmoali, CMCC, Italy
Arunesh Sharan, AS Consulting, India
G. Sreedhar, Rastriya Sanskrit Vidhya Pheet University, India
Timmaraju Srimanyu, University of Hyderabad, India
R. Suganya, Thiagarajar College of Engineering, India
K. Suneetha, Jawaharlal Nehru Technological University, India
Himanshu Tiruwa, Bipin Tripathi Kumaon Institute of Technology, India
Khadija Ali Vakeel, Indian Institute of Management Indore, India
Malathi Velu, VIT University, India
Masoumeh Zareapoor, Shanghai Jiao Tong University, China



List of Contributors

Abdullah, A. Sheik / Thiagarajar College of Engineering, India.............................................. 1,34,162
Abirami, A. M. / Thiagarajar College of Engineering, India......................................................... 1,162
Alam, M. Afshar / Jamia Hamdard University, India.......................................................................... 62
Arasi, Tamizh / VIT University, India............................................................................................... 259
Askarunisa, A. / KLN College of Information Technology, India..................................................... 162
Balusamy, Balamurugan / VIT University, India............................................................................. 259
Biswas, Animesh / University of Kalyani, India................................................................................ 353
Chari, A. Anandaraja / Rayalaseema University, India.................................................................... 298
Das, T. K. / VIT University, India........................................................................................................ 142
Dasgupta, Hirak / Symbiosis Institute of Management Studies, India................................................ 15
De, Arnab Kumar / Government College of Engineering and Textile Technology, India................. 353
Dubey, Sanjiva Shankar / BIMTECH Greater Noida, India............................................................. 209
Gangadharan, G. R. / Institute for Development and Research in Banking Technology, India........ 379
Gebremeskel, Gebeyehu Belay / Chongqing University, China.......................................................... 90
Gupta, Rashik / BML Munjal University, India................................................................................ 192
He, Zhongshi / Chongqing University, China...................................................................................... 90
Jha, Priya / VIT University, India...................................................................................................... 259
Kumar, Kaushik / Birla Institute of Technology, India..................................................................... 284
Kumar, Raghvendra / LNCT College, India....................................................................................... 52
Mahalakshmi, C. / Thiagarajar College of Engineering, India........................................................ 162
Manzoor, Amir / Bahria University, Pakistan............................................................................ 128,225
Mishra, Vinod Kumar / Bipin Tripathi Kumaon Institute of Technology, India............................... 175
Pandey, Priyanka / LNCT College, India............................................................................................ 52
Pattnaik, Prasant Kumar / KIIT University, India.............................................................................. 52
Ponnuru, Karteek Ramalinga / BML Munjal University, India....................................................... 192
Rajaram, S. / Thiagarajar College of Engineering, India................................................................... 34
Ravi, Vadlamani / Institute for Development and Research in Banking Technology, India............. 379

Roy, Supriyo / Birla Institute of Technology, India........................................................................... 284
Sawicka, Hanna / Poznan University of Technology, Poland............................................................ 315
Selvakumar, S. / G. K. M. College of Engineering and Technology, India............................... 1,34,162
Shah, Nita H. / Gujarat University, India........................................................................................... 341
Shamsolmoali, Pourya / CMCC, Italy................................................................................................ 62
Sharan, Arunesh / AS Consulting, India........................................................................................... 209
Sreedhar, G. / Rashtriya Sanskrit Vidyapeetha (Deemed University), India..................................... 298
Suganya, R. / Thiagarajar College of Engineering, India................................................................... 34





Suneetha, Keerthi / SVEC, India....................................................................................................... 240
Timmaraju, Srimanyu / Institute for Development and Research in Banking Technology,
India.............................................................................................................................................. 379
Tiruwa, Himanshu / Bipin Tripathi Kumaon Institute of Technology, India.................................... 175
Trivedi, Shrawan Kumar / BML Munjal University, India............................................................... 192
Vakeel, Khadija Ali / Indian Institute of Management Indore, India................................................. 250
Velu, Malathi / VIT University, India................................................................................................ 259
Yi, Chai / Chongqing University, China.............................................................................................. 90
Zareapoor, Masoumeh / Shanghai Jiao Tong University, China....................................................... 62


Table of Contents

Preface................................................................................................................................................. xxii
Acknowledgment............................................................................................................................... xxvi
Section 1
Business Intelligence With Data Mining: Process and Applications

Chapter 1
An Introduction to Data Analytics: Its Types and Its Applications......................................................... 1
A. Sheik Abdullah, Thiagarajar College of Engineering, India
S. Selvakumar, G. K. M. College of Engineering and Technology, India
A. M. Abirami, Thiagarajar College of Engineering, India
Chapter 2
Data Mining and Statistics: Tools for Decision Making in the Age of Big Data.................................. 15
Hirak Dasgupta, Symbiosis Institute of Management Studies, India
Chapter 3
Data Classification: Its Techniques and Big Data.................................................................................. 34
A. Sheik Abdullah, Thiagarajar College of Engineering, India
R. Suganya, Thiagarajar College of Engineering, India
S. Selvakumar, G. K. M. College of Engineering and Technology, India
S. Rajaram, Thiagarajar College of Engineering, India
Chapter 4
Secure Data Analysis in Clusters (Iris Database).................................................................................. 52
Raghvendra Kumar, LNCT College, India
Prasant Kumar Pattnaik, KIIT University, India
Priyanka Pandey, LNCT College, India
Chapter 5
Data Mining for Secure Online Payment Transaction........................................................................... 62
Masoumeh Zareapoor, Shanghai Jiao Tong University, China
Pourya Shamsolmoali, CMCC, Italy
M. Afshar Alam, Jamia Hamdard University, India





Chapter 6

The Integral of Spatial Data Mining in the Era of Big Data: Algorithms and Applications................. 90
Gebeyehu Belay Gebremeskel, Chongqing University, China
Chai Yi, Chongqing University, China
Zhongshi He, Chongqing University, China
Section 2
Social Media Analytics With Sentiment Analysis: Business Applications and Methods
Chapter 7
Social Media as Mirror of Society....................................................................................................... 128
Amir Manzoor, Bahria University, Pakistan
Chapter 8
Business Intelligence through Opinion Mining................................................................................... 142
T. K. Das, VIT University, India
Chapter 9
Sentiment Analysis.............................................................................................................................. 162
A. M. Abirami, Thiagarajar College of Engineering, India
A. Sheik Abdullah, Thiagarajar College of Engineering, India
A. Askarunisa, KLN College of Information Technology, India
S. Selvakumar, G. K. M. College of Engineering and Technology, India
C. Mahalakshmi, Thiagarajar College of Engineering, India
Chapter 10
Aspect-Based Sentiment Analysis of Online Product Reviews........................................................... 175
Vinod Kumar Mishra, Bipin Tripathi Kumaon Institute of Technology, India
Himanshu Tiruwa, Bipin Tripathi Kumaon Institute of Technology, India
Chapter 11
Sentiment Analysis with Social Media Analytics, Methods, Process, and Applications.................... 192
Karteek Ramalinga Ponnuru, BML Munjal University, India
Rashik Gupta, BML Munjal University, India
Shrawan Kumar Trivedi, BML Munjal University, India
Chapter 12
Organizational Issue for BI Success: Critical Success Factors for BI Implementations within the

Enterprise............................................................................................................................................. 209
Sanjiva Shankar Dubey, BIMTECH Greater Noida, India
Arunesh Sharan, AS Consulting, India
Chapter 13
Ethics of Social Media Research......................................................................................................... 225
Amir Manzoor, Bahria University, Pakistan




Section 3
Big Data Analytics: Its Methods and Applications
Chapter 14
Big Data Analytics in Health Care....................................................................................................... 240
Keerthi Suneetha, SVEC, India
Chapter 15
Mining Big Data for Marketing Intelligence....................................................................................... 250
Khadija Ali Vakeel, Indian Institute of Management Indore, India
Chapter 16
Predictive Analysis for Digital Marketing Using Big Data: Big Data for Predictive Analysis........... 259
Balamurugan Balusamy, VIT University, India
Priya Jha, VIT University, India
Tamizh Arasi, VIT University, India
Malathi Velu, VIT University, India
Chapter 17
Strategic Best-in-Class Performance for Voice to Customer: Is Big Data in Logistics a Perfect
Match?.................................................................................................................................................. 284
Supriyo Roy, Birla Institute of Technology, India
Kaushik Kumar, Birla Institute of Technology, India
Section 4

Advanced Data Analytics: Decision Models and Business Applications
Chapter 18
First Look on Web Mining Techniques to Improve Business Intelligence of E-Commerce
Applications......................................................................................................................................... 298
G. Sreedhar, Rashtriya Sanskrit Vidyapeetha (Deemed University), India
A. Anandaraja Chari, Rayalaseema University, India
Chapter 19
Artificial Intelligence in Stochastic Multiple Criteria Decision Making............................................. 315
Hanna Sawicka, Poznan University of Technology, Poland
Chapter 20
Joint Decision for Price Competitive Inventory Model with Time-Price and Credit Period
Dependent Demand.............................................................................................................................. 341
Nita H. Shah, Gujarat University, India
Chapter 21
On Development of a Fuzzy Stochastic Programming Model with Its Application to Business
Management......................................................................................................................................... 353
Animesh Biswas, University of Kalyani, India
Arnab Kumar De, Government College of Engineering and Textile Technology, India




Chapter 22
Ranking of Cloud Services Using Opinion Mining and Multi-Attribute Decision Making:
Ranking of Cloud Services Using Opinion Mining and MADM........................................................ 379
Srimanyu Timmaraju, Institute for Development and Research in Banking Technology, India
Vadlamani Ravi, Institute for Development and Research in Banking Technology, India
G. R. Gangadharan, Institute for Development and Research in Banking Technology, India
Compilation of References................................................................................................................ 397
About the Contributors..................................................................................................................... 427

Index.................................................................................................................................................... 436


Detailed Table of Contents

Preface................................................................................................................................................. xxii
Acknowledgment............................................................................................................................... xxvi
Section 1
Business Intelligence With Data Mining: Process and Applications
Chapter 1
An Introduction to Data Analytics: Its Types and Its Applications......................................................... 1
A. Sheik Abdullah, Thiagarajar College of Engineering, India
S. Selvakumar, G. K. M. College of Engineering and Technology, India
A. M. Abirami, Thiagarajar College of Engineering, India
Data analytics mainly deals with the science of examining and investigating raw data to derive useful
patterns and inference. Data analytics has been deployed in many of the industries to make decisions
at proper levels. It focuses upon the assumption and evaluation of the method with the intention of
deriving a conclusion at various levels. Various types of data analytical techniques such as predictive
analytics, prescriptive analytics, descriptive analytics, text analytics, and social media analytics are used
by industrial organizations, educational institutions and by government associations. This context mainly
focuses towards the illustration of contextual examples for various types of analytical techniques and
its applications.
Chapter 2
Data Mining and Statistics: Tools for Decision Making in the Age of Big Data.................................. 15
Hirak Dasgupta, Symbiosis Institute of Management Studies, India
In the age of information, the world abounds with data. In order to obtain an intelligent appreciation of
current developments, we need to absorb and interpret substantial amounts of data. The amount of data
collected has grown at a phenomenal rate over the past few years. The computer age has given us both
the power to rapidly process, summarize and analyse data and the encouragement to produce and store
more data. The aim of data mining is to make sense of large amounts of mostly unsupervised data, in

some domain. Data Mining is used to discover the patterns and relationships in data, with an emphasis
on large observational data bases. This chapter aims to compare the approaches and conclude that
Statisticians and Data miners can profit by studying each other’s methods by using the combination of
methods judiciously. The chapter also attempts to discuss data cleaning techniques involved in data mining.






Chapter 3
Data Classification: Its Techniques and Big Data.................................................................................. 34
A. Sheik Abdullah, Thiagarajar College of Engineering, India
R. Suganya, Thiagarajar College of Engineering, India
S. Selvakumar, G. K. M. College of Engineering and Technology, India
S. Rajaram, Thiagarajar College of Engineering, India
Classification is considered to be the one of the data analysis technique which can be used over many
applications. Classification model predicts categorical continuous class labels. Clustering mainly deals
with grouping of variables based upon similar characteristics. Classification models are experienced by
comparing the predicted values to that of the known target values in a set of test data. Data classification has
many applications in business modeling, marketing analysis, credit risk analysis; biomedical engineering
and drug retort modeling. The extension of data analysis and classification makes the insight into big
data with an exploration to processing and managing large data sets. This chapter deals with various
techniques, methodologies that correspond to the classification problem in data analysis process and its
methodological impacts to big data.
Chapter 4
Secure Data Analysis in Clusters (Iris Database).................................................................................. 52
Raghvendra Kumar, LNCT College, India
Prasant Kumar Pattnaik, KIIT University, India
Priyanka Pandey, LNCT College, India

This chapter used privacy preservation techniques (Data Modification) to ensure Privacy. Privacy
preservation is another important issue. A picture, where number of clients owning their clustered databases
(Iris Database) wish to run a data mining algorithm on the union of their databases, without revealing
any unnecessary information and requires the privacy of the privileged information. There are numbers
of efficient protocols are required for privacy preserving in data mining. This chapter presented various
privacy preserving protocols that are used for security in clustered databases. The Xln(X) protocol and
the secure sum protocol are used in mutual computing, which can defend privacy efficiently. Its focuses
on the data modification techniques, where it has been modified our distributed database and after that
sanded that modified data set to the client admin for secure data communication with zero percentage
of data leakage and also reduce the communication and computation complexity.
Chapter 5
Data Mining for Secure Online Payment Transaction........................................................................... 62
Masoumeh Zareapoor, Shanghai Jiao Tong University, China
Pourya Shamsolmoali, CMCC, Italy
M. Afshar Alam, Jamia Hamdard University, India
The fraud detection method requires a holistic approach where the objective is to correctly classify the
transactions as legitimate or fraudulent. The existing methods give importance to detect all fraudulent
transactions since it results in money loss. For this most of the time, they have to compromise on some
genuine transactions. Thus, the major issue that the credit card fraud detection systems face today is that
a significant percentage of transactions labelled as fraudulent are in fact legitimate. These “false alarms”
delay the transactions and creates inconvenience and dissatisfaction to the customer. Thus, the objective




of this research is to develop an intelligent data mining based fraud detection system for secure online
payment transaction system. The performance evaluation of the proposed model is done on real credit
card dataset and it is found that the proposed model has high fraud detection rate and less false alarm
rate than other state-of-the-art classifiers.
Chapter 6

The Integral of Spatial Data Mining in the Era of Big Data: Algorithms and Applications................. 90
Gebeyehu Belay Gebremeskel, Chongqing University, China
Chai Yi, Chongqing University, China
Zhongshi He, Chongqing University, China
Data Mining (DM) is a rapidly expanding field in many disciplines, and it is greatly inspiring to analyze
massive data types, which includes geospatial, image and other forms of data sets. Such the fast growths
of data characterized as high volume, velocity, variety, variability, value and others that collected and
generated from various sources that are too complex and big to capturing, storing, and analyzing and
challenging to traditional tools. The SDM is, therefore, the process of searching and discovering valuable information and knowledge in large volumes of spatial data, which draws basic principles from
concepts in databases, machine learning, statistics, pattern recognition and ‘soft’ computing. Using DM
techniques enables a more efficient use of the data warehouse. It is thus becoming an emerging research
field in Geosciences because of the increasing amount of data, which lead to new promising applications. The integral SDM in which we focused in this chapter is the inference to geospatial and GIS data.
Section 2
Social Media Analytics With Sentiment Analysis: Business Applications and Methods
Chapter 7
Social Media as Mirror of Society....................................................................................................... 128
Amir Manzoor, Bahria University, Pakistan
Over the last decade, social media use has gained much attention of scholarly researchers. One specific
reason of this interest is the use of social media for communication; a trend that is gaining tremendous
popularity. Every social media platform has developed its own set of application programming interface
(API). Through these APIs, the data available on a particular social media platform can be accessed.
However, the data available is limited and it is difficult to ascertain the possible conclusions that can
be drawn about society on the basis of this data. This chapter explores the ways social researchers and
scientists can use social media data to support their research and analysis.
Chapter 8
Business Intelligence through Opinion Mining................................................................................... 142
T. K. Das, VIT University, India
Business organizations have been adopting different strategies to impress upon their customers and attract
them towards their products and services. On the other hand, the opinions of the customers gathered through
customer feedbacks have been a great source of information for companies to evolve business intelligence

to rightly place their products and services to meet the ever-changing customer requirements. In this work,
we present a new approach to integrate customers’ opinions into the traditional data warehouse model.
We have taken Twitter as the data source for this experiment. First, we have built a system which can be




used for opinion analysis on a product or a service. The second process is to model the opinion table so
obtained as a dimensional table and to integrate it with a central data warehouse schema so that reports
can be generated on demand. Furthermore, we have shown how business intelligence can be elicited from
online product reviews by using computational intelligence technique like rough set base data analysis.
Chapter 9
Sentiment Analysis.............................................................................................................................. 162
A. M. Abirami, Thiagarajar College of Engineering, India
A. Sheik Abdullah, Thiagarajar College of Engineering, India
A. Askarunisa, KLN College of Information Technology, India
S. Selvakumar, G. K. M. College of Engineering and Technology, India
C. Mahalakshmi, Thiagarajar College of Engineering, India
It requires sophisticated streaming of big data processing to process the billions of daily social
conversations across millions of sources. Dataset needs information extraction from them and it requires
contextual semantic sentiment modeling to capture the intelligence through the complexity of online
social discussions. Sentiment analysis is one of the techniques to capture the intelligence from Social
Networks based on the user generated content. There are more and more researches evolving about
sentiment classification. Aspect extraction is the core task involved in aspect based sentiment analysis.
The proposed modeling uses Latent Semantic Analysis technique for aspect extraction and evaluates
senti-scores of various products under study.
Chapter 10
Aspect-Based Sentiment Analysis of Online Product Reviews........................................................... 175
Vinod Kumar Mishra, Bipin Tripathi Kumaon Institute of Technology, India
Himanshu Tiruwa, Bipin Tripathi Kumaon Institute of Technology, India

Sentiment analysis is a part of computational linguistics concerned with extracting sentiment and emotion
from text. It is also considered as a task of natural language processing and data mining. Sentiment
analysis mainly concentrate on identifying whether a given text is subjective or objective and if it is
subjective, then whether it is negative, positive or neutral. This chapter provide an overview of aspect
based sentiment analysis with current and future trend of research on aspect based sentiment analysis.
This chapter also provide a aspect based sentiment analysis of online customer reviews of Nokia 6600.
To perform aspect based classification we are using lexical approach on eclipse platform which classify
the review as a positive, negative or neutral on the basis of features of product. The Sentiwordnet is used
as a lexical resource to calculate the overall sentiment score of each sentence, pos tagger is used for
part of speech tagging, frequency based method is used for extraction of the aspects/features and used
negation handling for improving the accuracy of the system.
Chapter 11
Sentiment Analysis with Social Media Analytics, Methods, Process, and Applications.................... 192
Karteek Ramalinga Ponnuru, BML Munjal University, India
Rashik Gupta, BML Munjal University, India
Shrawan Kumar Trivedi, BML Munjal University, India
Firms are turning their eye towards social media analytics to get to know what people are really talking
about their firm or their product. With the huge amount of buzz being created online about anything and




everything social media has become ‘the’ platform of the day to understand what public on a whole are
talking about a particular product and the process of converting all the talking into valuable information
is called Sentiment Analysis. Sentiment Analysis is a process of identifying and categorizing a piece of
text into positive or negative so as to understand the sentiment of the users. This chapter would take the
reader through basic sentiment classifiers like building word clouds, commonality clouds, dendrograms
and comparison clouds to advanced algorithms like K Nearest Neighbour, Naïve Biased Algorithm and
Support Vector Machine.
Chapter 12

Organizational Issue for BI Success: Critical Success Factors for BI Implementations within the
Enterprise............................................................................................................................................. 209
Sanjiva Shankar Dubey, BIMTECH Greater Noida, India
Arunesh Sharan, AS Consulting, India
This chapter will focus on the transformative effect Business Intelligence (BI) brings to an organization
decision making, enhancing its performance, reducing overall cost of operations and improving its
competitive posture. This chapter will enunciate the key principles and practices to bridge the gap between
organization requirements vs. capabilities of any BI tool(s) by proposing a framework of organizational
factors such as user’s role, their analytical needs, access preferences and technical /analytical literacy
etc. Evaluation methodology to select best BI tools properly aligned to the organization infrastructure
will also be discussed. Softer issues and organizational change for successful implementation of BI will
be further explained.
Chapter 13
Ethics of Social Media Research......................................................................................................... 225
Amir Manzoor, Bahria University, Pakistan
Over the last decade, social media platforms have become a very popular channel of communication.
This popularity has sparked an increasing interest among researchers to investigate the social media
communication. Many studies have been done that collected the publicly available social media
communication data to unearth significant patterns. However, one significant concern raised over such
practice is the privacy of the individual’s social media communication data. As such it is important that
specific ethical guidelines are in place for future researches on social media sites. This chapter explores
various ethical issues related to researches related to social networking sites. The chapter also provides
a set of ethical guidelines that future researches on social media sites can use to address various ethical
issues.
Section 3
Big Data Analytics: Its Methods and Applications
Chapter 14
Big Data Analytics in Health Care....................................................................................................... 240
Keerthi Suneetha, SVEC, India
With the arrival of technology and rising amount of data (Big Data) there is a need towards implementation

of effective analytical techniques (Big Data Analytics) in health sector which provides stakeholders with
new insights that have the potential to advance personalized care to improve patient outcomes and avoid




unnecessary costs. This chapter covers how to evaluate this big volume of data for unknown and useful
facts, associations, patterns, trends which can give birth to new line of handling of diseases and provide
high quality healthcare at lower cost to all citizens. This chapter gives a wide insight of introduction to
Big Data Analytics in health domain, processing steps of BDA, Challenges and Future scope of research
in healthcare.
Chapter 15
Mining Big Data for Marketing Intelligence....................................................................................... 250
Khadija Ali Vakeel, Indian Institute of Management Indore, India
This chapter elaborates on mining techniques useful in big data analysis. Specifically, it will elaborate
on how to use association rule mining, self organizing maps, word cloud, sentiment extraction, network
analysis, classification, and clustering for marketing intelligence. The application of these would be on
decisions related to market segmentation, targeting and positioning, trend analysis, sales, stock markets
and word of mouth. The chapter is divided in two sections of data collection and cleaning where we
elaborate on how twitter data can be extracted and mined for marketing decision making. Second part
discusses various techniques that can be used in big data analysis for mining content and interaction
network.
Chapter 16
Predictive Analysis for Digital Marketing Using Big Data: Big Data for Predictive Analysis........... 259
Balamurugan Balusamy, VIT University, India
Priya Jha, VIT University, India
Tamizh Arasi, VIT University, India
Malathi Velu, VIT University, India
Big data analytics in recent years had developed lightning fast applications that deal with predictive
analysis of huge volumes of data in domains of finance, health, weather, travel, marketing and more.

Business analysts take their decisions using the statistical analysis of the available data pulled in from
social media, user surveys, blogs and internet resources. Customer sentiment has to be taken into account
for designing, launching and pricing a product to be inducted into the market and the emotions of the
consumers changes and is influenced by several tangible and intangible factors. The possibility of using
Big data analytics to present data in a quickly viewable format giving different perspectives of the same
data is appreciated in the field of finance and health, where the advent of decision support system is
possible in all aspects of their working. Cognitive computing and artificial intelligence are making big
data analytical algorithms to think more on their own, leading to come out with Big data agents with
their own functionalities.
Chapter 17
Strategic Best-in-Class Performance for Voice to Customer: Is Big Data in Logistics a Perfect
Match?.................................................................................................................................................. 284
Supriyo Roy, Birla Institute of Technology, India
Kaushik Kumar, Birla Institute of Technology, India
For any forward-looking perspective, organizational information which is typically historical, incomplete
and most of the time inaccurate, needs to be enriched with external information. However, traditional
systems and approaches are slow, inflexible and cannot handle new volume and complexity of information.




Big data, an evolving term, basically refers to voluminous amount of structured, semi-structured or
unstructured information in the form of data with a potential to be mined for ‘best in class information’.
Primarily, big data can be categorized by 3V’s: volume, variety and velocity. Recent hype around big
data concepts predicts that it will help companies to improve operations and makes faster and intelligent
decisions. Considering the complexities in realms of supply chain, in this study, an attempt has been
made to highlight the problems in storing data in any business, especially under Indian scenario where
logistics arena is most unstructured and complicated. Conclusion may be significant to any strategic
decision maker / manager working with distribution and logistics.
Section 4

Advanced Data Analytics: Decision Models and Business Applications
Chapter 18
First Look on Web Mining Techniques to Improve Business Intelligence of E-Commerce
Applications......................................................................................................................................... 298
G. Sreedhar, Rashtriya Sanskrit Vidyapeetha (Deemed University), India
A. Anandaraja Chari, Rayalaseema University, India
Web Data Mining is the application of data mining techniques to extract useful knowledge from web data
like contents of web, hyperlinks of documents and web usage logs. There is also a strong requirement of
techniques to help in business decision in e-commerce. Web Data Mining can be broadly divided into
three categories: Web content mining, Web structure mining and Web usage mining. Web content data
are content availed to users to satisfy their required information. Web structure data represents linkage
and relationship of web pages to others. Web usage data involves log data collected by web server and
application server which is the main source of data. The growth of WWW and technologies has made
business functions to be executed fast and easier. As large amount of transactions are performed through
e-commerce sites and the huge amount of data is stored, valuable knowledge can be obtained by applying
the Web Mining techniques.
Chapter 19
Artificial Intelligence in Stochastic Multiple Criteria Decision Making............................................. 315
Hanna Sawicka, Poznan University of Technology, Poland
This chapter presents the concept of stochastic multiple criteria decision making (MCDM) method to
solve complex ranking decision problems. This approach is composed of three main areas of research,
i.e. classical MCDM, probability theory and classification method. The most important steps of the idea
are characterized and specific features of the applied methods are briefly presented. The application
of Electre III combined with probability theory, and Promethee II combined with Bayes classifier are
described in details. Two case studies of stochastic multiple criteria decision making are presented. The
first one shows the distribution system of electrotechnical products, composed of 24 distribution centers
(DC), while the core business of the second one is the production and warehousing of pharmaceutical
products. Based on the application of presented stochastic MCDM method, different ways of improvements
of these complex systems are proposed and the final i.e. the best paths of changes are recommended.





Chapter 20
Joint Decision for Price Competitive Inventory Model with Time-Price and Credit Period
Dependent Demand.............................................................................................................................. 341
Nita H. Shah, Gujarat University, India
The problem analyzes a supply chain comprised of two front-runner retailers and one supplier. The
retailers’ offer customers delay in payments to settle the accounts against the purchases which is received
by the supplier. The market demand of the retailer depends on time, retail price and a credit period offered
to the customers with that of the other retailer. The supplier gives items with same wholesale price and
credit period to the retailers. The joint and independent decisions are analyzed and validated numerically.
Chapter 21
On Development of a Fuzzy Stochastic Programming Model with Its Application to Business
Management......................................................................................................................................... 353
Animesh Biswas, University of Kalyani, India
Arnab Kumar De, Government College of Engineering and Textile Technology, India
This chapter expresses efficiency of fuzzy goal programming for multiobjective aggregate production
planning in fuzzy stochastic environment. The parameters of the objectives are taken as normally
distributed fuzzy random variables and the chance constraints involve joint Cauchy distributed fuzzy
random variables. In model formulation process the fuzzy chance constrained programming model is
converted into its equivalent fuzzy programming using probabilistic technique, α-cut of fuzzy numbers
and taking expectation of parameters of the objectives. Defuzzification technique of fuzzy numbers
is used to find multiobjective linear programming model. Membership function of each objective is
constructed depending on their optimal values. Afterwards a weighted fuzzy goal programming model
is developed to achieve the highest degree of each of the membership goals to the extent possible by
minimizing group regrets in a multiobjective decision making context. To explore the potentiality of the
proposed approach, production planning of a health drinks manufacturing company has been considered.
Chapter 22
Ranking of Cloud Services Using Opinion Mining and Multi-Attribute Decision Making:

Ranking of Cloud Services Using Opinion Mining and MADM........................................................ 379
Srimanyu Timmaraju, Institute for Development and Research in Banking Technology, India
Vadlamani Ravi, Institute for Development and Research in Banking Technology, India
G. R. Gangadharan, Institute for Development and Research in Banking Technology, India
Cloud computing has been a major focus of business organizations around the world. Many applications
are getting migrated to the cloud and many new applications are being developed to run on the cloud.
There are already more than 100 cloud service providers in the market offering various cloud services.
As the number of cloud services and providers is increasing in the market, it is very important to select
the right provider and service for deploying an application. This paper focuses on recommendation of
cloud services by ranking them with the help of opinion mining of users’ reviews and multi-attribute
decision making models (TOPSIS and FMADM were applied separately) in tandem on both quantitative
and qualitative data. Surprisingly, both TOPSIS and FMADM yielded the same rankings for the cloud
services.




Compilation of References................................................................................................................ 397
About the Contributors..................................................................................................................... 427
Index.................................................................................................................................................... 436


xxii

Preface

The complete work of this book is divided into four sections. The first section titled “Business Intelligence
with Data Mining: Process and Applications” includes all the chapters related to business Analytics
with data mining and its applications. The second section titled “Social Media Analytics with Sentiment
Analysis: Business Applications and Methods” contains all the chapters related to social media analytics

techniques and its applications of business intelligence. In the third section titled “Big Data Analytics:
Its Methods and Applications” covers all the chapters related to big data processes and its applications.
The last section includes the chapters related to advance decision models for business analytics titled as
“Advance Data Analytics: Decision Models and Business Applications”. The brief description of each
section as follows:
The first section of this book is “Business Intelligence With Data Mining: Process and Applications”
where the chapters related to data mining methods and its applications have been discussed. The first
chapter of this section authored by A. Sheik Abdullah, S. Selvakumar, and A. M. Abirami, explains
about data analytics where they explain Data analytics mainly deals with the science of examining and
investigating raw data to derive useful patterns and inference. Data analytics has been deployed in many
of the industries to make decisions at proper levels. It focuses upon the assumption and evaluation of
the method with the intention of deriving a conclusion at various levels. Various types of data analytical
techniques such as predictive analytics, prescriptive analytics, descriptive analytics, text analytics, and
social media analytics are used by industrial organizations, educational institutions and by government
associations. This context mainly focuses towards the illustration of contextual examples for various types
of analytical techniques and its applications. In the second chapter, Hirak Dasgupta aims to compare the
approaches and conclude that statisticians and data miners can profit by studying each other’s methods
by using the combination of methods judiciously. The chapter also attempts to discuss data cleaning
techniques involved in data mining. The third chapter of this section authored by A. Sheik Abdullah, R.
Suganya, S. Selvakumar, and S. Rajaram, deals with various techniques, methodologies that correspond
to the classification problem in data analysis process and its methodological impacts to big data. The
fourth chapter written by Raghvendra Kumar, Prasant Kumar Pattnaik and Priyanka Pandey, presented
various privacy preserving protocols that are used for security in clustered databases. The Xln(X) protocol and the secure sum protocol are used in mutual computing, which can defend privacy efficiently.
Its focuses on the data modification techniques, where it has been modified our distributed database
and after that sanded that modified data set to the client admin for secure data communication with zero
percentage of data leakage and also reduce the communication and computation complexity. The fifth
chapter of this section authored by Masoumeh Zareapoor, Pourya Shamsolmoali and M. Afshar Alam,
shows the performance of new credit card fraud detection technique which is based on, firstly balancing





Preface

the transaction records, and then applies the proposed algorithm to detect the fraudulent transactions.
At the end, we conduct a series of experiments to evaluate the effectiveness of our proposed techniques.
In the chapter six authored by Belay Gebremeskel, Yi Chai, and Zhongshi He, incorporates tremendous
novel ideas and methodologies as the integral of spatial data mining (SDM), which is highly pertinent
and serve as a single inference material for researchers, experts, and other users.
The second section of this book is “Social Media Analytics With Sentiment Analysis: Business
Applications and Methods” where the chapters related to social media analytics methods and related
applications have been discussed. In Chapter 7 authored by Amir Manzoor, explores the ways social
researchers and scientists can use social media data to support their research and analysis. Chapter 8
written by T. K. Das, presents a new approach to integrate customers’ opinions into the traditional data
warehouse model. He has taken Twitter as the data source for this experiment where at first, a system
which can be used for opinion analysis on a product or a service has been built. The second process
is to model the opinion table so obtained as a dimensional table and to integrate it with a central data
warehouse schema so that reports can be generated on demand. Furthermore, he has shown how business
intelligence can be elicited from online product reviews by using computational intelligence technique
like rough set base data analysis. Chapter 9 authored by A. M. Abirami, A. Sheik Abdullah, A. Askarunisa, S. Selvakumar, and C. Mahalakshmi proposes a modeling technique that uses latent semantic
analysis (LSA) technique for aspect extraction and evaluates senti-scores of various products under
study. In Chapter 10, Vinod Kumar Mishra, and Himanshu Tiruwa provide an overview of aspect based
sentiment analysis with current and future trend of research on aspect based sentiment analysis. This
chapter also provides an aspect based sentiment analysis of online customer reviews of Nokia 6600. To
perform aspect based classification they are using lexical approach on eclipse platform which classify
the review as a positive, negative or neutral on the basis of features of product. The senti-word net is
used as a lexical resource to calculate the overall sentiment score of each sentence, pos tagger is used for
part of speech tagging, frequency based method is used for extraction of the aspects/features and used
negation handling for improving the accuracy of the system. Chapter 11 written by Ponnuru Ramalinga
Karteek, Rashik Gupta, and Shrawan Kumar Trivedi, take the reader through basic sentiment classifiers like building word clouds, commonality clouds, dendrograms and comparison clouds to advanced

algorithms like K Nearest Neighbour, Naïve Biased Algorithm and Support Vector Machine. In Chapter
12, Sanjiva Shankar Dubey and Arunesh Sharan enunciate the key principles and practices to bridge
the gap between organization requirements vs. capabilities of any BI tool(s) by proposing a framework
of organizational factors such as user’s role, their analytical needs, access preferences and technical /
analytical literacy etc. Chapter 13 authored by Amir Manzoor explores various ethical issues related to
researches related to social networking sites. This chapter also provides a set of ethical guidelines that
future researches on social media sites can use to address various ethical issues.
The third section of this book is “Big Data Analytics: Its Methods and Applications” where the chapters
related to Big data analytics methods and their applications have been discussed. In this section, Chapter
14, written by K. Suneetha, covers how to evaluate this big volume of data for unknown and useful facts,
associations, patterns, trends which can give birth to new line of handling of diseases and provide high
quality healthcare at lower cost to all citizens. This chapter gives a wide insight of introduction to Big
Data Analytics in health domain, processing steps of BDA, Challenges and Future scope of research
in healthcare. Chapter 15 authored by Khadija Ali Vakeel elaborates on mining techniques useful in
big data analysis. Specifically, it will elaborate on how to use association rule mining, self-organizing
maps, word cloud, sentiment extraction, network analysis, classification, and clustering for marketing
xxiii


Preface

intelligence. The application of these would be on decisions related to market segmentation, targeting
and positioning, trend analysis, sales, stock markets and word of mouth. The chapter is divided in two
sections of data collection and cleaning where we elaborate on how twitter data can be extracted and
mined for marketing decision making. Second part discusses various techniques that can be used in big
data analysis for content and interaction network. In Chapter 16, Balamurugan Balusamy, Priya Jha,
Tamizh Arasi, and Malathi Velu discuss the Big data analytics in recent years had developed lightning
fast applications that deal with predictive analysis of huge volumes of data in domains of finance, health,
weather, travel, marketing and more. Business analysts take their decisions using the statistical analysis
of the available data pulled in from social media, user surveys, blogs and internet resources. Customer

sentiment has to be taken into account for designing, launching and pricing a product to be inducted
into the market and the emotions of the consumers’ changes and is influenced by several tangible and
intangible factors. The possibility of using big data analytics to present data in a quickly viewable format
giving different perspective of the same data is appreciated in the field of finance and health, where the
advent of decision support system is possible in all aspects of their working. Cognitive computing and
artificial intelligence are making big data analytical algorithms to think more on their own, leading to
come out with big data agents with their own functionalities. In Chapter 17, Supriyo Roy and Kaushik
Kumar, explore the usefulness of applying big data concepts in these emerging areas of logistics are
explored with different dimensions. Conclusion of this paper may seem to be significant to any strategic
decision maker / manager working with specific field of distribution and logistics.
The last section of this book is “Advanced Data Analytics: Decision Models and Business Applications” where the chapters related to advance data analytics techniques and their applications have been
discussed. Chapter 18, written by G. Sreedhar and A. A. Chari, considers the important element of Page
load time of a website for assessing the performance of some well-known online Business websites
through statistical tools. Also this research work considers the optimum design aspect of Business websites leading to improvement and betterment of online business process. Chapter 19, written by Hanna
Sawicka, presents the concept of stochastic multiple criteria decision making (MCDM) method to solve
complex ranking decision problems. This approach is composed of three main areas of research, i.e.
classical MCDM, probability theory and classification method. The most important steps of the idea are
characterized and specific features of the applied methods are briefly presented. The application of Electre
III combined with probability theory, and Promethee II combined with Bayes classifier are described
in details. Two case studies of stochastic multiple criteria decision making are presented. The first one
shows the distribution system of electro-technical products, composed of 24 distribution centers (DC),
while the core business of the second one is the production and warehousing of pharmaceutical products.
Based on the application of presented stochastic MCDM method, different ways of improvements of
these complex systems are proposed and the final i.e. the best paths of changes are recommended. In
Chapter 20, Nita H. Shah discusses the problem that analyzes a supply chain comprised of two frontrunner retailers and one supplier. The retailers’ offer customers delay in payments to settle the accounts
against the purchases which is received by the supplier. The market demand of the retailer depends on
time, retail price and a credit period offered to the customers with that of the other retailer. The supplier
gives items with same wholesale price and credit period to the retailers. The joint and independent decisions are analyzed and validated numerically. Chapter 21, written by Animesh Biswas and Arnab Kumar
De, expresses efficiency of fuzzy goal programming technique for multi-objective aggregate production
planning in fuzzy stochastic environment. The parameters of the objectives are taken as normally distributed fuzzy random variables and the chance constraints involve joint Cauchy distributed fuzzy random

xxiv


×