Home
This Title All WIREs
WIREs RSS Feed
How to cite this WIREs title:
WIREs Data Mining Knowl Discov
Impact Factor: 2.111

Market Basket Analysis algorithms with MapReduce

Full article on Wiley Online Library:   HTML PDF

Can't access this content? Tell your librarian.

The MapReduce approach has been popular in computing large scale data since Google implemented its platform on Google Distributed File Systems (GFS) followed by Amazon Web Service (AWS) providing the Apache Hadoop platform in inexpensive computing nodes. Map/Reduce motivates to redesign and convert the existing sequential algorithms to MapReduce as restricted parallel programming so that the paper proposes Market Basket Analysis algorithm with MapReduce as well as apriority property. Two algorithms are proposed by adapting an existing Apriori‐algorithm and building a simple algorithm that sorts data sets and converts it to (key, value) pairs to fit with MapReduce. It is executed on Amazon EC2 Map/Reduce platform. The experimental results show that the Apriori‐algorithm does not perform as well as the simple algorithm. Using the simple algorithm, the code with Map/Reduce increases the performance by adding more nodes, but at a certain point there is a bottleneck that does not allow further performance gain. It is believed that the operations of distributing, aggregating, and reducing data in Map/Reduce, cause the bottleneck. WIREs Data Mining Knowl Discov 2013, 3:445–452. doi: 10.1002/widm.1107

Transaction data at a store.
[ Normal View | Magnified View ]
Chart for execution.
[ Normal View | Magnified View ]
MBA algorithm for Reducer.
[ Normal View | Magnified View ]
MBA algorithm for Mapper.
[ Normal View | Magnified View ]
Apriori‐Map/Reduce algorithm.
[ Normal View | Magnified View ]
Map/Reduce flows.
[ Normal View | Magnified View ]
Data set restructured with Sort.
[ Normal View | Magnified View ]
Data set restructured for the same list.
[ Normal View | Magnified View ]
Data set restructured for Map/Reduce.
[ Normal View | Magnified View ]
Top 10 pairs of items frequently occurred at store.
[ Normal View | Magnified View ]

Browse by Topic

Fundamental Concepts of Data and Knowledge > Big Data Mining
Algorithmic Development > Association Rules
Application Areas > Business and Industry

Access to this WIREs title is by subscription only.

Recommend to Your
Librarian Now!

The latest WIREs articles in your inbox

Sign Up for Article Alerts