Skip to content

Home of Data Algorithms Book

Mahmoud Parsian edited this page Mar 24, 2017 · 20 revisions

Data Algorithms

Welcome!

to the Home of Data Algorithms book wiki!

I have started working on the 2nd-Edition of "Data Algorithms" book. Some of the topics and chapters, which I will include are:

  • PageRank Algorithm
  • Rank Product Algorithm
  • Feature-ization of K-Means
  • Graph Algorithms: motif finding
  • Social Recommendations using motif finding
  • FP-Growth Algorithms
  • ...

Going to Production

I am so excited to report that the "Data Algorithms" published on July 2015! This mean that a HARD COPY is available soon!

Instructions (as BASH scripts) are provided for building and running sample codes in Data Algorithms book.

Webcast

All-vs-All: Correlation Using Spark/Hadoop(Thursday, July 23, 2015)

Author Book Signing at Strata+Hadoop World San Jose

Author book signings for ("Data Algorithms") will be held in the O'Reilly booth on Thursday, March 31, 2016 at 10:30am PST. Complimentary copies of books will be provided for the first 25 attendees.

Introduction

All solutions (using Spark and Hadoop) in Data Algorithms book has been provided in Java. But I have a good news for Python programmers: Kashif Rasul has graciously volunteered to provide solutions in Python programming language.

Immediate Goal

My immediate goal is to provide at least two solutions for each chapter:

  • MapReduce/Hadoop Solution
  • Spark Solution (in Java, Scala, Python, ...)

Another goal is to provide compact Spark solutions in Java 8 (using Lambda Expressions).

Reorganization

My next immediate goal is to separate solutions for MapReduce/Hadoop and Spark. I will change package structure as (if you have suggestions/comments, please let me know):

org.dataalgorithms.<chapter-number>.spark
org.dataalgorithms.<chapter-number>.mapreduce

Posting New Content/Solution/Contribution

If you want to post a solution to any chapter, please let me know and full credit will be given to you.

Follow @mahmoudparsian

Thanks,
best regards,
Mahmoud Parsian
[email protected]