Home of Data Algorithms Book
Welcome to the Home of Data Algorithms book wiki!
I have started working on the 2nd edition of the "Data Algorithms" book. Some of the topics and chapters I plan to include are:
- PageRank Algorithm
- Rank Product Algorithm
- Feature-ization of K-Means
- Graph Algorithms: motif finding
- Social Recommendations using motif finding
- FP-Growth Algorithms
- ...
I am so excited to report that "Data Algorithms" was published in July 2015! This means that a hard copy will be available soon!
Instructions (as Bash scripts) are provided for building and running the sample code in the Data Algorithms book.
All-vs-All: Correlation Using Spark/Hadoop (Thursday, July 23, 2015)
Author book signings for "Data Algorithms" will be held in the O'Reilly booth on Thursday, March 31, 2016 at 10:30am PST. Complimentary copies of the book will be provided to the first 25 attendees.
All solutions (using Spark and Hadoop) in the Data Algorithms book have been provided in Java. But I have good news for Python programmers: Kashif Rasul has graciously volunteered to provide solutions in the Python programming language.
My immediate goal is to provide at least two solutions for each chapter:
- MapReduce/Hadoop Solution
- Spark Solution (in Java, Scala, Python, ...)
Another goal is to provide compact Spark solutions in Java 8 (using lambda expressions).
My next immediate goal is to separate the solutions for MapReduce/Hadoop and Spark. I will change the package structure as follows (if you have suggestions or comments, please let me know):
org.dataalgorithms.<chapter-number>.spark
org.dataalgorithms.<chapter-number>.mapreduce
If you want to post a solution to any chapter, please let me know and full credit will be given to you.
Thanks and best regards,
Mahmoud Parsian
[email protected]