PySpark code that calculates A x A^T x A distributed in a computing cluster.

jaakko-paavola/massive-matrix-multiplication-with-spark

massive-matrix-multiplication-with-spark

PySpark code that calculates A x A^T x A. Tested on a computing cluster with a 1,000,000 x 1,000 matrix. The included file data-small-sample.txt contains space-separated floating-point values that form a 1,000 x 200 matrix for testing.
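
As a rough illustration of why this product can be computed on a tall, narrow matrix, here is a minimal sketch in plain Python (no Spark required). It assumes the product is evaluated as A x (A^T x A), so that the only intermediate is a small square matrix whose side is the number of columns; this README does not state which evaluation order the repository actually uses. All function names are hypothetical. The `map` and `reduce` steps are written so that they could plausibly be swapped for `RDD.map` and `RDD.reduce` on an RDD of rows.

```python
from functools import reduce


def parse_row(line):
    # One matrix row per line, values separated by whitespace.
    return [float(v) for v in line.split()]


def outer(row):
    # Outer product of a row with itself: one row's contribution to A^T x A.
    return [[a * b for b in row] for a in row]


def add_matrices(x, y):
    # Element-wise sum of two equally sized matrices.
    return [[a + b for a, b in zip(rx, ry)] for rx, ry in zip(x, y)]


def row_times_matrix(row, m):
    # Multiply a 1 x k row vector by a k x k matrix.
    return [sum(r * m[i][j] for i, r in enumerate(row)) for j in range(len(m[0]))]


def a_at_a(rows):
    # Step 1 (map + reduce): A^T x A is the sum of the rows' outer products.
    gram = reduce(add_matrices, map(outer, rows))
    # Step 2 (map): each output row is the input row times the small matrix.
    return [row_times_matrix(row, gram) for row in rows]


# Tiny 3 x 2 example in the same space-separated format as the sample file.
lines = ["1.0 2.0", "3.0 4.0", "5.0 6.0"]
rows = [parse_row(line) for line in lines]
result = a_at_a(rows)
print(result)
```

With this ordering, the n x n matrix A x A^T (1,000,000 x 1,000,000 for the tested input) is never formed; only the 1,000 x 1,000 matrix A^T x A needs to be shared with every worker.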
