Skip to content

Apache Spark Architecture was used to process and solve custom problems on a huge dataset.

Notifications You must be signed in to change notification settings

ArhantJain/Cluster-Computing

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

1 Commit
 
 
 
 
 
 
 
 

Repository files navigation

Cluster-Computing

  1. Write​ a program to​ get​​ the​​ number​​ of​​ Airports​​ by​ ​ Country.

Commmand To Run : python pyspark_1.py [No. of CPUs] [OUTPUT DIRECTORY NAME]

On Running this command, .csv file will be created inside the OUTPUT DIRECTORY which gives the​​ number​​ of​​ Airports​​ by​ ​each Country.

  1. Write a program to find the Country having​ the​ highest​ number​ of​ ​ airports.

Commmand To Run : python pyspark_2.py [No. of CPUs] [OUTPUT FILE NAME]

On Running this command, answer will be stored inside the OUTPUT FILE and also gets Printed on the console i.e the Country having​ the​ highest​ number​ of​ ​ airports.

  1. Write a program to find airports whose latitude is between [10, 90] and longitude is between [-10, -90]. ([a,b] a,b both are included)

Commmand To Run : python pyspark_3.py [No. of CPUs] [OUTPUT DIRECTORY NAME]

On Running this command, answer will be stored inside the OUTPUT DIRECTORY which gives the​​ airports whose latitude is between [10, 90] and longitude is between [-10, -90]

About

Apache Spark Architecture was used to process and solve custom problems on a huge dataset.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages