This project analyzes sugarcane production across countries. The dataset covers production quantity, production per person, acreage, and yield.
The analysis aims to answer questions that arise naturally when comparing sugarcane production across countries. Some example questions include:
- Which continent has the highest number of countries producing sugarcane?
- Which country has the highest production of sugarcane?
- Which country has the most land under sugarcane, and does more land translate into higher production?
- Does the number of countries in a continent affect its sugarcane production?
-- Loading the Dataset
-- Data Cleaning
-- Univariate Analysis
-- Checking for Outliers
-- Bivariate Analysis
-- Correlation
-- Analysis (Continent based)
- Country: Country name
- Continent: Continent of the country
- Production(Tons): Total sugarcane production in tons
- Production_per_Person(Kg): Sugarcane production per person in kilograms
- Acreage(Hectare): Total acreage dedicated to sugarcane cultivation in hectares
- Yield(Kg/Hectare): Yield of sugarcane in kilograms per hectare
The dataset underwent a cleaning process to ensure accuracy and consistency:
-> Removed dots and replaced commas for better numerical representation
-> Converted data types to appropriate formats
-> Handled missing values by dropping the entries that contain them
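The cleaning steps above could be sketched roughly as follows. This is an illustration, not the project's actual code: it assumes pandas is used, that dots act as thousands separators and commas as decimal marks in the raw file, and the sample rows (including the placeholder country names) are made up.

```python
import pandas as pd

# Made-up sample rows; the real file's values and formatting may differ.
raw = pd.DataFrame({
    "Country": ["Country A", "Country B", "Country C"],
    "Continent": ["South America", "Asia", None],
    "Production(Tons)": ["768.678.382", "348.448.000", None],
    "Production_per_Person(Kg)": ["3.668,531", "260,721", "1,5"],
    "Acreage(Hectare)": ["10.226.205", "4.950.000", "100"],
    "Yield(Kg/Hectare)": ["75.167,5", "70.393,5", "10,0"],
})

NUMERIC_COLS = [
    "Production(Tons)",
    "Production_per_Person(Kg)",
    "Acreage(Hectare)",
    "Yield(Kg/Hectare)",
]


def clean(df: pd.DataFrame) -> pd.DataFrame:
    """Normalize number formatting, convert types, and drop incomplete rows."""
    out = df.copy()
    for col in NUMERIC_COLS:
        # Assumption: "." is a thousands separator and "," is a decimal mark.
        text = out[col].str.replace(".", "", regex=False)
        text = text.str.replace(",", ".", regex=False)
        # Values that still cannot be parsed become NaN instead of raising.
        out[col] = pd.to_numeric(text, errors="coerce")
    # Drop every row that has a missing value in any column.
    return out.dropna().reset_index(drop=True)


cleaned = clean(raw)
print(cleaned.dtypes)
print(cleaned)
```

Using `errors="coerce"` turns unparseable strings into missing values, so the final `dropna()` removes them together with rows that were incomplete to begin with.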
Explored individual columns to understand their distribution and characteristics.
Investigated relationships between different variables, including production, acreage, and yield.
Explored the correlation matrix to identify relationships between different features.
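As a minimal sketch of what a correlation matrix looks like for these columns, assuming pandas and using made-up numbers chosen only so the result is easy to read (they are not values from the dataset):

```python
import pandas as pd

# Made-up values: acreage rises with production, yield falls with it.
df = pd.DataFrame({
    "Production(Tons)": [100.0, 200.0, 300.0, 400.0],
    "Acreage(Hectare)": [10.0, 20.0, 30.0, 40.0],
    "Yield(Kg/Hectare)": [4.0, 3.0, 2.0, 1.0],
})

# Pairwise Pearson correlation between all numeric columns.
corr = df.corr()
print(corr.round(2))
```

Each cell lies between -1 and 1. Values near 1 mean the two columns rise together, values near -1 mean one falls as the other rises, and values near 0 suggest no linear relationship.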
Analyzed sugarcane production based on continents, examining factors such as the number of countries, production distribution, and correlation.
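One possible way to do a continent-level summary, sketched with pandas. The country names and production figures below are placeholders, not data from the project, and the output column names `countries` and `total_production` are chosen here for illustration.

```python
import pandas as pd

# Placeholder rows standing in for the cleaned dataset.
df = pd.DataFrame({
    "Country": ["Country A", "Country B", "Country C", "Country D", "Country E"],
    "Continent": ["Asia", "Asia", "Africa", "Africa", "Africa"],
    "Production(Tons)": [500.0, 300.0, 100.0, 50.0, 25.0],
})

# Count producing countries and total production for each continent.
summary = (
    df.groupby("Continent")
      .agg(
          countries=("Country", "count"),
          total_production=("Production(Tons)", "sum"),
      )
      .sort_values("total_production", ascending=False)
)
print(summary)
```

Placing the country count next to total production makes it easy to check whether continents with more producing countries also produce more. In this toy data they do not.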
Concluded with key findings and insights derived from the analysis, emphasizing factors influencing sugarcane production.