Skip to content

This project analyzes life expectancy data from the World Health Organization (WHO), sourced from Kaggle. The dataset spans 183 countries across 6 regions, covering metrics such as life expectancy, mortality rates, healthcare access, and socioeconomic factors.

Notifications You must be signed in to change notification settings

ralstonraphael/Water_Access_ML_Project

Folders and files

NameName
Last commit message
Last commit date

Latest commit

Β 

History

3 Commits
Β 
Β 
Β 
Β 

Repository files navigation

🌍 Life Expectancy Analysis with WHO Data

πŸ“„ Project Overview

This project explores and analyzes life expectancy data provided by the World Health Organization (sourced from Kaggle). Using Python's powerful data science ecosystem, we clean, transform, and visualize insights about global health trends. We then apply machine learning techniques to predict life expectancy based on socioeconomic and health factors.


πŸ§ͺ Tech Stack

  • Python
  • Pandas, Numpy
  • Matplotlib, Seaborn
  • Scikit-learn
  • TensorFlow/Keras

🧠 Key Learnings

  • Mastery in data preprocessing: handling missing values, normalization, and encoding.
  • Feature engineering to enhance model inputs.
  • Exploratory Data Analysis (EDA) with Matplotlib and Seaborn.
  • Training and evaluating machine learning models using TensorFlow/Keras.
  • Interpretation of results to derive health policy insights.

πŸ” Key Insights

  • GDP and schooling are strongly positively correlated with higher life expectancy.
  • HIV/AIDS prevalence shows a strong negative correlation.
  • Countries with better healthcare expenditure generally enjoy higher life expectancy.
  • Model performance indicates socioeconomic indicators can be good predictors of life expectancy, though regional anomalies exist.

🧾 How to Run

# Install required libraries
pip install pandas numpy matplotlib seaborn scikit-learn tensorflow

About

This project analyzes life expectancy data from the World Health Organization (WHO), sourced from Kaggle. The dataset spans 183 countries across 6 regions, covering metrics such as life expectancy, mortality rates, healthcare access, and socioeconomic factors.

Topics

Resources

Stars

Watchers

Forks