Healthcare Data Analysis
Goal
My goal for this project was to practice importing data into a Jupyter Notebook with the Pandas library and to analyze a Kaggle dataset using Python with Pandas, NumPy, and Matplotlib.
Process
I imported the dataset from Kaggle using Pandas, then reviewed the raw data to understand what it represented. After that, I cleaned the data, analyzed it, created a few visualizations, and wrote up actionable insights.
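The import, review, and cleaning steps might look roughly like the sketch below. It is only an illustration: the inline CSV text, the column names (`Gender`, `Medical Condition`, `Medication`), and the helper `load_and_clean` are all hypothetical stand-ins, since the actual Kaggle file and the cleaning rules used in the project are not described here.

```python
import io

import pandas as pd

# Hypothetical stand-in for the Kaggle CSV. The real file, its column names,
# and its data quality issues may differ from what is assumed here.
RAW_CSV = """Name,Gender,Medical Condition,Medication
alice smith,Female,Obesity,Penicillin
bob jones,Male,Cancer,Aspirin
bob jones,Male,Cancer,Aspirin
cara lee,female,Cancer,
"""


def load_and_clean(source):
    """Read a CSV and apply a few common (assumed) cleaning steps."""
    df = pd.read_csv(source)
    # Drop rows that are exact duplicates of an earlier row.
    df = df.drop_duplicates()
    # Normalize inconsistent capitalization and stray whitespace.
    df["Gender"] = df["Gender"].str.strip().str.title()
    # Drop rows with no recorded medication.
    df = df.dropna(subset=["Medication"])
    return df.reset_index(drop=True)


# Review the raw data first: size, column types, and missing values.
raw = pd.read_csv(io.StringIO(RAW_CSV))
print(raw.shape)
print(raw.dtypes)
print(raw.isna().sum())

# Then clean it.
clean = load_and_clean(io.StringIO(RAW_CSV))
print(clean)
```

With a real download, `pd.read_csv` would be given the path to the CSV file instead of the in-memory text.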
Insights
I found several actionable insights in this data, such as:
- Penicillin is the most commonly prescribed medicine, so it is critical that hospitals keep plenty of it in supply.
- In this dataset, a higher percentage of female patients than male patients have obesity or cancer, so research studies and treatment plans designed specifically for women could be beneficial.
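Both kinds of insight above come down to simple counts and percentages, which could be computed along these lines. This is a sketch, not the project's actual code: the rows below are made-up illustrative values rather than the Kaggle data, and the column names are assumptions.

```python
import pandas as pd

# Illustrative rows only, not the real dataset. Column names are assumptions.
df = pd.DataFrame({
    "Gender": ["Female", "Female", "Female", "Male", "Male", "Male"],
    "Medical Condition": ["Obesity", "Cancer", "Asthma", "Obesity", "Asthma", "Asthma"],
    "Medication": ["Penicillin", "Penicillin", "Aspirin", "Penicillin", "Aspirin", "Ibuprofen"],
})

# How often each medication is prescribed, most frequent first.
medication_counts = df["Medication"].value_counts()
top_medication = medication_counts.idxmax()

# Percentage of each gender's patients with each condition.
# normalize="index" makes every row (gender) sum to 1 before scaling to 100.
condition_share = pd.crosstab(
    df["Gender"], df["Medical Condition"], normalize="index"
) * 100

print(medication_counts)
print(top_medication)
print(condition_share.round(1))
```

Calling `condition_share.plot(kind="bar")` would turn the same table into a grouped bar chart through Matplotlib.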
These are just a few of the insights. I have more listed in the full report.
See full project on GitHub.
