Machine Learning Projects

(Note: Click on the images to view the project file)

Cleveland Heart Disease Dataset

To determine whether a patient is at risk of heart disease or not, based on the given features. This is a modified dataset that has 14 features of the original cleveland dataset


Buldozer Bluebook Dataset

To accurately estimate the price of a buldozer, given its various features. This is a pretty old dataset from Kaggle which helped me learn more about regression scoring parameters and models.


Uber Rides Dataset

To accurately estimate the fare amount of an Uber ride, given historic data. This is a classic regression dataset obtained from Kaggle which takes a little more cleaning and visualizing to infer insights.


Data Engineering Projects

(Note: Click on the images to view the project file)

SQL Query Exercise

Performing simple to complex queries on a sample database to satisfy the requirements of the exercise.