Projects

Classification of Loan Data using Machine Learning

Skills
Python, Apache Spark, Machine Learning

Implemented Naïve Bayes, kNN, and Logistic Regression from scratch, without using any existing machine learning libraries, to predict whether a given loan would default. Used a banking dataset from Kaggle of about 1 GB with more than 1M records.
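
To illustrate what "from scratch" can look like for one of the three classifiers, here is a minimal sketch of logistic regression trained with batch gradient descent, using only the Python standard library. It is not the project's actual code: the function names, the learning rate, the epoch count, and the one-feature toy data (a made-up "debt ratio" per loan) are all assumptions chosen for illustration.

```python
import math


def sigmoid(z):
    # Numerically stable logistic function: avoids overflow for large |z|.
    if z >= 0:
        return 1.0 / (1.0 + math.exp(-z))
    ez = math.exp(z)
    return ez / (1.0 + ez)


def train_logistic_regression(rows, labels, lr, epochs):
    # Batch gradient descent on the mean log-loss.
    # rows: list of feature lists; labels: list of 0/1 (1 = default).
    n = len(rows)
    n_features = len(rows[0])
    weights = [0.0] * n_features
    bias = 0.0
    for _ in range(epochs):
        grad_w = [0.0] * n_features
        grad_b = 0.0
        for x, y in zip(rows, labels):
            p = sigmoid(sum(w * xi for w, xi in zip(weights, x)) + bias)
            err = p - y  # derivative of log-loss w.r.t. the linear score
            for j, xi in enumerate(x):
                grad_w[j] += err * xi
            grad_b += err
        weights = [w - lr * g / n for w, g in zip(weights, grad_w)]
        bias -= lr * grad_b / n
    return weights, bias


def predict(weights, bias, x, threshold=0.5):
    # Returns 1 (predicted default) when the estimated probability
    # reaches the threshold, else 0.
    p = sigmoid(sum(w * xi for w, xi in zip(weights, x)) + bias)
    return 1 if p >= threshold else 0


# Hypothetical toy data: one feature per loan, label 1 means default.
toy_rows = [[0.1], [0.2], [0.8], [0.9]]
toy_labels = [0, 0, 1, 1]

weights, bias = train_logistic_regression(toy_rows, toy_labels, lr=1.0, epochs=2000)
predictions = [predict(weights, bias, x) for x in toy_rows]
```

On a dataset with more than 1M records, the same update would normally be computed over mini-batches or distributed partitions rather than a single Python loop; the sketch only shows the core gradient step.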