Selected projects in data science, machine learning and NLP
Data Science | Data Analysis
View code on Github | Read Blog
I performed a thorough data analysis on the Stack Overflow Annual Developer Survey Data to uncover insights about data scientists. I addressed specific research questions through my analysis and developed a machine learning model to predict the salaries of data scientists.
NLP | Text Classification | ETL Pipeline
I have created an ETL pipeline, ML pipeline and Web application based on Flask to categorise real-time messages during disaster events. I have applied NLP techniques to process the text data. The dataset is provided by Figure Eight in collaboration with Udacity.
Machine Learning
View code on Github | Read Paper
In this project, I have developed three different novel online Machine Learning approaches for real-time software defect prediction. These models demonstrated enhanced predictive performance, with G-Mean improvement reaching up to 48.16% during concept drift periods.
Machine Learning
I have proposed and implemented a novel online hyper-tuning algorithm capable of tuning machine learning model hyper-parameters in real-time. This novel method can regularly identify optimal hyper-parameter combinations, minimizing declines in the machine learning model's predictive performance.