MSc Data Science & Analytics with Distinction — University of Hertfordshire. I build end-to-end ML pipelines, deploy to Azure, and translate complex data into clear business decisions.
Full MLOps pipeline on the Kaggle Telco Churn dataset. Trained and benchmarked Logistic Regression, Random Forest, and XGBoost — selecting Logistic Regression for deployment based on best accuracy and F1. Tracked all experiments with MLflow and deployed a RESTful Flask API to Azure App Service (Linux, Python 3.10) using Gunicorn. Resolved Kudu environment dependency issues via a custom startup command.
View on GitHubXGBoost achieved R² 95.51% and RMSE 0.0106 after hyperparameter tuning — outperforming LightGBM (R² 94.98%) and Random Forest (R² 78.51%). Feature importance analysis identified top environmental predictors, delivering actionable insights for disaster management stakeholders.
Interactive Power BI dashboard on the Kaggle Superstore Sales dataset. Built DAX measures, drill-through filters, and dynamic slicers for self-service exploration by region, product category, and time period — replicating a production BI reporting environment.
View on GitHubInvestigated the statistical association between EKG results and heart disease prevalence using chi-square proportion hypothesis testing in RStudio on a clinical dataset (n=303). Delivered clear statistical interpretation for both technical and non-technical audiences.
Preprocessed an 800-instance sentiment dataset using TF-IDF and Lovins stemming (1,199 features). Applied Random Projection to 900 features and evaluated the accuracy/efficiency trade-off across J48 and LibSVM classifiers — showing dimensionality reduction impact is algorithm-dependent.
I'm a data professional based in Hatfield, UK, holding an MSc in Data Science & Analytics earned with Distinction from the University of Hertfordshire. My BTech background in mechanical engineering gives me a structured, mathematical mindset I now apply to building ML systems that solve real problems.
I work across the full data lifecycle — from data cleaning and statistical analysis through to model deployment on Azure. I'm equally comfortable building predictive models in Python and delivering Power BI dashboards that non-technical stakeholders can act on immediately.
Actively seeking roles as a Data Scientist, Data Analyst, or BI Analyst. Right to work in the UK. Open to hybrid and on-site roles.
I'm actively looking for Data Scientist, Data Analyst, and BI Analyst roles across the UK. Right to work — no sponsorship required. Reach out any time.