Data Analyst

Hamza Zaman

MSc Data Science graduate turning complex data into clear business decisions. I build predictive models and dashboards that help teams act faster and smarter.

Quick Overview

Location London, UK
Education MSc Data Science
Work Status UK Eligible
Availability Open to Roles

Featured Projects

Real problems, measurable results

ESG 10-K Extraction from SEC EDGAR

3 Companies Analyzed
PythonSEC APINLPBeautifulSoup

Automated extraction of ESG disclosures from Apple, Alphabet, and Tesla 10-K filings. Built NLP pipeline to identify Environmental, Social, and Governance content from SEC EDGAR.

Car Insurance Claim Prediction

Improved Accuracy
PythonXGBoostFeature Engineering

Built a gradient boosting model to estimate claim risk for auto policies. Engineered features from policyholder data that improved premium pricing accuracy and reduced loss ratios.

London Fire Brigade Analytics

False Alarm Detection
Pythonscikit-learnClustering

Developed classification and clustering models to identify false alarms in emergency calls. Delivered actionable insights for deployment planning and resource optimization.

Osteoporosis Fracture-Risk

88% Accuracy
Pythonscikit-learnKNN

Benchmarked KNN, Random Forest, Logistic Regression, and SVM models. Identified BMD and Age as top predictive factors using feature importance analysis.

NLP Lip-Reading — Master's Thesis

0.92 BLEU Score
TensorFlowSeq2SeqGRU-Attention

Trained sequence-to-sequence model on 45K LRS2 sentences using TPU. Implemented phoneme-viseme features achieving ~3% WER, outperforming baseline by 15+ percentage points.

ESG Repo Evaluation (SEC EDGAR)

5 Repos Benchmarked
PythonNLPSEC EDGARBERT

Evaluated 5 open-source ESG extraction tools against 62 real 10-K filings using a 55-keyword dictionary. Scored on setup, code quality, ESG signal, and SEC compliance.

edgartools Deep Code Evaluation

9.2/10 Score
PythonSEC EDGARXBRL140K LOC

Deep source analysis of edgartools v5.17.1: 332 files, 657 classes, 21 dependencies mapped. Evaluated XBRL parsing, HTTP layer, AI integration, and ESG extraction capabilities.

Technical Skills

Tools I use to deliver value

📊 Data Analytics

  • SQL (T-SQL, CTEs, Window Functions)
  • Power BI (DAX, Power Query)
  • Excel (Pivot Tables, VBA)
  • Python (pandas, NumPy)

🤖 Machine Learning

  • scikit-learn
  • Feature Engineering & CV
  • Time-Series Forecasting
  • Transformers & Seq2Seq

💼 Professional

  • Stakeholder Communication
  • KPI Design & Measurement
  • Data Storytelling
  • GDPR & Data Governance
Blog
LinkedIn
GitHub