Skip to content
View Tinyiko-Mathebula's full-sized avatar

Block or report Tinyiko-Mathebula

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Tinyiko-Mathebula/README.md

Hi, I'm Tinyiko Patience Mathebula 👋

Data Analyst | Python · SQL · Power BI · Machine Learning | Johannesburg, Gauteng


🎯 What I Do

I build end-to-end data solutions — from relational database design and SQL analysis through to machine learning pipelines and business intelligence dashboards.

I bring something most analysts at my level don't: 8+ years of operational data experience managing POPIA-compliant systems, designing reporting workflows, and delivering insights to leadership — before formally transitioning into data analytics.


🚀 Featured Projects

Project What it proves Tools
Credit Risk Intelligence Platform Improved model AUC from 0.68 → 0.7822. DataQuest 2026 competition. Python, Scikit-Learn, Streamlit
Customer Churn Prediction 0.77 AUC · 3 models compared · catches 65% of churners Python, Scikit-Learn, pandas
Retail Sales & RFM Segmentation 25,000 transactions · top 20% of customers = 83.6% of revenue Python, MySQL, pandas
Enterprise Banking Database 3NF design · audit logging · fraud detection logic MySQL, SQL
Telecom Churn Dashboard Interactive KPI dashboard · DAX measures · churn driver analysis Power BI, DAX
Pension Data SQL Analytics Contribution auditing · data quality · regulatory reporting MySQL, SQL

🛠️ Technical Stack

Languages & Analysis Python SQL MySQL pandas NumPy scikit-learn matplotlib seaborn

Machine Learning Logistic Regression Random Forest Gradient Boosting Feature Engineering WoE/IV Analysis

Business Intelligence Microsoft Power BI DAX Tableau Streamlit Excel (Advanced)

Databases & Governance Relational Database Design 3NF Normalisation Oracle Database POPIA Compliance Data Quality Auditing

Web & Tools GitHub HTML CSS Google Workspace AI-assisted workflows (Claude, ChatGPT)


📚 Currently Building

  • 🎓 BCom Business Informatics — UNISA (Final year, completing Nov 2026)
  • 📊 GCI 2026 DataCamp Programme — active participant
  • 👩🏾‍💻 DataCamp | Women in Data 2026 — active participant
  • 🔬 DataCamp DataLab: International Debt Statistics · Netflix Movies · Students' Mental Health

📫 Connect

LinkedIn Email


Open to Junior to Mid Data Analyst roles in Johannesburg · On-site · Hybrid · Remote

Pinned Loading

  1. dataquest-2026-credit-risk dataquest-2026-credit-risk Public

    Credit Risk Intelligence Platform — DataQuest 2026. Improved baseline model AUC from 0.68 to 0.7822 using feature engineering and WoE encoding. Interactive Streamlit dashboard for regulator-aware r…

    Python

  2. python-churn-prediction python-churn-prediction Public

    End-to-end customer churn prediction pipeline — 0.77 AUC · 5,880 customers · 3 models compared (Logistic Regression, Random Forest, Gradient Boosting) · threshold tuning catches 65% of churners. Py…

    Jupyter Notebook

  3. retail-sales-analysis retail-sales-analysis Public

    Retail sales analysis & RFM customer segmentation · 25,000 transactions · 13 months · top 20% of customers generate 83.6% of revenue · 8 customer segments identified · 5 business recommendations. P…

    Python

  4. enterprise_banking_database_system enterprise_banking_database_system Public

    Enterprise-level relational banking database — 3NF architecture, transaction control, audit logging, fraud detection logic, and performance indexing. Built to satisfy tier-1 banking data requiremen…

  5. Telecom-Customer-Churn-PowerBI Telecom-Customer-Churn-PowerBI Public

    Interactive Power BI dashboard analysing telecom customer churn — KPI cards, DAX measures, and slicers identifying key churn drivers: contract type, service type, and payment method. Power BI · DAX…

  6. pension-data-sql-project pension-data-sql-project Public

    SQL pension administration database — contribution analysis, data quality auditing, missing payment detection, and aggregated regulatory reporting. Built to simulate real-world pension fund data ma…