Ghouti Belhadj

Data • AI Engineer

🎓
ESME — Master's-level Engineering Degree (Sep 2022 - Jun 2027)
🌍
HUST — Wuhan, China (Fall 2025) • Top 0.1% in AI
Ghouti Belhadj

Experience

Acadomia — Scientific Tutoring

Mathematics and Physics Tutor
October 2024 - Present • Paris, France
Teaching advanced scientific concepts to 8+ high school and university students with personalised sessions. Improved grades by up to 9 points.

DevExplore — ESME's student developer community

President
November 2025 - Present • Paris, France
Agile management of a 16-person team. Organising tech projects and hackathons for a 600+ student community.

Projects

AI Classifier For Aircraft Incident Reports

Automated classification of aviation incident reports, reducing manual triage time by ~90% (from ~30 min to <5 sec per report)

Python LangChain Mistral FastAPI ChromaDB Aviation Safety Data
▼ Click to expand

End-to-end LLM architecture for BEA accident report analysis: classification, HFACS extraction, semantic RAG and weak signal detection (all-in-one).

Highlights:
  • ▪ F1 macro 0.81 (from 0.31, +160% via data-centric iteration)
  • ▪ 120+ reports processed, 2,500+ chunks semantically indexed
  • ▪ ~90% reduction in manual triage time (30 min → <5 sec per report)

Intelligent ChatPDF Assistant

End-to-end RAG pipeline for intelligent data extraction and hallucination-free LLM querying

Python RAG FAISS LLM
▼ Click to expand

Architected a complete system including PDF chunking, high-dimensional embedding generation, FAISS vector storage, and LLM querying with contextual constraints.

Highlights:
  • ▪ Optimised PDF chunking
  • ▪ Embedding generation
  • ▪ Hallucination-free querying

Financial Chatbot - BCG X

Automated KPI extraction from financial data, reducing manual research time by 30%

Python Pandas Financial Data
▼ Click to expand

Financial chatbot automating the extraction of critical KPIs. Pandas pipeline structuring 3 years of financial data into interactive metrics.

Highlights:
  • ▪ 30% reduction in manual research
  • ▪ 3 years of structured data
  • ▪ 20% faster insights

Glaucoma Detection with ML

Classification model for glaucoma detection with 92% F1-Score and expert validation

Machine Learning TensorFlow Medical Imaging
▼ Click to expand

5 feature extraction approaches and 4 classification models for automated glaucoma detection. Validated by an expert ophthalmologist.

Highlights:
  • ▪ 5 feature extraction approaches
  • ▪ 4 models evaluated
  • ▪ F1-Score 92%

Deep Learning Image Classifier

Deep neural network for image classification with an optimised data pipeline

TensorFlow Deep Learning CNN
▼ Click to expand

Complete data pipeline with optimised image preprocessing. Design and training of a deep neural network for classification.

Highlights:
  • ▪ Complete data pipeline
  • ▪ Optimised preprocessing
  • ▪ Production deployment

Linear Regression from Scratch

Vectorised implementation without predictive ML libraries, built from scratch using OOP

Python NumPy Pandas
▼ Click to expand

OOP learning model without external libraries. Vectorised Gradient Descent implementation via NumPy. Training and testing on real-world data.

Highlights:
  • ▪ Custom OOP model
  • ▪ Vectorised Gradient Descent
  • ▪ Real-world data validation

Skills

Languages & Frameworks
Python C++ SQL Bash HTML/CSS/JS Pandas PyTorch TensorFlow Scikit-learn LangChain HuggingFace
Data & AI
ETL Pipelines LLM Machine Learning RAG Deep Learning ChromaDB Embeddings PowerBI Tableau
Cloud & Tools
Google Cloud Platform NoSQL Docker Git/GitHub API REST FastAPI
Certifications
Google Cloud Certified Cloud Digital Leader Google Data Analytics Google Data Cleaning
Languages

French

Native

English

C1

Chinese

B1

Arabic

B2

Get in touch

I am open to internship and work-study opportunities and interesting projects in AI and Data Science.