Skip to content
View MahdiNavaei's full-sized avatar

Block or report MahdiNavaei

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
MahdiNavaei/README.md

Mahdi Navaei

AI Engineer • Agentic AI Architect • Data Scientist

Profile Views LinkedIn Email Kaggle

7+ years building production-grade AI systems | Tehran, Iran

Specializing in Agentic AI, LLMs, RAG Systems, and Enterprise ML Pipelines


🎯 What I Do

I architect and deploy production-grade intelligent systems that drive real business impact. My focus areas:

Domain Expertise
Agentic AI Autonomous agents, multi-step reasoning, task orchestration, tool integration
LLMs & NLP Fine-tuning, RAG architectures, NL2SQL, conversational AI, prompt engineering
Enterprise RAG Hybrid retrieval, re-ranking pipelines, knowledge management systems
Production ML End-to-end pipelines, real-time inference, MLOps, scalable deployments

🚀 Flagship Project 1

🤖 Production-Grade Agentic AI Framework

Vision + LLM + Event Sourcing • Local LLMs • LangGraph • HITL Safety • Autonomous Task Execution

Tests Python LangGraph License Docker

ARIA is not a prompt-chain demo or a single-purpose script — it's a full agentic AI system built for real-world automation: observe UIs with vision, plan with LLMs, act safely with human oversight, and learn from outcomes. Designed to run on local LLMs and consumer GPUs (8GB VRAM), with native English & Persian support for privacy-sensitive and resource-constrained environments.

Cognitive architecture — perception, reasoning, execution, and memory are separated and observable:

┌─────────────────────────────────────────────────────────────────────────┐
│                         ARIA — Cognitive Core                            │
├─────────────────────────────────────────────────────────────────────────┤
│  👁️ Eye (VLM/OCR)   →  Observe real interfaces • Screenshot • UIRef    │
│  🧠 Brain (LLM)      →  Plan, execute, observe • LangGraph • HITL gates  │
│  ✋ Hand (Actions)   →  Browser • Desktop • Playwright • PyAutoGUI       │
│  💾 Memory           →  Working + Episodic + Semantic (Redis • Qdrant)   │
│  📡 Event Bus        →  Kafka/Redpanda • Full audit trail & replay       │
│  📚 Learning         →  Extract skills & policies from successful runs   │
└─────────────────────────────────────────────────────────────────────────┘

Why ARIA stands out:

Pillar What it means for you
Vision-First VLM-powered UI understanding with multi-locator fallback — no brittle selectors only
Event-Sourced Every step persisted; full audit trail and replay for debugging and compliance
Human-in-the-Loop Safety gates for sensitive actions (login, CAPTCHA, payment) — production-safe by design
Local & Bilingual Run entirely on your hardware; native Farsi STT (Whisper) and embeddings
Production-Ready FastAPI + WebSocket API, Streamlit dashboard, Docker Compose, 81 tests

Tech stack: LangGraph • Ollama / OpenAI • Qwen-VL • Playwright • Redpanda (Kafka) • Redis • Qdrant • Mem0

The Job Apply automation (LinkedIn, Indeed) is the first production plugin — the platform is built for more.

🔗 Explore ARIA → • 📖 Docs, ADRs, and MODELS.md inside the repo


🚀 Flagship Project 2

📄 Governance-Safe Financial Document AI

Bilingual (EN/FA) • Quality Gates • Human-in-the-Loop Review • Replayable Lifecycle • Audit Endpoints

Release Python Next.js License FastAPI

InvoiceMind — Evidence-first invoice processing

InvoiceMind is not an OCR benchmark or a generic prompt demo — it's a production-oriented platform for invoice extraction, human review, and governance-safe automation. Built for teams where traceability and control matter more than blind automation. Most invoice AI fails in production because decisions are hard to trust, explain, and control; InvoiceMind tackles that gap head-on.

End-to-end flow — from ingestion to final export, with explicit gates and audit at every step:

┌─────────────────────────────────────────────────────────────────────────┐
│                    InvoiceMind — Pipeline & Lifecycle                    │
├─────────────────────────────────────────────────────────────────────────┤
│  📥 Ingestion → Validation → OCR/Layout → LLM Extraction → Postprocess  │
│  📊 Routing (quality gates) → Review / Quarantine → Export + Audit        │
├─────────────────────────────────────────────────────────────────────────┤
│  Run lifecycle: RECEIVED → VALIDATED → EXTRACTED → GATED →               │
│                 AUTO_APPROVED | NEEDS_REVIEW → FINALIZED                 │
│  Control: cancel • replay • quarantine (reason-coded) • audit/verify      │
└─────────────────────────────────────────────────────────────────────────┘

Why InvoiceMind stands out:

Pillar What it means for you
Evidence-first Policy and gate-based routing instead of confidence-only automation
Decision traceability Every auto-approve or escalate tied to gate results and reason codes
Replayable & auditable Full run lifecycle, cancel/replay, and audit endpoints for compliance and post-incident analysis
Local-first Privacy-first inference; versioned config bundles and model registry (models.yaml)
Safe defaults Quarantine and human review over aggressive auto-posting; NIST AI RMF & OWASP LLM–aligned

Tech stack: Python 3.11+ • FastAPI • Next.js 16 • React 19 • TypeScript • SQLAlchemy • Alembic • SQLite • AGPL-3.0

ADR-001 (local-first), ADR-002 (evidence-first), ADR-003 (policy-driven gates) — design documented in the repo.

🔗 Explore InvoiceMind → • 📖 Docs, run.bat one-click startup, API surface in README


💼 Featured Projects

🛡️ DriveShield — Real-Time Collision Risk Intelligence

End-to-end collision prediction platform using Nexar's BADAS-Open model.

  • State-of-the-Art Prediction: Real-time risk analysis with vision models
  • 100% Offline: Runs locally without external API calls
  • Production-Ready: FastAPI backend + React TypeScript frontend

Tech: Python • FastAPI • React • TypeScript • PyTorch • Computer Vision

🔗 View Repository →

🔄 Hybrid Retail Recommender System

Production-ready hybrid recommender combining collaborative filtering & content-based ML.

  • Results: 140% precision improvement, 175% recall improvement
  • Scale: Tested on 38K+ user dataset
  • Bilingual: English/Persian UI with RTL support

Tech: Python • FastAPI • React • TypeScript • scikit-learn • Docker

🔗 View Repository →

🌊 FlowCast — Surge Pricing & ETA Optimization Engine

Enterprise-grade intelligent pricing and ETA prediction for ride-hailing platforms.

  • ETA Accuracy: +20% improvement over baseline
  • Revenue: +10-25% efficiency per trip
  • Price Stability: 30-40% volatility reduction

Tech: Python • FastAPI • React • GeoPandas • Time-Series Forecasting

🔗 View Repository →

💊 Pharmaceutical Supply Chain Agentic AI

Four-agent system for supply chain optimization using LangGraph orchestration.

  • Logistics Costs: 40% reduction
  • Stockouts: 67% reduction
  • Forecast Accuracy: 95%+ (MAPE < 5%)

Tech: Python • FastAPI • LangGraph • Next.js • MongoDB • GPT-4o-mini

🔗 View Repository →

📚 More Projects
Project Description Tech
Blood Cell Cancer Detection CNN-based classifier with 99%+ accuracy TensorFlow • Keras • Medical Imaging
Books Recommendation System Production recommender, 8% sales increase Collaborative Filtering • scikit-learn
Stock Price Collection Automated data pipeline for finance ML Web Scraping • Database Design
CIFAR-10 Classification CNN image classifier, 90%+ accuracy TensorFlow • Keras • CNN

🛠️ Tech Stack

AI & LLM

OpenAI LangChain LangGraph Ollama RAG Transformers

ML & Data Science

Python PyTorch TensorFlow scikit-learn Pandas

Production & DevOps

FastAPI Docker Kubernetes PostgreSQL Redis

Frontend

React TypeScript Next.js


🏆 Achievements

Achievement Description
🥈 2nd Place — Tehran Provincial AI Competition (2022)
🎓 Member — Iran's National Elites Foundation
📜 Kaggle Notebooks Master
📄 Published Researcher — Health Science Reports (Wiley), ICVPR, AMLAI

📊 GitHub Stats

GitHub Stats GitHub Streak

📚 Publications


💼 Experience

Role Company Period
Senior AI/ML Engineer Daria Hamrah Paytakht Jul 2024 – Present
Senior AI/ML Engineer Educational Industries Research & Innovation Co Nov 2023 – Jul 2024
Data Science Team Lead Diar-e Kohan CO. Sep 2020 – May 2022
Data Scientist Diar-e Kohan CO. Sep 2018 – Sep 2020

🎯 Open To

  • 🚀 Building Agentic AI systems and LLM applications at innovative companies
  • 💼 Production-grade AI systems that solve real business problems
  • 🌍 Collaborating with international teams on cutting-edge AI/ML projects
  • 🤝 Remote positions, contract work, or full-time opportunities worldwide

Let's Connect

LinkedIn GitHub Kaggle Email


⭐ If you find my work interesting, please consider giving my repositories a star!

Building the future of AI, one system at a time.

Pinned Loading

  1. DriveShield DriveShield Public

    Real-time dashcam collision risk prediction with BADAS-Open, FastAPI backend, and a bilingual React dashboard.

    Python

  2. ParaBoostForest-Hybrid-Parallel-Boosting-for-Imbalanced-Learning ParaBoostForest-Hybrid-Parallel-Boosting-for-Imbalanced-Learning Public

    Reproducible credit-card fraud benchmark on Kaggle with Optuna tuning, PR-based thresholding, and publication-ready plots; compares ParaBoostForest, RF, and XGBoost.

    Python