SAOUD YAHYA

MLOps & DevOps Engineer • Cloud Architect • AI Research

Transforming AI breakthroughs into production-ready solutions through scalable cloud infrastructure

Specializing in bridging the gap between cutting-edge AI research and production-ready systems. Building scalable, intelligent infrastructures with cloud-native technologies and enterprise-grade microservices architectures.

🏆 Water4Future Hackathon Winner - 1st place among 20+ teams (UNESCO & Cadi Ayyad University)
saoud@cloud: kubectl get engineer --selector=expertise=mlops,devops |
NAME          READY   STATUS    CERTIFICATIONS
saoud-yahya   1/1     Running   KCNA,LFS250

Docker

Containerization

Kubernetes

Orchestration

AWS

Cloud Platform

Terraform

Infrastructure as Code

ArgoCD

GitOps

MLOps

ML Production

KCNA Certified

Kubernetes and Cloud Native Associate

Cloud Native Computing Foundation

LFS250 Certified

Kubernetes and Cloud Native Essentials

The Linux Foundation

Master's in AI

Artificial Intelligence (In Progress)

Faculté des Sciences Semlalia, Marrakech

Technical Stack

Infrastructure & DevOps

Docker Kubernetes AWS EKS Terraform ArgoCD Jenkins Helm Kustomize Prometheus Grafana ELK Stack

AI/ML & MLOps

TensorFlow PyTorch MLOps LangChain Computer Vision NLP YOLOv9 Transformers Agentic RAG Model Deployment

Development & Architecture

Python Java Spring Boot Go React Next.js FastAPI Flask Microservices Apache Kafka PostgreSQL MongoDB

Security & Quality

DevSecOps SonarQube Trivy OWASP Container Security Vulnerability Scanning GitOps Security

Education & Experience

Education

Oct 2024 - Oct 2026

Master's Degree, Artificial Intelligence

Faculté des Sciences Semlalia, Marrakech

Currently Pursuing
Oct 2023 - Jun 2024

Bachelor's Degree, Computer Science

École Supérieure de Technologie de Safi

Completed
2021 - 2023

Technology Studies

École Supérieure de Technologie de Safi

Foundation

Professional Experience

May 2024 - Jun 2024

DevOps Developer Intern

Country Technologie

Casablanca, Casablanca-Settat, Morocco

2-month internship focusing on DevOps practices and cloud infrastructure
Apr 2023 - May 2023

Software Developer Intern

2M TV

Casablanca-Settat, Morocco • On-site

2-month internship in software development and media technology

Featured Projects

NexusCommerce Platform

Complete Cloud-Native E-Commerce Ecosystem

May 2025 - Aug 2025
🎯 Project Impact

Built a resilient, scalable microservices ecosystem that revolutionizes e-commerce performance with enterprise-grade reliability.

8
Microservices
70%
API Reduction
<200ms
Response Time
One-Command
Deployment

A modern e-commerce platform where microservices work in harmony to deliver exceptional shopping experiences. Features event-driven architecture, service mesh, and comprehensive observability.

🚀 Key Achievements: Zero-downtime deployments • Enterprise-grade scalability • Production-ready
Java 17 Spring Boot 3.x Go Apache Kafka Next.js 15 React 18 TypeScript AWS EKS Terraform ArgoCD Helm Istio Redis
  • Built 8 microservices using Java Spring Boot and Go, deployed on AWS EKS
  • Complete GitOps automation with ArgoCD for zero-downtime deployments
  • Event-driven architecture with service mesh (Istio) and async Kafka communication
  • Seamless kind-to-EKS production migration with one-command deployment
  • Comprehensive observability: Prometheus, Grafana, ELK Stack
  • Multi-environment CI/CD with customer frontend (Next.js) and admin dashboard

Kubernetes-Based Machine Learning Platform

Scalable ML Workflows on Kubernetes

Developed a comprehensive Kubernetes project with three Flask applications: LSTM for fake news detection, an ML pipeline for S&P 500 prediction, and GPT-2 for text generation. Deployed and managed using kubectl with comprehensive monitoring through Prometheus and Grafana for real-time insights into scalable and reliable ML workflows.

🤖 ML Pipeline: LSTM + S&P 500 + GPT-2 • Prometheus monitoring • Kubernetes orchestration
Kubernetes Flask LSTM GPT-2 Python Prometheus Grafana kubectl Docker TensorFlow MLOps
  • Three specialized Flask applications: fake news detection, S&P 500 prediction, text generation
  • LSTM neural networks for sequential data processing and prediction tasks
  • GPT-2 integration for advanced natural language generation capabilities
  • Kubernetes deployment with kubectl for container orchestration and scaling
  • Real-time monitoring and metrics collection using Prometheus and Grafana
  • Scalable ML workflows with automatic resource management and fault tolerance

Bio Data Extraction - Animal NER

Self-Supervised Named Entity Recognition System

June 2025

Developed an innovative Named Entity Recognition (NER) system for extracting animal entities from scientific PDF documents using self-supervised learning techniques. This project addresses the critical challenge of traditional NER systems that require massive manually annotated datasets, which typically cost $50,000-$100,000 per domain and take 6-12 months for expert annotation.

💡 Innovation: Eliminated $50K-$100K annotation costs • Superior BERT baseline performance
Deep Learning NLP Self-Supervised Learning Contrastive Learning PDF Processing PyTorch Python Neural Networks
  • Self-Supervised Entity Relationship Learning eliminating expensive manual annotation
  • Ultra-Strict Filtering mechanisms ensuring high-quality entity extraction
  • Contrastive Learning methods improving recognition accuracy and relationships
  • Cost-effective alternative to domain-specific datasets with cross-discipline generalization
  • Scalable solution for biodiversity research and scientific literature analysis
  • Overcame traditional NER limitations and computational expense of standard transformers

Scaling Laws & Mechanistic Interpretability

Transformer Architecture Research

May 2025 - Jun 2025

Conducted comprehensive research investigating scaling behaviors of standard Transformers vs Switch Transformers, examining how model parameters affect performance and internal representations. This work addresses fundamental questions about optimal model design in large-scale neural networks.

🔬 Research Impact: Switch-Large 120.91 PPL • Enhanced SAE interpretability • Open-source framework
Transformer Architecture Scaling Laws Sparse Activation Switch Transformers Mechanistic Interpretability Sparse Autoencoders PyTorch Research
  • Architecture-Dependent Scaling Discovery with opposite coefficients (-0.0381 vs +0.0173)
  • Enhanced Mechanistic Interpretability Framework using SAE-enhanced superposition analysis
  • Performance Optimization with Switch-Large achieving best performance (120.91 PPL)
  • Analyzed 16 model configurations ranging from 13.1M to 597.3M parameters
  • Developed superposition scoring methodology with consistent sparsity patterns
  • Created practical guidelines for architecture selection in resource-constrained environments

SEE Smart Glasses

AI-Powered Accessibility Solution

Intelligent accessibility solution combining computer vision and NLP to assist visually impaired users with real-time environmental awareness and voice interaction. Winner of accessibility innovation awards.

YOLOv9 Computer Vision NLP React Native FastAPI Docker ViT-GPT2 Whisper ASR
  • Real-time object detection and scene description using YOLOv9
  • Voice interaction with AI-powered natural language processing
  • Hardware-software integration with mobile companion app
  • Containerized deployment for scalability and maintainability

Agentic RAG & LangGraph System

Advanced AI Content Generation

Sophisticated LLM-based content generation system with retrieval-augmented generation capabilities using LangChain and multi-agent orchestration for intelligent decision-making.

LangChain Agentic RAG LangGraph Vector DB Python Docker
  • Multi-agent RAG systems with intelligent orchestration
  • Real-time generation with vector database integration
  • Horizontal scaling with containerized deployment
  • Advanced prompt engineering and context management

Enterprise Microservices Platform

Production-Ready Microservices Architecture

Scalable microservices ecosystem built with Spring Boot, Spring Cloud, and Docker, featuring service discovery, distributed tracing, and comprehensive CI/CD pipelines.

Spring Boot Spring Cloud Docker RabbitMQ Jenkins Zipkin Eureka
  • Microservices with service discovery and load balancing
  • Asynchronous messaging with RabbitMQ integration
  • Distributed tracing with Zipkin for monitoring
  • End-to-end CI/CD pipelines with Jenkins automation
  • Multi-database support (PostgreSQL & MongoDB)

Get In Touch

Email

saoudyahya123@gmail.com

Location

Casablanca, Morocco

Ready to Build Something Great?

MLOps Excellence • Cloud-Native Solutions • AI Innovation • Enterprise Architecture

# "Bridging AI Research with Production Reality"