Shitanshu Pandey
Data & ML engineering: streaming, lakehouse, and end-to-end ML
Open to roles - Delhi NCR and remote
I build reliable data systems and ship ML with confidence: Kafka plus Spark for real-time data, durable storage layers, Airflow for orchestration, and containerized deployment with complete monitoring. I prefer clear documentation, measurable outcomes, and steady iteration.
Background, professional certifications, and recent activity
Medium
Articles →Detailed, source-backed write-ups on data, ML, SWE, and more
Kaggle
Notebooks →Systematic notebooks with reproducible runs and clear metrics
GitHub
View repos →Active projects - SWE builds, data pipelines, experiments, and infra as code