Satish Fulwani — Staff Data Engineer

About Me

Experienced and versatile Staff Data Engineer with 6+ years of delivering scalable, cloud-native data solutions across diverse industries, including Media & Entertainment, Insurance, and Nutrition. Proven expertise in architecting end-to-end data pipelines using Azure Data Factory, Databricks, PySpark, and AWS services such as Glue, Lambda, S3, and DynamoDB. Strong background in building data platforms that enable personalized content delivery, actuarial analytics, and healthfocused insights through efficient, secure, and real-time data processing. In my most recent role, I led a team of 3 data engineers to successfully modernize a legacy data platform by migrating it to Azure, improving pipeline efficiency, and reducing processing time by over 40%. I bring a strong understanding of data modelling, ETL/ELT workflows, and automation strategies that enable organizations to make data-driven decisions faster and with greater accuracy. I’m passionate about driving engineering excellence, mentoring junior team members, and collaborating with cross-functional stakeholders to align data strategy with business goals.

Tech Stack

Python Hadoop Big Data Spark SQL AWS Azure Pandas Databricks Jenkins Keras SciPy Data Science

Experience

Staff Data Engineer — NAGARRO INDIA PVT LTD

Feb 2023 – Present

Led a key project to modernize and migrate legacy data pipelines to Azure Cloud.
This project aimed to improve processing efficiency and scalability by designing new ETL/ELT workflows using Azure Data Factory for orchestration and Databricks for big data processing
The migration enabled real-time data analytics and better decision-making for the business.
Migrated over 1TB of data to Azure SQL DB with zero downtime
Developed real-time data pipelines to support business intelligence.

Data Engineer Consultant — DELOITTE ILLP

Aug 2021 – Feb 2023

As a Data Engineer, I resolved complex big data challenges by aligning data engineering strategies with dynamic business needs
Worked across departments to identify data quality issues, processing bottlenecks, and scalability gaps in high-volume data systems. Designed and implemented solutions using AWS Glue, AWS EMR,AWS S3, AWS Sagemaker, focusing on performance, reliability, and cost-efficiency
Implemented CI/CD pipelines for data workflows.
Reduced data pipeline latency by 40% through optimized transformation logic.
Improved data quality by 55% via automated validation and cleansing
Cut operational costs by 20% through resource-efficient job orchestration.
Enabled near real-time reporting, enhancing executive decision-making.

Big Data Engineer — ACCENTURE PVT LTD

Aug 2021 – Feb 2023

Designed, engineered, and orchestrated highly efficient and cost-effective step functions on AWS to ingest and process data from multiple heterogeneous sources.
Focused on building scalable, fault-tolerant, and performance-optimized workflows using serverless technologies, ensuring seamless data integration and realtime availability for downstream analytics.
Developed serverless architecture using AWS Lambda, Queues and other AWS services.
Framework helped migrated 1.5 million assets from an on-premises system to the cloud.
Developed an automated tool to generate zip files up to 60GB in size containing on-premises data and copy that data to Digital Asset Management (DAM) tool.
The tool's architecture utilized the least number of dedicated AWS EC2 instances and the most amount of all other AWS Serverless Solutions, reducing the migration expense to $100.

Projects

Football Match Data Scraper and ETL Pipeline

This project is a simple yet effective data pipeline that scrapes football match statistics, processes the data, and stores it in a structured CSV format. It serves as a foundational data engineering portfolio project, showcasing skills in web scraping, data cleaning, and structured data storage.

Tech stack: Python Beautiful Soup, Pandas

View Project

COVID19 India Data Analysis

This project demonstrates a data engineering and visualization pipeline that scrapes raw COVID-19 data from an API, processes it, and generates a dynamic "racing" bar chart visualization. This project showcases skills in data extraction, cleaning, and creating animated data visualizations.

Tech stack: Python, Pandas

View Project

Databricks Spark ETL Pipeline

This project demonstrates an end-to-end Extract, Transform, Load (ETL) pipeline using Apache Spark on the Databricks platform. It reads raw data from a CSV file, processes it, and loads the refined data into a SQL database. This project is a solid example of cloud-based data engineering and showcases proficiency with distributed data processing.

Tech stack: Python, Spark, SQL

View Project

FRUIT CLASSIFIER | DEEP LEARNING | APR’23

Conducted an in-depth research project as part of a master’s program, focusing on solving a real-world problem using data-driven techniques. Designed and implemented data pipelines, performed advanced data analysis, and built models aligned with academic and industry standards.

Tech stack: Python, Spark, AWS

View Project

Certifications & Awards

🎓 GCP Certified Cloud Digital Leader
🎓 AWS Certified Machine Learning- Speciality
🎓 AWS Certified Cloud Practitioner
🎓 AWS Certified Developer- Associate
🎓 Machine Learning Advanced Nanodegree
🎓 Machine Learning Foundation Nanodegree
🏆 BRIGHTEST MIND — Awarded by Nagarro
🏆 2nd Prize in Accenture AI Hackathon (2019)
🏆 APEX Award (twice) for Delivery and Profitability

Education

MTech in Data Science and Engineering

BITS Pilani, 2021 – 2023

BE in Computer Science

Mumbai University, 2014 – 2018

High School

Amravati University, 2012 – 2014

Contact

For professional opportunities, collaborations, or inquiries, please reach out using the options below.

+91 9145175111 Email LinkedIn