Hara.

Hi, I'm Abhishek Singh

I turn petabyte-scale data into sub-second insight.

Senior Data Engineer with 3.5 years of experience building cloud data architecture on Azure and GCP. I specialize in high-volume ingestion, T-SQL and PySpark pipelines, and resilient scraping infrastructure that turns raw data into analytics that teams can actually use.

Selected work

Legacy SQL Server → Azure Analytics Modernization

2024

Modernized an end-to-end analytics platform by migrating legacy SQL Server workloads to Azure. Moved storage to a multi-layer Data Lake on ADLS with file-level partitioning, refactored monolithic stored procedures into Airflow-orchestrated Python/PySpark pipelines, and enabled self-serve, ad-hoc analytics through Synapse SQL Pools.

Azure Synapse · Databricks · PySpark · ADLS · Airflow · T-SQL

Cross-Cloud E-commerce Analytics Platform

2025

Designed the backend data architecture for a cloud-native e-commerce analytics platform, using BigQuery for analytical workloads and MongoDB for low-latency operational data. Built FastAPI services to expose analytics to the web app, with a clean separation between analytical and transactional layers.

GCP BigQuery · MongoDB · FastAPI · Databricks · PySpark · Azure

Latest writing
