Aaditya Bhilegaonkar

Data engineer.
Building pipelines
teams rely on.

Bio

I build pipelines, orchestration systems, and data infrastructure that organizations can trust. I care about data quality, reliability, and code the next engineer can follow. Right now I'm shipping open source contributions to repos at Astronomer and Tesla, working alongside the engineers who built the tools I use every day.

PRs 0 Open pull requests

Issues 0 Issues claimed

Companies 0 Active targets

Repos 0 Repos touched

Stack

Python Apache Spark Apache Airflow dbt Apache Kafka Snowflake SQL Go Apache Iceberg AWS

Open Source Contributions

Projects

↗ Claude Code

Data Lineage Claude Skill

Change one column. This finds everything that breaks. Traces all downstream dbt, dashboards, notebooks, and pipelines before you push.

↗ LLM · ETL

LLM-Augmented Metadata Pipeline

LLaMA reads your table metadata, writes the SQL. Medallion ETL on PySpark + AWS Glue. 20% faster ingestion, 30% faster analyst turnaround.

↗ LLM · Retrieval

RAG Retrieval Optimization

Bad chunking kills recall. Benchmarked 3 hybrid strategies on Milvus. Recall@5 up 14%, search latency down 35%.

↗ Dev Tool

cc-catalyst

Same Claude, lower bill. A local proxy between Claude Code and Anthropic's API that cuts token spend without touching your workflow.

↗ Claude Code

claude-vibecheck

Stop shipping code you don't understand. Narrates non-obvious logic in plain English the moment you write it.

↗ Multi-Agent

Agentic Data Engineering Platform

Drop in your schema, walk away. Multi-agent system on Google ADK that handles prep, scheduling, and BigQuery loading. ETL dev from 5+ hours to minutes.

Contact

Let's work
together.

Open to full-time data engineering roles. If your team is building something serious with data, I want to hear about it.

Say hello ↗