About

HarshitRaj

Curiosity-led builder. Calm execution, consistency, and the kind of growth that lasts.

Harshit Raj

Distributed systemsKafkaObservabilityJavaAWS

I’m a curiosity-led builder. I like understanding how things work, then turning that understanding into something real: a system, a tool, a habit, or a craft.

I’ve built and shipped systems at scale at Amazon and AdPushup, but I don’t want this site to feel like a resume. It’s more of a snapshot of what I’m into: consistency (gym/running), perspective (travel/mountains), and clarity (writing).

The thread connecting everything is simple: show up, iterate, learn, and keep it honest — no heroics, just steady progress.

Story

How I got here

I’m drawn to how things work — and the discipline it takes to get better at anything.

I didn’t start coding because I wanted to build startups or chase titles. I started because I liked understanding how things actually work under the hood. In college, I was more interested in breaking programs, debugging weird issues, and figuring out why something failed rather than just getting output.

Early on, I got exposure to real production systems at scale — the kind that teach you humility fast. I learned that tiny mistakes become real outages, and that good logging, observability, and ownership matter more than cleverness.

I’ve shipped at scale at Amazon and AdPushup, but I don’t want to turn this page into a list of job bullets. What matters to me is the shape of the work: reliability, calm execution, and making things simpler and more predictable over time.

Outside work, I train. I run (even when I don’t feel like it). I travel to reset perspective. I write to think. Those habits keep me grounded — and they show up in how I build, learn, and show up every day.

Tech stack

Tools I trust in production

Not a keyword list — this is what I reach for, and why, when the goal is calm systems.

Backend & APIs

Java + Spring Boot
Production APIs, strong conventions, and boring reliability at scale.
REST (and pragmatic contracts)
Clear boundaries, versioning discipline, and predictable operability.
Go (selectively)
Low-latency services, concurrency-heavy workloads, and simpler binaries.

Messaging & Streaming

Kafka
Durable event pipelines where throughput and replayability matter.
Kafka Streams
Real-time transforms/anomaly detection without building a new platform.

Data Stores

SQL (Postgres/MySQL)
Core source-of-truth data with well-modeled schemas and constraints.
Couchbase
Document + caching patterns where read latency and flexibility matter.

Observability

Prometheus + Grafana
Metrics-first debugging, SLO visibility, and honest capacity signals.
Alerting hygiene (PagerDuty)
Fewer pages, higher signal, and faster incident triage.
Structured logging
Searchable, consistent logs that actually help during incidents.

Cloud & Infrastructure

AWS (Lambda, CloudWatch)
Managed primitives with clear ownership and cost boundaries.
Docker
Reproducible builds and consistent deploys across environments.
IaC (CDK/Terraform)
Make infra reviewable, repeatable, and enforceable over time.

Engineering workflow

CI/CD (Jenkins)
Automate boring steps; push confidence left with tests and checks.
Testing (integration + e2e where needed)
Catch regression at boundaries that break in production.
Design reviews
Decisions age — writing them down keeps systems coherent.

Work (high level)

The shape of my work

I’m intentionally not listing titles/dates here. This is the kind of work I enjoy: systems thinking, reliability, and clear ownership.

Amazon

Large-scale internal systems where correctness, reliability, and good operational hygiene matter.

JavaAWSDockerCI/CD

Core backend flows (think: high-stakes state transitions, retries, and edge cases).
Operational improvements: calmer on-calls, clearer alerts, and fewer surprises.
Cost + performance trade-offs: making systems predictable without over-optimizing prematurely.

AdPushup

High-throughput ad-tech systems with lots of events, lots of edge cases, and real consequences when things drift.

JavaKafkaSQLObservability

Event-driven pipelines and ingestion flows (throughput, retries, backpressure, and correctness).
Monitoring + debugging loops: making issues easier to see and faster to fix.
Real-time signals: catching anomalies and traffic shifts without drowning in noise.

Open Source

Contributions & Maintenance

Small, targeted changes that improve correctness, UX, and long-term maintainability.

node-cron (13M+ weekly downloads)

Fractional-offset scheduling, UTC parsing refactor, and docs improvements.

charmbracelet/huh

Added unit tests and CI coverage for spinner components (Go).

glasskube

UX fixes and stability improvements for the v1.0 release (Go + HTMX).