hey, i'm

Vaibhav Hariram

building paprika — runtime governance for ai agents. trace capture, policy enforcement, deterministic replay. deployed to 25+ engineering teams.

prev. swe intern @ railinc (summer '24 + '25). react + java/spring boot on the RIGIS platform. etl pipeline processing 100K+ rail stations.

studying cs @ berkeley

fine-tuned 7B–13B LLMs on 1.2M samples, distributed training across 4×A100s for berkeley's gorilla project (BAIR). cited by nvidia, anthropic, openai teams.

researcher @ uc berkeley (eecs), satellite imagery + ML to detect crypto mining development in el salvador (Potts Lab)

personal projects

routeXrouting engine that does in <1ms what dijkstra does in 45. contraction hierarchies over 280K bay area road segments, postgis geofencing, xgboost eta predictions.

stack: c++ • postgis • xgboost • osm

swandbfeature store that actually respects time. pit-correct joins, <8ms online serving, parquet storage that doesn't lie to your model.

stack: duckdb • redis • parquet

chroniclellm inference server — 32% throughput over hf baseline. micro-batching + kv-cache reuse. <200ms p95 at 100 concurrency.

stack: python • cuda • mistral-7b

snare(see repo)

open source

vLLM / PyTorch — merged performance optimization + distributed benchmarking PRs upstream

geofilter — numba jit point-in-polygon engine. 10M points in 0.58s. wrote it because existing spatial filters were too slow and i was impatient.

reqbench — load-testing cli sustaining 4.8K req/sec at 500 concurrency with p95/error ci gating

lore

pretty good at geography trivia

played trevor rainbolt in ranked geoguessr. got absolutely cooked tho

played carnatic violin for 10 years

huge chelsea fan + chicago sports

grew up in cary, north carolina. escaped.

tl;dr

i love geospatial ai/ml + geotech: geospatial data, machine learning, and real systems people actually use

currently

berkeley → scale and build wherever and whatever i can

still thinking about that caleb williams throw…

some other things

coursework: data structures, algorithms, discrete math, structure of computer programs, a.i., machine learning, networking + internet protocols, database systems, computer vision, data science, probably more ai/ml

languages: python, c++, typescript, java, sql

ml + systems: FSDP, AMP, CUDA, KV-cache, distributed fine-tuning, eval pipelines, quantization

frameworks: react, fastapi, spring boot, node.js, duckdb, langgraph, pytorch, xgboost

infra: aws, docker, redis, postgresql, postgis, gdal, arcgis pro, linux, ci/cd

interests: cities + public transportation, collecting old maps, tamil culture, soccer jerseys, geoguessr, chicago sports, international music

less frequently: south indian movies, home decor, sketching, online chess, sparkling water

other things

used to watch anime a little bit

trying to learn to dj and cook (or both at the same time)

will run a marathon this year

trying to attend more hackathons

roast my spotify playlists