Hello world!
I’m a Senior Engineering Leader with deep hands-on expertise in designing and scaling distributed systems, data platforms, and cloud-native architectures. Currently, I work as a Senior Architect at Josys, where I’m also a founding engineer—helping build the core platform, engineering foundations, and architectural direction as the company scaled from early-stage to a mature enterprise SaaS.
Over the last 13+ years, I’ve worked across telecom, legal, and enterprise SaaS domains, operating at the intersection of technology, product, and business. My experience goes beyond system design—I’ve been closely involved in shaping engineering practices, platform strategy, and scale-readiness decisions that enable teams to move fast without compromising reliability or long-term growth.
I enjoy solving complex problems that sit at the edge of scale and ambiguity: architecting resilient distributed systems, building data and search platforms, optimizing cost and performance, and mentoring teams to think in systems rather than features. This site is a snapshot of my journey—what I’ve built, how I think, and the principles I apply when designing technology and teams meant to scale.
Writings
Engineering Decisions behind introducing transactions in AWS DynamoDB: https://medium.com/@vipulbest1/engineering-decisions-behind-introducing-transactions-in-aws-dynamodb-3466a89af4e5
Understanding NodeJs Event Loops, Phases and beyond…(Part1): https://medium.com/@vipulbest1/understanding-nodejs-event-loops-phases-and-beyond-part1-e87f40d0e3c6
Handling Concurrency through database in rails: Part 2 : https://medium.com/@vipulbest1/handling-concurrency-through-database-in-rails-part-2-136e5420bce0
Handling Concurrency through database in rails: Part 1: https://medium.com/@vipulbest1/handling-concurrency-through-database-in-rails-part-1-8619ffe75f2d
Distributed Locks Using Ruby and Redis: https://medium.com/@vipulbest1/distributed-locks-using-ruby-and-redis-2e0a2c7b9765
Thread Safety in Ruby and the mystical GIL: https://medium.com/@vipulbest1/thread-safety-in-ruby-and-the-gil-884b6bc47219
ElasticSearch: Understanding Nested Data Types: https://medium.com/@vipulbest1/elasticsearch-understanding-nested-data-types-8f523364e07
Engineering Beyond the Code: What the world needs!
Mysql/AWS RDS : MyIASM and InnoDB Engine
MongoDB and AWS DocumentDB
ElasticSearch and AWS Opensearch
AWS RDS with Scalable PostgreSQL,
Familiarity with YugaByteDB (Infinitely Scalable Postgress)
AWS DynamoDB
Index management ranging from RDBMS to documentStores :
B+trees, secondaryIndexes, 2DSphere Indexes, Inverted TreesHigh Write Dbs: Spanner, RocksDB, Ubers M3DB
ACID, Data consistency, Sharding, Replication strategies
Transactional Guarantees in highly scalable distributed document store.
Ruby and Redis based job framework: sidekiq, sidekiq-unique, rescue , delayed-job
Integrated Shoryuken: Multi threaded Ruby SQS Poller
Node and Redis based job framework: BullMQ
AWS SQS, AWS SNS, AWS EventBridge
Queue BackPressure and concurrency control in low latency , high throughput NodeJS systems using p-limit, async-queue and p-queue
Parquet: Columnar File Formats
Delta Lake, Iceberg TableFormats on top of parquet to provide snapshot isolations, MVCC, transactions and much more
Architected high load pipelines with Apache Spark
CDC Pipelines using debezium, Kafka and AirByte
Architected multiple systems using medallion architecture
Data Pipeline Orchestration on Apache Airflow and AWS MWAA. Scaling thousands of DAGs for consistent performance
and latency. Architected the entire DAG pipeline for scheduled and high priority DAGs
Streaming Pipelines using Spark Streaming and Flink
Architected and guided team on Apache Pinot integration for low latency user queries.
Microservice orchestration and SAGA using Temporal
Deployed Temporal on EKS.
Data Engineering workflows using AirFlow DAGs.(AWS MWAA)
Producers
Compressing streaming payloads using zstd, gzip and snappy
Batching and fine tuning payload size and format write consensus and quorom strategy
Atleast , Atmost and Exactly once publish semantics
Idempotent Publish of events
Handling InSyncReplicas and ACKs
De-Duplication of published events
Manual and auto commit of offsets
Transactional Guarantees
Order of publish
Partitioning strategies
Consumers
Key decisions on order of consumptions
Designing idempotent consumer
De-duplication on consumers
TimeOuts and Retries
Distributed locking for duplicate consumption
Out of order consumption of stale events
Strict consumption order when there is dependency between two events
Offset Management : Log End and committed offset, Consumer Lag monitoring and alerts, FineTuning consumer groups
Partition assignment strategies
Developed , deployed and maintained search systems with Read QPS of more than 1000-2000 RPS.
Handle different tokenization approach for different languages
Text based searches, Faceted and fuzzy searches
Taking critical decisions on index model design to faster searches. Designing efficient sharding strategy
Designing efficient replication strategy
Optimistic Concurrency Control and handling stale events
Handling Event Storms for CQRS architecture
TradeOffs between Full Immutable writes vs Partial mutable writes
Handling deep pagination
Databases: Aurora, documentDB, Opensearch DynamoDB, AWS TimeStreams
CI/CD: AWS CodeBuild and CodePipeline
Queues: Sqs, Sns, Event Bridge
Storage: EBS, S3
S3 Federated Query: Athena
Infrastructure: AWS ECS, EKS, EMR
Auto scaling policies, Cloudwatch logs, ALB grouping,
AWS Cloud Map, ECS Discovery Service using AWS Cloud Map, AWS Parameter store, AWS KMS