A disciplined approach towards data modelling in non relational spaceThe process of data modelling is a complex and elaborate one. My most important learning over the years has been that one should strive to…Mar 28, 2021Mar 28, 2021
Understanding Monoids using real life examplesYou will find a lot of articles over the internet which delve deep into the world of monoids, running into scores of pages hoping to…Aug 10, 20191Aug 10, 20191
An overview of spark performance optimisationsIf you are new to Spark and suddenly find yourself trying to optimise a fairly complex application then where do you start? Here is a…Apr 7, 20192Apr 7, 20192
Consistency models in distributed systemsWith distributed systems, consistency is not a monolithic construct and an ubiquitous phenomenon; instead it is a spectrum of stronger to…Oct 3, 20181Oct 3, 20181
MapReduce design patterns — Part 1Even if you are remotely familiar with the big data ecosystem, then you will very well know that the simple paradigm of map and reduce…Aug 26, 2018Aug 26, 2018
Making sense out of the fast data and stream processing conundrumWith mushrooming of streaming frameworks, I believe there is way too much literature around some of the underlying constructs. Fast data…Jun 9, 2018Jun 9, 2018
Database storage engines under the hoodWe have often wondered how databases store and manage massive loads of data behind the scenes, allowing developers to focus on more…May 20, 2018May 20, 2018
Content farming in an e-commerce universeWhen you are a big e-commerce company and have a catalogue size thats bigger than population of Mumbai or Delhi, how do you advertise such…Mar 24, 2018Mar 24, 2018
The (presumed) death of the lambda architectureA few years back, I got my hands dirty building a quintessential Lambda architecture that involved computing a user affinity model for an…Dec 6, 20171Dec 6, 20171