AQE optimisationsIn prev blog I discussed internal of AQE and how AQE nodes converge to a final physical plan. In this blog I will some of the optimisations…Oct 7, 2024Oct 7, 2024
Adaptive Query Execution in SparkAdaptive Query Execution (AQE) in Apache Spark is an advanced optimization technique introduced to dynamically adjust query execution plans…Oct 6, 2024Oct 6, 2024
WholeStageCodeGen Internal workingIn our prev blog we discussed WSCG and problem this solves in volano iterator model. In this blog post we will discuss internal…Oct 5, 2024Oct 5, 2024
Vectorized execution in SparkIn prev blog post I discussed about volcano iterator and WSCG. Volacano iterator model suffered from too many virtual calls and WSCG…Oct 5, 2024Oct 5, 2024
Whole Stage Code Gen in SparkWSCG is a query plan optimisation technique used in most DBMS systems. Before undestanding about WSCG we need to first understand issues…Oct 4, 2024Oct 4, 2024
Vector Clocks In Distributed SystemA vector clock is a data structure used for determining the partial ordering of events in a distributed system and detecting causality…May 13, 2023May 13, 2023
Database replicationReplication means keeping a copy of the same data on multiple machines that are connected via a network. There are several reasons why you…May 13, 2023May 13, 2023
CRDT in Distributed SystemsConflict-Free Replicated Data Types (CRDTs) are a family of data types that can be updated without synchronization and do not require…Apr 3, 2021Apr 3, 2021