COMPRESSION IN CASSANDRA

Cassandra lets operators configure compression on a per-table basis. Compression reduces the amount of data on disk by compressing the SSTable in chunks of a user-configurable size, chunk_length_in_kb. The CPU cost of compressing data is paid only when the SSTable is written, since…
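To make the chunking idea concrete, here is a minimal sketch in Python. It is not Cassandra's implementation: zlib stands in for whichever compressor a table is configured with, the 16 KiB chunk size is only an illustrative value, and the helper names `compress_chunks` and `read_range` are hypothetical. The point it illustrates is that a read only has to decompress the chunks that overlap the requested byte range.

```python
import zlib


def compress_chunks(data: bytes, chunk_length_in_kb: int = 16) -> list[bytes]:
    """Split data into fixed-size chunks and compress each chunk on its own."""
    chunk_size = chunk_length_in_kb * 1024
    return [
        zlib.compress(data[start:start + chunk_size])
        for start in range(0, len(data), chunk_size)
    ]


def read_range(chunks: list[bytes], offset: int, length: int,
               chunk_length_in_kb: int = 16) -> bytes:
    """Return `length` uncompressed bytes starting at `offset`.

    Only the chunks overlapping the requested range are decompressed.
    """
    if length <= 0:
        return b""
    chunk_size = chunk_length_in_kb * 1024
    first = offset // chunk_size
    last = (offset + length - 1) // chunk_size
    # Decompress just the chunks that cover [offset, offset + length).
    window = b"".join(zlib.decompress(c) for c in chunks[first:last + 1])
    start = offset - first * chunk_size
    return window[start:start + length]
```

A smaller chunk size means less data is decompressed per read, at the price of a lower compression ratio; a larger one trades the other way.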

STRATEGIES OF COMPACTION

▪️ Unified Compaction Strategy (UCS)
▪️ Size-Tiered Compaction Strategy (STCS)
▪️ Leveled Compaction Strategy (LCS)
▪️ Time Window Compaction Strategy (TWCS)

UNIFIED COMPACTION STRATEGY: For the majority of workloads, including mixed read-write, time-series, read-heavy, and write-heavy workloads, the UnifiedCompactionStrategy (UCS) is…

CHANGE DATA CAPTURE

Change Data Capture (CDC) provides a mechanism to flag specific tables for archival and to reject writes to those tables once the CDC log reaches a configurable size on disk. By setting the table property cdc=true (either when creating the table or altering it),…

BLOOM FILTERS

WHAT ARE BLOOM FILTERS: In the read path, Cassandra merges data held in memory (in memtables) with data on disk (in SSTables). Cassandra uses a data structure called a bloom filter to avoid checking every SSTable data file to find the…
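As a rough illustration of how a bloom filter lets a read skip SSTables, here is a minimal sketch in Python. It is a generic bloom filter, not Cassandra's implementation: the `BloomFilter` class, the `sstables_to_read` helper, the bit-array size, and the use of SHA-256 as the hash are all assumptions made for the example. The property that matters is that a filter can return a false positive but never a false negative, so an SSTable whose filter says "no" can safely be skipped.

```python
import hashlib


class BloomFilter:
    """A small, generic bloom filter over string keys."""

    def __init__(self, num_bits: int = 1024, num_hashes: int = 3):
        self.num_bits = num_bits
        self.num_hashes = num_hashes
        self.bits = bytearray((num_bits + 7) // 8)

    def _positions(self, key: str):
        # Derive num_hashes bit positions by salting the key with an index.
        for i in range(self.num_hashes):
            digest = hashlib.sha256(f"{i}:{key}".encode("utf-8")).digest()
            yield int.from_bytes(digest[:8], "big") % self.num_bits

    def add(self, key: str) -> None:
        for pos in self._positions(key):
            self.bits[pos // 8] |= 1 << (pos % 8)

    def might_contain(self, key: str) -> bool:
        # False means "definitely absent"; True means "possibly present".
        return all(
            self.bits[pos // 8] & (1 << (pos % 8))
            for pos in self._positions(key)
        )


def sstables_to_read(filters: dict[str, BloomFilter], key: str) -> list[str]:
    """Return the names of SSTables whose filter cannot rule out the key."""
    return [name for name, bf in filters.items() if bf.might_contain(key)]
```

With one filter per SSTable, a lookup consults the in-memory filters first and only touches the data files returned by `sstables_to_read`. Making the bit array larger lowers the false-positive rate at the cost of more memory.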

CASSANDRA BACKUPS

WHAT ARE BACKUPS: Apache Cassandra stores data in immutable SSTable files. Backups are copies of the database data that is stored as SSTable files. Backups serve several purposes, some of…