Rohit Chopra

Cassandra DB: Internal Architecture


Cassandra is a distributed NoSQL database designed to handle large amounts of data across many commodity servers while providing high availability with no single point of failure. Its key architectural components are:

  1. Node: A node is a single running instance of Cassandra. It stores part of the data set and participates in a cluster.

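  2. Cluster: A cluster is a group of nodes that work together to provide data storage and access. Nodes are peers with no master, and each node owns only a portion of the data rather than a full copy, so any node can accept a client request and route it to the nodes that hold the relevant data.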

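  3. Data Partitioning: Cassandra distributes data across the nodes in the cluster using a partitioner, which hashes each row's partition key to a token that determines the owning node. The available partitioners are Murmur3Partitioner (the default since Cassandra 1.2), RandomPartitioner, and ByteOrderedPartitioner.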

  4. Replication: Cassandra replicates data across multiple nodes in a cluster for reliability and availability. The replication factor, set per keyspace, determines how many copies of each piece of data are stored on different nodes.

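  5. Gossip Protocol: Cassandra uses a gossip-based peer-to-peer protocol for node discovery, failure detection, and the spread of cluster state (such as membership and node status) between nodes. Gossip carries metadata about the cluster, not the stored data itself.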

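  6. Consistency Level: Cassandra provides tunable consistency levels (for example ONE, QUORUM, and ALL), chosen per read or write. The level sets how many replicas must respond before the operation succeeds, which lets you trade consistency against availability and latency.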

  7. Column Family: Cassandra organizes data into column families, which are called tables in CQL and are similar to tables in a relational database. Each column family contains multiple rows of data, and each row is identified by a unique primary key.

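  8. SSTables: Cassandra stores data on disk in immutable, sorted files called SSTables (Sorted String Tables). Writes are first buffered in an in-memory memtable and later flushed to a new SSTable, a design that favors write-heavy workloads. Because SSTables are never modified in place, a background compaction process merges them over time.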
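
To make items 3 and 4 more concrete, here is a minimal Python sketch of a token ring. It is an illustration under stated assumptions, not Cassandra's actual code: `token_for`, `TokenRing`, and the node names are hypothetical, each node has a single token (real clusters usually use many virtual nodes), MD5 from the standard library stands in for the partitioner's hash, and replicas are simply the next nodes clockwise on the ring with no awareness of racks or data centers.

```python
import bisect
import hashlib
from typing import Dict, List


def token_for(key: str) -> int:
    """Hash a partition key to a 64-bit token (MD5 is a stand-in hash)."""
    digest = hashlib.md5(key.encode("utf-8")).digest()
    return int.from_bytes(digest[:8], "big")


class TokenRing:
    """Toy ring: each node owns the token range that ends at its own token."""

    def __init__(self, node_tokens: Dict[str, int]) -> None:
        # Sort nodes by token so the ring can be walked clockwise.
        self._ring = sorted((tok, name) for name, tok in node_tokens.items())
        self._tokens = [tok for tok, _ in self._ring]

    def replicas(self, key: str, replication_factor: int) -> List[str]:
        # First node whose token is >= the key's token, wrapping past the end.
        start = bisect.bisect_left(self._tokens, token_for(key)) % len(self._ring)
        # Never return more replicas than there are nodes.
        count = min(replication_factor, len(self._ring))
        return [self._ring[(start + i) % len(self._ring)][1] for i in range(count)]


if __name__ == "__main__":
    ring = TokenRing({
        "node-a": 0,
        "node-b": 2**62,
        "node-c": 2**63,
        "node-d": 3 * 2**62,
    })
    # Three distinct nodes hold a copy of this row.
    print(ring.replicas("user:42", replication_factor=3))
```

The same key always maps to the same nodes, which is what lets any node in the cluster route a request without a central lookup table.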

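The consistency trade-off in item 6 comes down to simple arithmetic: a quorum is a majority of the replicas, and a read is guaranteed to overlap the most recent successful write when the number of replicas read plus the number written exceeds the replication factor. The sketch below works through that rule. The function names are hypothetical helpers, and only a handful of the consistency levels are modeled.

```python
def quorum(replication_factor: int) -> int:
    """A quorum is a strict majority of the replicas."""
    return replication_factor // 2 + 1


def replicas_required(level: str, replication_factor: int) -> int:
    """Number of replicas that must respond at a given consistency level."""
    levels = {
        "ONE": 1,
        "TWO": 2,
        "THREE": 3,
        "QUORUM": quorum(replication_factor),
        "ALL": replication_factor,
    }
    return levels[level]


def read_overlaps_write(write_level: str, read_level: str,
                        replication_factor: int) -> bool:
    """True when every read set must share a replica with every write set."""
    written = replicas_required(write_level, replication_factor)
    read = replicas_required(read_level, replication_factor)
    return written + read > replication_factor


if __name__ == "__main__":
    # With 3 replicas, QUORUM means 2, and 2 + 2 > 3, so reads see the write.
    print(read_overlaps_write("QUORUM", "QUORUM", 3))
    # ONE + ONE = 2, which is not > 3, so a read may hit a stale replica.
    print(read_overlaps_write("ONE", "ONE", 3))
```

Lower levels such as ONE respond faster and tolerate more failed nodes, at the cost of possibly returning stale data until the replicas converge.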

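The storage idea behind item 8 can be sketched as a toy in-memory model: writes land in a mutable buffer, and a full buffer is frozen into a sorted, immutable table. Everything here is a simplification for illustration. `ToyStore` and its methods are hypothetical names, nothing is written to disk, and reads simply prefer the newest table, whereas Cassandra reconciles versions by write timestamp and also uses a commit log, bloom filters, and compaction.

```python
import bisect
from typing import List, Optional, Tuple


class ToyStore:
    """Toy write path: a mutable memtable flushed to immutable sorted tables."""

    def __init__(self, memtable_limit: int = 2) -> None:
        self.memtable = {}
        # Each flushed table is a tuple of (key, value) pairs sorted by key.
        # Tables are appended oldest first and never modified afterwards.
        self.sstables: List[Tuple[Tuple[str, int], ...]] = []
        self.memtable_limit = memtable_limit

    def write(self, key: str, value: int) -> None:
        self.memtable[key] = value
        if len(self.memtable) >= self.memtable_limit:
            self.flush()

    def flush(self) -> None:
        """Freeze the memtable into a new sorted, immutable table."""
        if self.memtable:
            self.sstables.append(tuple(sorted(self.memtable.items())))
            self.memtable = {}

    def read(self, key: str) -> Optional[int]:
        # Check the newest data first: the memtable, then tables newest to oldest.
        if key in self.memtable:
            return self.memtable[key]
        for table in reversed(self.sstables):
            keys = [k for k, _ in table]
            i = bisect.bisect_left(keys, key)  # binary search works: keys are sorted
            if i < len(table) and table[i][0] == key:
                return table[i][1]
        return None
```

Because a flushed table is never edited, an update to an existing key just produces a newer entry in a later table. That is why a read may need to consult several tables, and why merging tables in the background keeps reads from slowing down over time.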
