Abstract
Cassandra is a distributed database with great scalability and performance that can manage massive amounts of data that is not structured. The experiments performed as a part of this paper analyses the Cassandra database by investigating the trade-off between data consistency andperformance. The primary objective is to track the performance for different consistency settings. The setup includes a replicated cluster deployed using VMWare. The paper shows how difference consistency settings affect Cassandra's performance under varying workloads. The results measure values for latency and throughput. Based on the results, regression formula for consistency setting is identified such that delays are minimized, performance is maximized and strong data consistency is guaranteed. One of our primary results is that by coordinating consistency settings for both read and write requests, it is possible to minimize Cassandra delays while still ensuring high data consistency.
Publisher
Academy and Industry Research Collaboration Center (AIRCC)
Reference14 articles.
1. [1] Github: Benchmarking Cassandra and other NoSQL databases with YCSB. https://github. com/cloudius-systems/osv/wiki/Benchmarking-Cassandra-and-other-NoSQL-databaseswith-YCSB.
2. [2] Mishra, V. (2014), Beginning apache Cassandra development. Apress [E-book].
3. [3] P. Bagade, A. Chandra and A. B. Dhende, "Designing performance monitoring tool for NoSQL Cassandra distributed database," International Conference on Education and e-Learning Innovations, 2012, pp. 1-5, doi: 10.1109/ICEELI.2012.6360579. Eben Hewitt. Cassandra: The Definitive Guide. O'Reilly Media, Inc., 1 edition, 2010.
4. [4] Datamodel - cassandra wiki. http://wiki.apache.org/cassandra/DataModel.
5. [5] Daniel Bartholomew. Sql vs. nosql. Linux J., 2010.
Cited by
1 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献