InfoQ Homepage NoSQL Content on InfoQ
-
Spark Application Performance Monitoring Using Uber JVM Profiler, InfluxDB and Grafana
In this article, author Amit Baghel discusses how to monitor the performance of Apache Spark based applications using technologies like Uber JVM Profiler, InfluxDB database and Grafana data visualization tool.
-
Challenges of Building a Reliable Realtime Chat Service
Realtime chat has become a common feature of modern applications. These days not only communicators and social networks allow users to talk with each other over the Internet—chat is crucial in healthcare, e-commerce, gaming and many other industries.
-
14 Things I Wish I’d Known When Starting with MongoDB
I’ve been a database person for an embarrassing length of time, but I only started working with MongoDB recently. When I was starting out with MongoDB, there are a few things that I wish I’d known about. With general experience, there will always be preconceptions of what databases are and what they do. In hopes of making it easier for other people, here is a list of common mistakes.
-
A Critique of Resizable Hash Tables: Riak Core & Random Slicing
This fall, Wallaroo Labs will be releasing a large new feature set to our distributed data stream processing framework, Wallaroo. One of the new features requires a size-adjustable, distributed data structure to support growing & shrinking of compute clusters. It might be a good idea to use a distributed hash table to support the new feature, but what distributed hash algorithm should we choose?
-
Picking an Active-Active Geo Distribution Strategy: Comparing Merge Replication and CRDT
Modern distributed applications are fuelling the growing demand for distributed active-active, multi-master databases. While most popular databases support multi-master deployment, different databases employ different techniques. LWW, MVCC, merge replication and CRDTs deliver eventual consistency, offering read and write access with local latency and remaining available during network partitions.
-
Columnar Databases and Vectorization
In this article, author Siddharth Teotia discusses the Dremio database which is based on Apache Arrow with vectorization capabilities.
-
Polyglot Persistence Powering Microservices
At Netflix, the cloud database engineering team is responsible for providing several flavors of data persistence as a service to microservice development teams. Roopa Tangirala explained how her team has created self-service tools that help developers easily implement the appropriate data store for each project's needs.
-
Introducing FaunaDB Serverless Cloud
FaunaDB Serverless Cloud is the managed version of FaunaDB, a serverless, object-relational, globally replicated, strongly consistent, temporal database, that can be deployed on multiple clouds, such as AWS, GCP, and Azure, or on premises.
-
Introducing Reladomo - Enterprise Open Source Java ORM, Batteries Included! (Part 2)
Goldman Sachs is widely known as a leader in investment banking, but they are very much a leading technology firm as well. Continuing our exploration of Reladomo, the primary Java ORM used at GS and now open source, GS Technology Fellow, Mohammad Rezaei looks at advanced features, such as sharding, caching, bitemporal access, performance, and testing.
-
Pascal Desmarets on NoSQL Data Modeling Best Practices
NoSQL databases are specialized to store different types of data like Key Value, Documents, Column Family, Time Series, Graph, and IoT data. Pascal Desmarets talks about how to perform data modeling in NoSQL databases compared to the modeling in Relational databases.
-
Big Data Processing Using Apache Spark - Part 6: Graph Data Analytics with Spark GraphX
In this article, author Srini Penchikala discusses Apache Spark GraphX library used for graph data processing and analytics. The article includes sample code for graph algorithms like PageRank, Connected Components and Triangle Counting.
-
Analysis and Mitigation of NoSQL Injections
NoSQL data storage systems lack the security measures and awareness that are required for data protection. Because code analysis alone is insufficient to prevent attacks in today's typical large-scale deployment, certain mitigations should be done throughout the entire software life cycle.
ChatGPT is fun, but the future is fully autonomous AI for code
ChatGPT and AI tools based on Large Language Model have recently grabbed headlines. But LLMs are not the only way to do AI for code: fully autonomous code-writing is possible by using reinforcement learning to write code that can compile, run and be correct. Learn more in this webinar.