InfoQ Homepage Database Content on InfoQ
-
API Friction Complicates Hunting for Cloud Vulnerabilities. SQL Makes it Simple
APIs can tell you everything about your cloud infrastructure, but they're hard to use and work in different ways. What if you could write simple SQL queries that call APIs for you and put results into a database? Steampipe, an open-source project that maps APIs to Postgres foreign tables, makes that dream come true. It's hard enough to reason over data. Acquiring it should be easy, and now it is.
-
Embracing Cloud-Native for Apache DolphinScheduler with Kubernetes: a Case Study
This article shares how Apache DolphinScheduler was updated to use a more modern, cloud-native architecture. This includes moving to Kubernetes and integrating with Argo CD and Prometheus. This improves substantially the user experience of deploying, operating, and monitoring DolphinScheduler.
-
Developing a Cloud-Native Application on Microsoft Azure Using Open Source Technologies
Cloud native is a development approach that improves building, maintainability, scalability, and deployment of applications. My intention with this article is to explain, in a pragmatic way, how to build, deploy, run, and monitor a simple cloud-native application on Microsoft Azure using open-source technologies.
-
A Recipe to Migrate and Scale Monoliths in the Cloud
In this article, I want to present a simple cloud architecture that can allow an organization to take monolithic applications to the cloud incrementally without a dramatic change in the architecture. We will discuss the minimal requirements and basic components to take advantage of the scalability of the cloud.
-
Raft Engine: a Log-Structured Embedded Storage Engine for Multi-Raft Logs in TiKV
In this article, authors discuss the design and implementation of Raft Engine, a log-structured embedded storage engine introduced in TiDB distributed, NewSQL database version 5.4. They also discuss the performance benefits of the engine compared to the previous implementation based on RocksDB.
-
Designing Secure Tenant Isolation in Python for Serverless Apps
Software as a Service (SaaS) has become a very common way to deliver software today. While providing the benefits of easy access to users without the overhead of having to manage the operations themselves, this flips the paradigm and places the responsibility on software providers for maintaining ironclad SLAs, as well as all of the security and data privacy requirements.
-
Two Must-Have Tools for Jakarta EE Developers
The wildfly-jar-maven-plugin and the brand new wildfly-datasources-preview-galleon-pack from the WildFly project are worthy of your attention. These tools add on-the-fly generation of an Uber JAR including configuration for containerization and datasources, and make it a pleasure to write applications for Jakarta EE and WildFly.
-
Data Patterns for the Edge: Data Localization, Privacy Laws, and Performance
With growing competition to get data that power experiences to the end-user closer and closer and the advent of local data privacy laws, let's look at different enterprise data patterns like “synchronous data retrieval”, “subsequent data retrieval” and “prefetch data retrieval” on data center.
-
Building End-to-End Field Level Lineage for Modern Data Systems
In this article, the authors discuss the data lineage as a critical component of data pipeline root cause and impact analysis workflow, and how automating lineage creation and abstracting metadata to field-level helps with the root cause analysis efforts.
-
An Introduction and Tutorial for Azure Cosmos DB
Azure Cosmos DB is a globally distributed, JSON-based database delivered as a ‘Platform as a Service’ (PaaS) in Microsoft Azure. Learn about the benefits and disadvantages of Azure Cosmos DB. Find out more about this database and discover how to interact with it using tools, SDKs, and APIs.
-
The Next Evolution of the Database Sharding Architecture
In this article, author Juan Pan discusses the data sharding architecture patterns in a distributed database system. She explains how Apache ShardingSphere project solves the data sharding challenges. Also discussed are two practical examples of how to create a distributed database and an encrypted table with DistSQL.
-
Developing Deep Learning Systems Using Institutional Incremental Learning
Institutional incremental learning promises to achieve collaborative learning. This form of learning can address data sharing and security issues, without bringing in the complexities of federated learning. This article talks about practical approaches which help in building an object detection system.
Resources
The Data Divide: Top Challenges Facing Enterprise Data Teams
Now, as companies navigate their new normal in a hybrid environment, they must also harness the power of their data and support those responsible for managing it. Download Now.
Guide to the Lakehouse: Unite your data teams in the cloud to bridge the information gap
Learn how the lakehouse helps future-proof your data-driven business, and how the right data integration platform can simplify & accelerate data project creation, without compromising on sophistication. Download Now.
Close the Information Gap: How to Succeed at Analytics in the Cloud
Download this guide to better understand what a data-driven business looks like and the dangers of having an information gap across data that's distributed, diverse, and dynamic. Download Now.