InfoQ Homepage Architecture & Design Content on InfoQ
-
Spring AI 1.0 Delivers Easy AI Systems and Services
AI is here to stay, and it represents a unique and wonderful opportunity for Java and Spring developers. For most people, “AI engineering” simply means calling an LLM-based service over HTTP. In this article, we’ll examine the new Spring AI 1.0 project and explore how it can be used to integrate AI more effectively.
-
Architecting the MVP in the Age of AI
AI enhances software architecture by informing decisions, suggesting alternatives, and streamlining documentation. While it can’t replace human judgment, it accelerates MVP development and supports experimentation, trade-off analysis, and technical debt management when provided with sufficient context.
-
Optimizing Search Systems: Balancing Speed, Relevance, and Scalability
Innovative software engineer focused on optimizing search performance in dynamic environments. This article highlights key strategies from our QCon San Francisco 2024 presentation, addressing challenges faced by platforms like Uber Eats in data indexing and retrieval. Our advancements ensure swift, relevant user experiences amidst ever-growing datasets.
-
Agentic AI Architecture Framework for Enterprises
To deploy agentic AI responsibly and effectively in the enterprise, organizations must progress through a three-tier architecture, Foundation tier, Workflow tier, and Autonomous tier where trust, governance, and transparency precede autonomy.
-
Engineering Principles for Building a Successful Cloud-Prem Solution
Discover how Cloud-Prem solutions combine cloud efficiency with on-premise control, meeting data sovereignty and compliance demands while optimizing operational costs and enhancing customer security.
-
Analyzing Apache Kafka Stretch Clusters: WAN Disruptions, Failure Scenarios, and DR Strategies
Proficient in analyzing the dynamics of Apache Kafka Stretch Clusters, I assess WAN disruptions and devise effective Disaster Recovery (DR) strategies. With deep expertise, I ensure high availability and data integrity across multi-region deployments. My insights optimize operational resilience, safeguarding vital services against service level agreement violations.
-
Decentralized Architecture Needs More Than Autonomy
Architectural success in decentralized systems depends more on how decisions are made than on system design alone. Replacing control with trust requires visible, structured practices—such as ADRs and advice forums—to build confidence and clarity.
-
We Took Developers out of the Portal: How APIOps and IaC Reshaped Our API Strategy
Dynamic API strategist with expertise in transforming legacy management into efficient APIOps frameworks using Infrastructure as Code (IaC). Proven track record in automating API lifecycles, enhancing security, and fostering developer productivity through CI/CD integration. Adept at driving operational excellence and consistency across environments, enabling rapid deployment and innovation.
-
Using Traffic Mirroring to Debug and Test Microservices in Production-Like Environments
Traffic mirroring has evolved from a network security tool to a robust method for debugging and testing microservices using real-world data. By safely duplicating production traffic to a shadow environment, teams can replicate elusive bugs, profile performance under actual load, validate new features, and detect regressions, ensuring that production remains isolated and user experiences intact.
-
Bringing GPU-Level Performance to Enterprise Java: a Practical Guide to CUDA Integration
Java developers are no longer limited by CPU cores. This guide explores how to bring GPU-level acceleration to enterprise Java using Compute Unified Device Architecture (CUDA), with a practical Java Native Interface (JNI)-based integration pattern, real-world use case, and performance benchmarks. If you're solving high-throughput challenges, this article shows how to make Java truly parallel.
-
The MVP Dilemma: Scale Now or Scale Later?
Scaling a system is a hard problem to solve. Underinvesting in scalability leads to a shortened lifespan for the system, but overinvesting can kill the MVP business case because of cost.
-
Designing Resilient Event-Driven Systems at Scale
Learn how to design resilient event-driven systems that scale. Explore key patterns like shuffle sharding and decoupling queues to handle load spikes and failures. Understand common pitfalls like over-relying on retries and neglecting observability for robust, scalable architectures.