Model Inference Content on InfoQ
Articles
Local-First AI Inference: A Cloud Architecture Pattern for Cost-Effective Document Processing
The Local-First AI Inference pattern routes 70–80% of documents to deterministic local extraction at zero API cost, reserving Azure OpenAI calls for edge cases and flagging low-confidence results for human review. Deployed on 4,700 engineering drawing PDFs, it cut API costs by 75% and processing time by 55%, while bounding errors through a human review tier.
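The tiered routing described above can be sketched as a confidence-gated pipeline: try deterministic local extraction first, fall back to a cloud LLM call only for edge cases, and flag anything still below threshold for human review. This is a minimal illustrative sketch, not the article's implementation; the threshold value, function names, and toy heuristics are all assumptions.

```python
# Hypothetical sketch of the Local-First AI Inference routing tiers.
# All names, thresholds, and heuristics here are illustrative assumptions.
from dataclasses import dataclass

CONFIDENCE_THRESHOLD = 0.85  # assumed cutoff; the article's value may differ


@dataclass
class ExtractionResult:
    fields: dict
    confidence: float
    route: str  # "local", "cloud", or "human_review"


def local_extract(doc: bytes) -> ExtractionResult:
    """Stand-in for deterministic, zero-API-cost extraction (e.g. layout rules)."""
    # Toy heuristic: documents containing a known marker parse cleanly.
    ok = b"TITLE_BLOCK" in doc
    return ExtractionResult(
        fields={"title": "parsed"} if ok else {},
        confidence=0.95 if ok else 0.30,
        route="local",
    )


def cloud_extract(doc: bytes) -> ExtractionResult:
    """Stand-in for the paid Azure OpenAI tier, invoked only for edge cases."""
    return ExtractionResult(fields={"title": "llm-parsed"}, confidence=0.70, route="cloud")


def route(doc: bytes) -> ExtractionResult:
    result = local_extract(doc)
    if result.confidence < CONFIDENCE_THRESHOLD:
        result = cloud_extract(doc)  # fall back to the API tier
    if result.confidence < CONFIDENCE_THRESHOLD:
        result.route = "human_review"  # bound residual errors via manual review
    return result
```

Under this sketch, a well-structured drawing stays entirely in the free local tier, while a degraded scan escalates through the cloud tier and, if confidence is still low, lands in the human review queue.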
Secure AI-Powered Early Detection System for Medical Data Analysis & Diagnosis
In this article, the author discusses techniques for securing AI applications in healthcare, using an early detection system for medical data analysis and diagnosis as a use case. The proposed layered architecture includes application components that support secure computation, AI modeling, governance and compliance, and monitoring and auditing.