InfoQ Homepage DevOps Content on InfoQ
-
Operating Pivotal Application Service at Scale
Yusuke Kondo and Akinori Nitta explain the challenges faced and solutions experienced to run and manage a large-scale platform.
-
Managing Systems in an Age of Dynamic Complexity
Laura Nolan looks at the common architectural shapes of dynamic control planes, and some examples of how they fail. Why are dynamic control planes so hard to run, and what can be done about it?
-
PKS Is Not JAK8sP (Just Another Kubernetes Platform)
Cornelia Davis discusses what distinguishes Pivotal Container Service and covers some of the latest advancements coming from the Kubernetes community, such as cluster-api and more.
-
Lessons Learned from Reviewing 150 Infrastructures
Jon Topper presents a structured review of the architectural and operational choices of 150 platform teams, talking about common mistakes and providing advice on how to avoid these.
-
Monitoring All the Things: Keeping Track of a Mixed Estate
Luke Blaney talks about how to approach monitoring an estate of many technologies and what the Financial Times did to improve visibility across systems built by all its teams.
-
Distributed Tracing in the Wild
Adrian Cole, Tommy Ludwig and Narayanan Arunachalam share the “Sites” project, which is an inventory of real-life setups people use today with distributed tracing to increase developer productivity.
-
Pitfalls in Measuring SLOs
Danyel Fisher and Liz Fong Jones discuss how they brought the theory of SLOs to practice, and what they learned that they hadn’t expected in the process.
-
Bootiful Azure Spring Cloud
Julien Dubois, Josh Long discuss how Azure supports service discovery, centralized configuration, database binding, application scaling and monitoring, distributed tracing and blue/green deployment.
-
Building a Scalable Data Science & Machine Learning Cloud Using Kubernetes
Murali Paluru discusses how to leverage Kubernetes within a team using the architecture shared, and some of the common mistakes and pitfalls to avoid.
-
Policy Enforcement on Kubernetes with Open Policy Agent
Aleks Saul and Jaime Gonzalez Aguilar introduce Rego, the language used to describe OPA policies, recent updates to OPA, and break down sample policies for common use cases.
-
Ship Fast and Pay Attention: Five Lessons in Applying Observability
Dan Abel shares lessons learned from shipping more often with fewer tests, and how that built a better system for their users.
-
The Halo of Resilience Engineering
J. Paul Reed looks at how some of the pillars of Resilience Engineering might help and a team can deal with the changes forced to confront.