BT

Engineering for the Long Term at Google

| by João Miranda Follow 2 Followers on Jun 08, 2015. Estimated reading time: 2 minutes |

Astrid Atkinson, director at Google, drew on their experiences over the last decade to present some rules and advice on engineering for the long term. The Velocity Conference 2015 attendees at Santa Clara learned that it's crucial to imagine that you're going to be wildly successful, that complexity mustn't be eliminated but managed and that the focus should be on scaling systems not teams.

If you're going to be successful, then complexity will keep increasing no matter what, so the focus should be on managing that complexity. Atkinson says that the growth that your product can drive should never be curtailed: don't stop from doing something because it's too complex. On the other hand, you should keep a close eye on what's costing you time and work towards removing those inefficiencies.

Atkinson highlighted how good teams and good people are invaluable, but very hard to find. If you're going to be successful in the long run, your systems will grow exponentially, but your teams will grow sub-linearly. You'll make large investments - recruiting, training - in your teams, so you'll want to keep them. Atkinson believes that "you team IS your service". Don't burn your people out. Manage the interrupt load. Bring down the silos: systems need both dev and ops.

Enabling growth requires moving away from bespoke solutions to as much standardization as possible. Store configurations in the same way everywhere, standardize on naming, stop managing individual machines. Atkinson mentioned several examples at Google. For many years now, every Google server contains a small http server that serves a status page. This status page provides basic info, such as the traffic that's hitting the server or when was it built. Borg, Google's cluster manager, helped Google's engineers to manage and run their own services by abstracting away the individual machines.

Engineering for the long term also means paying close attention to maintainability. If the development teams are going to grow a lot, it's important to consider how to operate and maintain all those systems. Two rules stand out. First, each service must look after itself and its communications with others. Second, use shared infrastructure. If you have several services that do similar things, consolidate them, to avoid wasteful overhead. When consolidating, Atkinson likes Dan McKinley's advice on boring infrastructure choices: prefer the most stable thing. You'll also have to think on migrating the workload, as it can be a risky and long endeavor. Atkinson favors moving the biggest customer first, to bring the risk forward.

Atkinson also urged everyone to "don't let the weeds get higher than the garden". That means investing in support tools for the engineering processes, e.g., build, testing, release and monitoring ("from the user's perspective"). It also means that tasks that are executed more than twice should be automated.

Astrid Atkinson's talk is available for online viewing.

Rate this Article

Adoption Stage
Style

Hello stranger!

You need to Register an InfoQ account or or login to post comments. But there's so much more behind being registered.

Get the most out of the InfoQ experience.

Tell us what you think

Allowed html: a,b,br,blockquote,i,li,pre,u,ul,p

Email me replies to any of my messages in this thread
Community comments

Allowed html: a,b,br,blockquote,i,li,pre,u,ul,p

Email me replies to any of my messages in this thread

Allowed html: a,b,br,blockquote,i,li,pre,u,ul,p

Email me replies to any of my messages in this thread

Discuss

Login to InfoQ to interact with what matters most to you.


Recover your password...

Follow

Follow your favorite topics and editors

Quick overview of most important highlights in the industry and on the site.

Like

More signal, less noise

Build your own feed by choosing topics you want to read about and editors you want to hear from.

Notifications

Stay up-to-date

Set up your notifications and don't miss out on content that matters to you

BT