1/23 Architecting ML systems

Welcome to Architecting ML systems, the second module of the Production Machine Learning Systems course.

2/23 Architecting ML systems

In this module, we’ll explore what makes up an architecture as well as

3/23 Architecting ML systems

why and how to make good systems design decisions.

4/23 Architecting ML systems

Let me ask you a question.

What percent of system code does the ML model account for?

5/23 Architecting ML systems

5%

6/23 Architecting ML systems

25%

7/23 Architecting ML systems

50%

8/23 Architecting ML systems

90%

You’ll recall from earlier in this specialization, we showed how time gets distributed among the different tasks necessary to launch an ML model and, surprisingly, modeling accounted for far less than most people expect.

The same is true with respect to the code.

9/23 Architecting ML systems

So, the answer is that ML model code typically accounts for about 5% of the overall code base.

10/23 Architecting ML systems

The reason that ML models account for such a small percentage is that to keep a system running in production requires doing a lot more than just computing the model’s outputs for a given set of inputs.

11/23 Architecting ML systems

In this module, you’ll see what else a production ML system needs to do and how you can meet those needs.

Upon completing this module, you should acquire the knowledge to

12/23 Architecting ML systems

choose an appropriate training and serving paradigm,

13/23 Architecting ML systems

serve ML models scalably,

14/23 Architecting ML systems

and design an architecture from scratch

15/23 Architecting ML systems

And while our focus is on “Google Cloud”, it’s important that you always try and reuse generic systems when possible–many of which are open-source frameworks.

16/23 Architecting ML systems

What’s true of software frameworks like TensorFlow

17/23 Architecting ML systems

Spark,

18/23 Architecting ML systems

or Apache Beam is also true of the underlying infrastructure on which you execute them.

19/23 Architecting ML systems

Rather than spend time and effort provisioning infrastructure,

20/23 Architecting ML systems

you can use managed services such as such as

21/23 Architecting ML systems

Dataproc,

22/23 Architecting ML systems

AI Platform,

23/23 Architecting ML systems

or Dataflow to execute your Spark, TensorFlow, and Beam code.