Running Projects: A Systems Approach

Sep 13, 2023

It can be daunting for an engineer to take on leading their first large project. They may have a dozen engineers, multiple work streams, intense reporting requirements, and tight deadlines. Fortunately, we can leverage a strength any good software engineer has: systems design.

You can use systems thinking and design to break down a project from technical design to tasks, organize those tasks into work streams, set up reporting, and provide accurate estimates to your stakeholders.

You can accomplish this by doing the following steps:

Find the work streams
Model dependencies between the work streams
Breakdown each work stream
Find the KPIs for each work stream
Build the project system
Monitor and adjust

We’re going to look at two examples inspired by projects I’ve led, modified to protect proprietary information. For the first example, we’ll discuss a service that consists of an API that takes in asynchronous work requests and a workflow that processes those requests. For the second example, we’ll talk about a major project that touches over a dozen services with hundreds of changes.

Find the work streams

Typically, your software will have boundaries that separate components and you can leverage these to separate your work streams as well. A good tech design will help guide you through this process. We’ll use this when we go through the first example above.

If there isn’t a tech design, you may need to look at the work itself. Assess the work that needs done and find any common patterns in how it will be done. Those patterns will help us define the work streams in the second example.

Model dependencies between the work streams

Work streams may have dependencies between them. These dependencies will eventually result in bottlenecks where one work stream is waiting on another. To find these, inspect the boundaries in your software. Are there API definitions that need completed? Database models? Message formats? Figure these out, and prioritize resolving these first.

Breakdown each work stream

For each work stream, start at a high level and find which pieces can be done independently. For instance, you may be building two sets of CRUD APIs for two different resources so you can build these separately. For each resource, you may need to define a data model which will prevent working on each of the Create, Read, Update, Delete APIs. You don’t want the Create and Delete APIs operating on completely different data models! Model these dependencies between high level tasks. As you do this, you’ll get a sense for what can be done in parallel and what tasks are blocking others.

Find the KPIs for each work stream

If you’re leading a project of any size, there will be someone wanting to know how progress is going. For some projects, it may make sense to assign task points and try to report the progress on those. For some projects, that could be a complete waste of time and something like ticket count would be a better fit. For others, you may report on the completion of each sub workstream, such as which APIs are done and which aren’t. Work with your stakeholders to find the right way to report progress.

KPI (Key Performance Indicator) is business word to many engineers, but it’s really just a metric to measure the success of a particular system. As software engineers, we know how to measure systems. Consider each of your work streams and what you would need to measure to report status to your stakeholders.

Build the project system

Now, we have the information we need to start figuring out how the project should function. We have our work streams, we understand the dependencies, and we know how many people we have. You’ll generally want to assign a single owner to each work stream, unless a stream is especially small and can be easily managed by someone with split attention. Model out the project as you would any other system, including reporting lines and work streams.

Monitor and adjust

No plan survives first contact with reality, and your project system is no exception. People will get sick, work streams will move more slowly or more quickly than expected, priorities will shift, and so on. Keep a close eye on the system, identify constraints, and eliminate them.

Example One: Async Workflow

For this example, we have a simple system where an API ingests work that a workflow completes in the background. Here’s the (very) high level technical design.

API writes the work request to a datastore
The datastore event triggers the workflow
The workflow performs the work and marks the request as completed
When the caller requests the status, the API reads from the datastore

Let’s go through the process outlined above to build a project system that can deliver this software system.

Work Streams

First, we can see three main parts of the system: The API, The Datastore, The Workflow. In this case, the work streams fall out of the architecture diagram. Let’s start by considering each of these a separate work stream and move to the next step.

Dependencies

Looking at the diagram, we can see that The API and The Workflow depend on The Datastore. The API and The Workflow are fairly independent, though. There may be some common code, but the bulk will be in how they interact with The Datastore. We can rework our diagram to show these dependencies.

We have a problem, though. According to this diagram, we need to complete The Datastore before we can start work on The API or The Workflow. This means those two work streams will be stalled waiting on the third, which isn’t great. We want to go fast.

Fortunately, with a strong architecture in our code, we can abstract away the implementation detail of The Datastore. I won’t get into code architecture in this post because it deserves its own, but just imagine The API’s code and The Workflow’s code operate against interfaces instead of making database queries.

With this abstraction, we can break the dependencies and work on all three work streams simultaneously.

Breakdown The Work

Now, we need to get some tasks. I recommend using a simple bullet list to start. For the API workstream, we might produce something like this:

API Infrastructure
- Create containers
- Create load balancer
CreateFoo
- Define API model
- Implement
- Testing
ReadFoo
- Define API model
- Implement
- Testing
UpdateFoo
- Define API model
- Implement
- Testing
DeleteFoo
- Define API model
- Implement
- Testing

And so on and so forth. Once you have your bullet list, it’s worth doing a check for dependencies again. We want to make this maximally parallelizable. Fortunately, if you are following a strong testing pyramid design, these can all be done individually. You can build the CreateFoo API and test it without having any real containers running. You can continue breaking this bullet list down, perhaps a task for each test. At this point, we’re talking tasks and not work streams, so we’ll leave it here.

Reporting and KPIs

For this example, we can probably follow a traditional task pointing system. The work is fairly well understood and easy to estimate. Once you go through your planning sessions and assign points as a team, you can just report progress on those points as your KPI.

The Project System

Let’s say we have 5 engineers that can work on this project: Alice, Bob, Charlie, and Dani. The last one is the project lead: you.

We have three parallelizable work streams: The API, The Workflow, and The Datastore. We’ll assign Alice to The API and Charlie to The Workflow, as they need opportunities to lead a small project. You must delegate ownership of these streams to them. We’ll get into delegation in another post and how to use it to empower your fellow engineers and help them grow. The Datastore is especially concerning, as you need to make sure the data model is extensible. You’ll take that responsibility as the most experienced engineer on the team.

This leaves Bob and Dani unassigned. The Datastore doesn’t have a large volume of work, so you assign them to Alice and Charlie’s work streams respectively.

Since Alice and Charlie own their own workstreams, they only need to communicate with you insofar as escalating major issues or resolving conflicts on the boundary. You are the only one communicating with stakeholders, so they have a single point of contact and there is no confusion when communicating.

This diagram illustrates the system for the project team. You are not interacting with Bob and Dani since Alice and Charlie are directing them. The Stakeholder is not communicating with anyone else, ensuring clear communication from the project team. Any issues in the work streams are escalated through you, the project lead.