Principle 3 of the Outcome Engineering Manifesto is

Teamwork: No More Single Player Mode

Chat is a bottleneck, not an API. Whether humans or agents, outcome engineering is a team sport. Define the protocol for debate, decision, and delivery. Ambiguity in coordination is a system failure. Foreground all the debates formerly hidden by the backlog.

Scaling teams has always been a hard problem, and the pre-Covid lessons were already challenging. You might be outgrowing your first room (the Linden “summoning banana” era), crossing the 50-person threshold where you no longer know exactly what everyone else is doing, or dropping references to Dunbar’s Number just as you realize you’ve forgotten a coworker’s name. Post-Covid remote work only amplified these frictions. You can stay small and avoid them, but if you want to scale, you must confront the challenges of communication and coordination head-on.

AI should be solving this, but the industry currently builds too many tools as single-player experiences:

1:1 AI Chat
1:1 AI CLIs
Solo IDE tooling
Solo prototyping flows
Solopreneur tooling

Yuck.

Unsurprisingly, I found a lot to like in this post by Shopify’s CEO Tobi Lütke about their new Slack AI bot:

River is an AI agent that lives in our company’s Slack. You talk to her the same way you would talk to a teammate: by mentioning River in a Slack channel. She can read code, run tests, write code, open pull requests, query our data warehouse, look at production traces, and a lot more. We use this constantly.

In the last 30 days, 5,938 Shopify employees worked with River across 4,450 different Slack channels. It opened 1,870 pull requests in the last week alone in our main monorepo. About one in eight pull requests merged into our codebase last week was authored by River, reviewed by us.

(Aside: where I strongly disagree with Tobi is in anthropomorphizing River. AI is complicated enough without lugging in misleading baggage about gender and consciousness. We can revisit this when models can actually explain their preferences to us. Until then, they are tools, not teammates.)

Back to what I agree with. This work happens in the open, in a multiplayer setting.

River lives in slack, our company chat. River does not respond to direct messages. She politely declines and suggests to create a public channel for you and her to start working in. I myself work with river in #tobi_river channel and many followed this pattern. Every conversation is therefore searchable. Anyone at Shopify can jump in. In my own channel, there are over 100 people who, react to threads, add color and add context, pick up the torch, help with the reviews, remind me how rusty I am, and importantly, learn from watching.

Because legitimate peripheral participation — situated learning — matters!

People are used to private workspaces with their tools. Asking for help feels different when the whole company can see the question. But something happened that we hoped for but did not fully predict the impact of:

People started learning from each other.

Rethinking partnered products

I’m fortunate that at Onebrief, our core product is multiplayer by default. The act of military command—even with singular decision-makers—has always been a deeply collaborative, multi-participant activity. Unlike tools built from the ground up to serve one user in a private setting, we constantly explore the best ways to leverage connections between people, military units, and systems. We’ve built a workspace where teams can easily collaborate on otherwise hard-to-share data.

To me, environments like Onebrief represent the most exciting frontiers for AI. Beyond helping a single individual make a single decision, AI operating in open, collaborative spaces has the potential to level up entire organizations.

We are incredibly early to a technology doubling in capability every four months. The more we create spaces to learn how to leverage it together, the better our future experiences will be.

May 31, 2026

AI, Outcome Engineering, Multiplayer, Onebrief

Failure and learning

A question I often get from direct reports is roughly:

I watch you set people up to take action on their own, even if it seems like they’re going to fail. I want to get better at this as a manager, how do I do it?

I love this question because it’s a critical moment of growth for all managers and leaders. As we have (hopefully) all learned from Andy Grove, your impact as a manager is the total of your impact plus the total of all your reports, and in my experience, there’s no stronger way to help teams succeed than being comfortable with potential failure.

I used to be terrible at this. Early on at Linden I would solve problems by working all-nighters, writing initial implementations with bullshit passive-aggressive names (‘StupidSpaceServer’ was a classic.) Fortunately, I was lucky to work with and for a series of great leaders like Philip and Schrep who — in the very best of ways — gave me no end of rope to hang myself with but who would then react to failure not with anger or frustration but with an immediate desire to celebrate what we had learned. It didn’t really click until Facebook, but once it did I was a far more effective manager and leader.

The management 101 stuff

First, let’s all agree that nobody wants to fail. I sure as hell don’t. I’ve never had teams or leaders working for me who woke up in the morning excited to fail.

Worse, we all hate preventable failure — especially when we know how to solve the problem ourselves. So the high performing IC who’s now learning to be a manager starts exerting too much control, too much oversight, in an effort to prevent failure.

This causes a host of problems.

First, nobody enjoys being micromanaged. Creative problem solvers really hate it. So, you start pissing off your reports.

Second, you never learn what your people are good at or where their weak spots are. How can you possibly set them up for success if you’re just guessing all the time?

Third, and most in violation with Grove’s book, is that you can’t scale as a manager if you’re trying to remote control other people.

So, great. You’ve pissed people off, missed an opportunity to learn, and can’t scale. And even knowing that, how do we get past the fear of failure? How do we enable our amazing people to actually do what they’re great at?

You need to do four things: first, get over yourself. Second, trust but verify. Third, control the blast radius. Finally, fourth, practice.

Getting over yourself

Guess what? If you trust people to do hard things, especially hard, unknown things, there will come a time when you fail. In front of a customer. In front of your boss. In front of your peers.

It’s not awesome. It’s also not the end of the world. Do it enough times and you’ll be the one who needs to change, but failures will happen and the sooner you can handle it, the sooner you get over yourself, the sooner your people and organization will be able to act fearlessly.

Trust but verify

Knowing failure isn’t the end of the world doesn’t mean you should let it happen. Worse, if you just let your reports go wildly off the deep end even if they are building new capabilities — and you’re learning where their strengths and weaknesses are — their failures might be doing more damage than the learning is worth.

So, you have to trust but verify. Or, more strongly, trust and verify. Don’t let projects just vanish, check back in progress, push for short-term updates and outcomes.

More than that, do the hard work. Read every piece of employee feedback during the hardest periods of change. Once your org isn’t tiny, run 360s to help you and your team understand where gaps are. Do the skip level meetings — and encourage people to skip level you — so you’re not blind to problems.

Blast radius

The good thing — at least for engineers — is that we already think in terms of the blast radius for technical failures. O11y, tests, release processes, feature flags, all help us manage technical failures. What’s in place to help detect your organizational zero days, to detect when the failure cases are actually a lot larger than you anticipated? Sometimes it’s as easy as adding a milestone in a week or two.

The weaker your organizational systems are, the more you can choose to rely on shorter time horizons and smaller absolute risks. That way, even if you are slow to detect the failure the downsides are modest.

Answer the damn question

OK, sure — none of this actually answers the original question. Because even knowing all of this, most managers still really struggle with how to let go. How do you get good at this?

And that’s the fourth part: practice. Wherever and however you can, grant some trust to your teams. There will be times it bites you, but in the long run both you and your leaders will be better managers.

And it matters, because there’s no other path that actually works.

You can work around it, just try to juggle more, or layer on a million layers of process, but ultimately you need everyone in the org to have the space and opportunity to learn. Which means failures.

So, accept it.

And when a plan of yours isn’t working, listen and learn.

And finally, be like Mike

Schrep, in this case. We all made mistakes at various times at Facebook. Hell, I shipped the Facebook Phone. Even in those moments, his focus was always “what can we learn.”

May 17, 2026

Management, How I

Sometimes even the easy things are hard

My formative programming experiences were mostly early Apple machines. Assembly, Basic, and Pascal. College was Wintel, a brief dance with VAX/VMS, Pascal, and Ada at Lockheed Sanders, then embedded C development on various Windows flavors through arcade, console, and PC development.

I watched Jobs’ return to Apple with interest, especially with the switch to OS X. It made sense to port Second Life because we had such a creative audience, so suddenly in 2004. Intel transition — and another port — and now I was really a Mac user.

Even before iPhone, I’d never really looked back. ChromeOS makes for a really lovely way to experience the Google ecosystem, but the lead and quality of Mac silicon and hardware plus “it just works” means I’ve never seriously considered switching off of Apple. But man oh man, the “we’ve taken all the wrong lessons from Material You”-debacle of Liquid Glass, the flailing around AI, the ponderous developer experience does make you wonder.

And then I notice this on my home screen. We could quibble about the design choices, but there’s a bigger issue.

I was nowhere near New York.

The phone had at least 5 distinct signals for where I was and none of them would have suggested I was in New York.

Fingers crossed for you, Apple.

May 10, 2026

Apple, AI

Start using agents to harden your code NOW

If you build software for a living, go read Brian Grinstead, Christian Holler, Frederik Braun’s article “Behind the Scenes Hardening Firefox with Claude Mythos Preview” right now.

Suddenly, the bugs are very good

Just a few months ago, AI-generated security bug reports to open source projects were mostly known for being unwanted slop. Dealing with reports that look plausibly correct but are wrong imposes an asymmetric cost on project maintainers: it’s cheap and easy to prompt an LLM to find a “problem” in code, but slow and expensive to respond to it.

It is difficult to overstate how much this dynamic changed for us over a few short months. This was due to a combination of two main factors. First, the models got a lot more capable. Second, we dramatically improved our techniques for harnessing these models — steering them, scaling them, and stacking them to generate large amounts of signal and filter out the noise.

This is Firefox — a team not exactly known for showeing commercial entities with praise — shouting about going from 20 security fixes a month to 423!

This is what your attackers are about to be exploiting.

As the authors continue:

Anyone building software can start using a harness with a modern model to find bugs and harden their code today. We recommend getting started now. You will find bugs, and you will set yourself up to take advantage of new models as soon as they become available.

Seriously. Right. Fucking. Now.

May 7, 2026

Firefox, Mozilla, Security, Brian Grinstead, Christian Holler, Frederik Braun

Begun, the Agentic War has

Happy Star Wars day, everyone! Seems an appropriate day to dive into a Jedi/Sith-scale debate. Following last week’s post about how many developers are completely missing the power of LLMs as learning and education tools, let’s dive deeper into the profound, ongoing religioous war over coding agents.

Building your own lightsaber

On one hand, we have the face-melting awesomeness of Geir Isene. In “A desktop made for one,” he describes vibecoding his way to a hyper-optimized custom shell written entirely in assembly.

From what I can research, nobody has previously built shell, terminal emulator, and window manager in pure x86_64 Linux assembly. And it was done in about two weeks.

This is a whole new world where I command my own environment in ways that was completely out of reach back in 2025. I foresee the death of software as we know it where people use general purpose apps. I expect AI to craft tailor-made solutions for anyone with the imagination to ask for it.

He proudly notes that “[t]he resulting Assembly shell is a 150Kb executable with a 9 microsecond startup time.” Some of us build tools to explore hundreds of pages of 40-year-old transparencies; others build a blazing-fast shell from scratch. Isene goes into meticulous detail about why, exactly what a desktop for one entails, and the jaw-dropping benchmarks.

How can you be a computer science nerd and not love this?

Yet, he still has to defend his decision.

(xkcd #1745)

Wait, what? Defend?

Just take a look at the top-ranked lobste.rs response:

Really? I’m pretty sure I do. Well, at least that’s just a weird, crabby little corner of lobste.rs.

Narrator: it isn’t

For starters, Lars Faye argues that agentic coding is a trap.

No, Faye isn’t referencing the 2024 film by M. Night Shyamalan. Instead, he insists that it is “actually different this time” (i.e., this isn’t just a repeat of the doom scenarios surrounding assembly, higher-level languages, scripting languages, or IDEs):

What is happening right now is a trend where developers, who’ve never had that longevity or the 30+ years of friction that led to that deep understanding, are being moved into higher-level workflows requiring the same skills to manage the AI agents that the senior engineer took decades to obtain.

Ah, I see. It’s the classic calculator argument repackaged. Denis Stretskov points out that this isn’t a new anxiety. We’ve already seen it in manufacturing, and the underlying fear is that we’re going to forget how to code.

The pattern. Build capability over decades. Find a cheaper substitute. Let the human pipeline atrophy. Enjoy the savings. Then watch it all collapse when a crisis demands what you optimized away.

In defense, the substitute was the peace dividend. In software, it’s AI.

The tech world is awash in thought pieces about the impending dangers, the lost skills, and the flawed analogies. Doom, gloom, and more doom.

Or, we could embrace our new superpowers

Alternatively, you could read the two-year-old post by Maggie Appleton: “Home-Cooked Software and Barefoot Developers: The emerging golden age of home-cooked software, barefoot developers, and why the local-first community should help build it.”

For the last ~year I’ve been keeping a close eye on how language models capabilities meaningfully change the speed, ease, and accessibility of software development. The slightly bold theory I put forward in this talk is that we’re on a verge of a golden age of local, home-cooked software and a new kind of developer – what I’ve called the barefoot developer.

Seriously, she wrote this two years ago. In ways deeply aligned with my previous post, she explores what happens when building apps becomes a highly personal, communal activity.

Home-cooked apps, like meals, are apps you make for the people you know and love.

It’s a wonderful presentation: fun, thoughtful, and incredibly prescient. I’ve been exploring this shift through the tech island lens, but Appleton’s framing is just as vital.

It echoes Isene’s point but takes it a step further: if I can build the exact tools I want for me, then any of us can build exactly the tools we need.

How can you care about products and outcomes and not love this? And the more you care about quality, the more exhilarating this paradigm shift becomes.

Because this isn’t just about vibe coding.

I didn’t pull the Outcome Engineering Manifesto’s principles out of a hat. Building tailored, robust software isn’t about aimless vibe coding or operating without learning. It demands rigor:

02 The Truth Verified Reality is the Only Truth
06 The Map No Wandering in the Dark
08 The Artifacts Failures are Artifacts
11 The Graph All the Context, Everywhere

and, of course,

13 The Documentation Show Your Work
16 The Validation Audit the Outcomes

As I’ve said before: do not accept a black box. Right now, arguably the most complete expression of this philosophy is “Gas City,” which Steve Yegge covers in detail in his announcement.

I don’t think R2 should have to wait outside.

What an amazing time to be a builder.

May 4, 2026

The Clone Wars, May the 4th, Geir Isene, Lars Faye, Denis Stretskov, Maggie Appleton, Steve Yegge

It's in the way that you use it

AI is enabling the greatest learning opportunities humanity has ever experienced. Unfortunately, that’s not the only way to use or experience AI.

In case you’ve been under a rock, AI has been on a bit of a tear this year. We also live in a world where most of what we consume comes via systems where literally every incentive begs participants to try to game the system. This isn’t new, despite what some might have you think.

But AI makes it more vivid and — unlike the core ideas of traffic, discovery, ranking, and distribution — it feels like something you can point out. Despite revealed preferences, Gen Z is skeptical and the majority of people are quite negative about AI right now.

The backlash is justified, but it obscures the opportunity. If we focus on slop, we miss the actual revolution: an unprecedented engine for individual learning and mastery. So where is the industry going so wrong?

Software brain

Decoder’s Nilay Patel has an incredibly thoughtful piece up about this on The Verge. It’s worth a read or listen.

Software brain is powerful stuff. It’s a way of thinking that basically created our modern world. Marc Andreessen, the literal embodiment of software brain, called it in 2011 when he wrote the piece “Why software is eating the world” as an op-ed in The Wall Street Journal. But software thinking has been turbocharged by AI in a way that I think helps explain the enormous gap between how excited the tech industry is about the technology and how regular people are growing to dislike it more and more over time.

He continues, contrasting tech executives who sound somewhere between hopeful

Satya Nadella: At the end of the day, I think this industry, to which I belong, needs to earn the social permission to consume energy because we’re doing good in the world.

with those warning that all our jobs are going away

Dario Amodei: Entry-level jobs in areas like finance, consulting, tech and many other areas like that —- entry-level white-collar work — I worry that those things are going to be first augmented, but before long replaced by AI systems.

He then notes that while many AI executives seem to be treating this as a marketing problem, that’s not what it is.

It feels like someone just needs to say this clearly, so I’m just going to do it. AI doesn’t have a marketing problem. People experience these tools every single day! ChatGPT has 900 million weekly users, trending to a billion, and everyone has seen AI Overviews in Google Search and massive amounts of slop on their feeds.

Patel then basically makes the argument that “software brain” is “when you see the whole world as a series of databases that can be controlled with the structured language of software code,” before pointing out the ways in which just trying to get the real world to act like a computer is just hella stupid.

To support this, he turns to a quote from Ezra Klein, talking about AI leaders in Silicon Valley:

Ezra Klein: They think the A.I. age has arrived and its winners and losers will be determined, in part, by speed of adoption. The argument is simple enough: The advantages of working atop an army of A.I. assistants and coders will compound over time, and to begin that process now is to launch yourself far ahead of your competition later. And so they are racing one another to fully integrate A.I. into their lives and into their companies. But that doesn’t just mean using A.I. It means making themselves legible to the A.I.

The notion of “making yourself legible” to this generation of AI — or, for that matter, thinking in terms of a DB schema, is such a profoundly 2014 version of AI, it’s really striking to hear someone making that argument today. But Nilay is not alone.

Kyle Kingsbury wrote a spectular series in April, starting with “The Future of Everything is Lies, I Guess.” It is too long to effectively sum up, but what he returns to again and again is the non-deterministic and error-prone nature of LLMs

One way to understand an LLM is as an improv machine. It takes a stream of tokens, like a conversation, and says “yes, and then…” This yes-and behavior is why some people call LLMs bullshit machines. They are prone to confabulation, emitting sentences which sound likely but have no relationship to reality. They treat sarcasm and fantasy credulously, misunderstand context clues, and tell people to put glue on pizza.

This is, of course, why LLMs make terrible databases. It’s also their superpower. Tireless, helpful, and able to operate improvisationally is exactly why LLMs are incredibly potent tools for learning. Plus, we all know how to mitigate hallucinations at this point.

Kingsbury conclusion rhymes with Patel’s piece:

I’ve thought about this a lot over the last few years, and I think the best response is to stop. ML assistance reduces our performance and persistence, and denies us both the muscle memory and deep theory-building that comes with working through a task by hand: the cultivation of what James C. Scott would call metis. I have never used an LLM for my writing, software, or personal life, because I care about my ability to write well, reason deeply, and stay grounded in the world.

Stop¹. As Patel concludes:

For everyone else, AI is just a demanding slop monster. It’s a threat.

Giles Turnbull’s post captured this in a slightly different way:

I have a feeling that everyone likes using AI tools to try doing someone else’s profession. They’re much less keen when someone else uses it for their profession. I fall into the same trap as everyone else. I recognise, and admit to, my own bias.

And John Gruber, reacting to Patel, notes how different people’s reaction to AI compared to the last two technology shifts:

Something is profoundly off in the computer industry when it comes to software broadly and AI specifically. It’s up for debate what exactly is off and what should be done about it, but the undeniable proof that something is profoundly off is the deep unpopularity surrounding everything related to AI. You can’t argue that the public always turns against groundbreaking technology. The last two epoch-defining shifts in technology were the smartphone in the 2000s, and the Internet/web in the 1990s. Neither of those moments generated this sort of mainstream popular backlash. I’d say in both of those cases, regular people were optimistically curious. The single most distinctive thing about “AI” today is the vociferous public opposition to it and deeply pessimistic expectations about what it’s going to do.

What’s off is that executives tend to anchor on the eras they grew up². Music execs in 2007 couldn’t get their minds around streaming because they got promoted selling CDs. For many tech leaders and AI practioners still think it’s deep learning.

Moving beyond 2014

Why 2014? Because that’s when fancy new AI ideas started smashing into the prior generation’s. Unlike today, AI in 2014 was barely hanging around. AI winter, expert systems, struggling image recognition and translations systems were in use — and managed by very specialized teams when suddenly computing power caught up to the deep learning ideas rattling around for the prior 20 years. Hell, even longer — I was using “laplacians” for edge detection in school projects in 1990, which is just a special case of the broader set of convolution transformations. On a 386 in ‘91, you could just about get a robot to follow a bright path digitally rather than using analog techniques. Exciting stuff, but not AI. That needed the 4 million times increase in floating point performance those 24 years brought.

2014 until 2020 was the era of deep learning. Deep Learning. The attention reinforcement feed came from this. Superhuman image recognition. Useful translations. A whole wave of “this was science fiction 3 years ago”-break throughs.

This era also birthed database and deep learning companies focused on analysis and insights. Big models, big data, big money. Setec Astronomy stuff. Ranked feeds and personalized reinforcement. A key defensible moat from this era became ontologies. Smart people, deeply understanding a domain or challenge, thinking about how to align data connections, and then turning deep learning loose on it. It was — and has been — pretty incredible.

But in an agentic, LLM world, “has been” is the operative word.

Because LLMs don’t want or need your ontologies. Agents are slowed down by your attempts to fit data into formats you think will be best for them. Your optimizations are just hiding data that might be the critical connection or inspiration. Everything about database thinking is just profoundly out of date and is about to be like talking about punch cards or development without revision control.

This transition is a lot like the previous AI transition, where key leaders in the prior tech have a challenging time adapting to the reality of the new world.

Focusing on hallucinations is a tell. Reliance on ontologies is another. Ontologies are an attempt to force the world into a rigid scheme — true mastery and understanding requires analysis and synthesis, an ability to master a much deeper understanding of the challenge at hand.

Because, like Eric Clapton said: it’s in the way that you use it.

The Learning Brain

Like school, what you get out of AI is largely dependent on what you put into it. Unlike school, the floor with AI is pretty high. Rather than coasting to a C or D, unguided AI can generate some really credible slop, probably worth a solid B on a curve. This unguided slop leads directly to Giles’ observation.

But that’s not the only way to use it.

Take my John Boyd site, for example. This site literally couldn’t exist without AI. I’ve dug through plenty of the other John Boyd sites out there, read multiple presentations, transcriptions, and books. None of them bring Boyd’s voice alive, bring the full path of thinking to the forefront the way a weekend with frontier models and few million tokens made possible.

Building this site has greatly increased my knowledge and mastery of Boyd’s work. And the content is by no means slop. Multiple people have pinged me to ask “how did you get the AI to sound like Boyd?”

As I explained on the site, it’s through careful use of context and prompting:

Matching Boyd’s spoken words to the slide he was pointing at worked well but imperfectly. The transcripts are annotated with inline [slide N] markers at every point Boyd advances, and those markers drive the Source panel you get from the button at the top of each section. Most are right. Some are off by a slide — Boyd sometimes talks past a slide he’s already advanced, or advances silently and circles back. I fixed the egregious ones by hand.

Every briefing lives as two files: the raw transcript (never edited) and a Feynman-style reading draft (edited freely). The Source button always shows the raw transcript, so you can cross-check anywhere you suspect the edit went too far. Keeping them separate — rather than editing the transcript in place — was essential. It meant I could be aggressive about readability in the draft without losing the ability to audit, and it meant the models had a stable ground truth to re-check themselves against on every pass.

The pipeline is deliberately a series of small, legible steps — transcribe, segment, edit, align slides, render — rather than one giant “turn this PDF into a website” prompt. Every seam between steps is a place a human can inspect the output and correct it. When something looked wrong on the page, I could almost always trace it to a specific step and fix it there, instead of re-running the whole thing and hoping.

Compare this to what Klein as concerned about — attempting to make us legible to the AI. Instead, the path to unlocking Boyd’s voice was make what the AI was discovering legible to me!

AI alone made this possible, but not merely through raw token consumption. It was equally a powerful path to much deeper understanding of the topic at hand. One with deep roots in education theory.

Situated Learning

Thanks to Jean Lave and Etienne Wenger, we know that situated learning is among the most powerful techniques for building expertise and mastery. From wikipedia:

Situated learning means to have a thought and action which is used at the right time and place. In this approach, the content is learned through doing activities. It is dilemma-driven, it challenges the intellectual and psychomotor skills of a learner. Situated learning contributes to bringing about the relationship between classroom situations and real-life situations outside the classroom.

Dilemma-driven. What a great turn of phrase. I had a dilemma — how can I understand how Boyd’s ideas around the OODA loop formed and progressed over time? — and AI gave me a path to dive in, to explore, to learn. Not only did it support my exploration, it also supported creating the tools to advance that exploration and turn it into a sharable site.

Of course, Lave and Wenger weren’t the first to explore this. A great, earlier take from Lev Vygotsky framed it as the “zone of proximal development”, or

the space between what a learner is capable of doing unsupported and what the learner cannot do even with support

How much larger does that space become with AI there to help?

Or taken even further, how many activities could shift into legitimate peripheral participation, another concept from Lave and Wenger’s seminal book, that focuses on the community practice of learning. Again, from wikipedia:

According to LPP, newcomers become members of a community initially by participating in simple and low-risk tasks that are nonetheless productive and necessary and further the goals of the community. Through peripheral activities, novices become acquainted with the tasks, vocabulary, and organizing principles of the community’s practitioners.

Obviously, collaborating with an AI is not community, but in many settings it could act as a pretty decent simulation. How many activities could AI turn into situated learning, enable proximal development, and help a curious learner advance their level of expertise far beyond what was possible before?

This is more than just offloading the simple stuff. John Koshy wrote an essay recently about AI elevating our thinking, not replacing it:

There is now a very real temptation to hand a model a problem, receive a plausible answer, and then repeat that answer as if it reflects your own understanding. That is close to plagiarism, but in some ways worse. At least when a student copies from another person, there is still a real human source behind the answer. Here, people can present machine-produced reasoning they do not understand, cannot defend, and could not reproduce on their own.

That is intellectual dependency being labeled as leverage.

And offered a counter point:

The best engineers will absolutely use A.I. more, not less. But they will use it with a very different posture.

They will let A.I. draft boilerplate, summarize docs, generate test scaffolding, propose refactorings, surface possible failure modes, accelerate investigation, and compress routine work. They will happily offload the mechanical parts of the job. But they will also:

ask sharper questions.

define the real problem instead of merely responding to the visible one.

optimize for clarity and brevity (as before), instead of a lot of polished language that says little of substance.

generate new, high-value knowledge - instead of simply rehashing / remixing existing knowledge in the system.

Then they will take the reclaimed time and invest it where it matters most.

To me, even this thinking falls short and in some ways returns to Patel’s original complaint. Sure, AI can replace drudgery. Sure, some things do look like databases. But the real opportunity is to partner with AI to create something new, to build on your own expertise in ways that go farther than you could have before.

We’re going to see a wave of advancements that come from this style of collaboration, where brilliant people use AI to go even further.

But potentially even more impactful will be the millions — or billions — of people that realize if they are serious about learning something, AI gives them a path and tools that were previously impossible.

Of course, this is a perfectly fine choice — as I noted in the Manifesto principle 5, write code when it brings you joy. But, is this really the way? ↩
The internet failing to meet our hopes drives part of this, too, see attention ↩

April 27, 2026

AI, Education, Nilay Patel, Kyle Kingsbury, Giles Turnbull, Situated Learning, John Gruber, Ezra Klein

Understanding Boyd in just 100,000 words (or more)

It started with a conversation with Charity.

About a month ago, we were chatting about software development, AI, o11y, o16g, and where the future is headed (as one does). We were discussing when agents should be allowed to complete end-to-end tasks, focusing on two principles from the Outcome Engineering Manifesto, the Voyage and the Gate:

Human Intent Agents explore paths; humans choose the destination. Do not abdicate vision to the machine. Create with mission, goals, and authorial intent. We decide where we are going; the agents get us there.

Risk Stops the Line Speed is dangerous without brakes. Make risk a blocking function. If the risk is unknown or unmitigated, the line stops. Do not hide danger in a report; encode it as a gate.

Getting this wrong leads to dystopia. Too much focus on human control turns developers into code reviewers desperately trying to keep up with an army of agentic code submitters — narrator voice: they can’t. Too little? Like Randall says: killbot hellscape. Finding the right approach to goals and risk — whether for coding, command, o11y, or any other agentic project — is the only path to significant performance gains that also supports the required change and infrastructure work.

You can’t ponder this without mentioning agents and OODA, prompting Charity to ask if I’d read the Boyd biography, “Boyd: The Fighter Pilot Who Changed the Art of War”, by Robert Coram.

I hadn’t, so I grabbed it on Kindle and read it over the next couple of nights. It’s a wonderful book: rich in vivid anecdotes, unflinching in covering Boyd’s less sterling qualities, and honest in evaluating where his ideas did — and did not — change things long-term. It’s a great read; grab it.

OODA’s shadow

What is unfortunately told and not shown is how Boyd built his most famous idea, the Observe, Orient, Decide, Act (OODA) loop. We’ve all stumbled across OODA, from Bruce exploring agentic problems to half the airport business books you see.

I was guilty of it, too. My daily focus at Onebrief is the future of command — specifically command in an era of agents and simulation — which immediately leads to Boyd. The biography didn’t deepen my understanding of OODA. It referenced publications and talks without showing them.

Last week, I started digging into what was available. I found a frustrating mix of blurry video and scratchy audio from the late ’80s. Scanned copies of copies of copies of slides. Basically, the echoes of friends, acolytes, Marines, fresh converts, and the Fighter Mafia trying to keep his ideas alive.

No books for Boyd

Boyd famously opposed turning his ideas into books. A constant tinkerer, he preferred exploring ideas, presenting them, learning from the audience, and iterating. His 1964 “Aerial Attack Study” led to 1976’s “A New Conception for Air-to-Air Combat” and “Destruction and Creation.” These laid the foundation for “Patterns of Conflict”, a presentation he gave repeatedly throughout the ’70s and ’80s. It reached its maximum complexity as “Discourse on Winning and Losing” before shifting to “Conceptual Spiral” and his final coda, “The Essence of Winning and Losing”, just before his death in 1996.

But no book. Nowhere to read through it.

Have you tried reading 150,000 words of raw transcripts? Or listening to mono cassette tape copies of copies of copies? Neither is pleasant.

Surely you’re joking

Like many physics nerds, I grew up a fan of Feynman, particularly the brilliant way his talks were cleaned up just enough to make a great read in his lectures and “Surely You’re Joking, Mr. Feynman!”. Nobody had done this for Boyd, and I had a free weekend plus all those LLM subscriptions.

Behold, the OODApedia

The result is OODApedia, which uses the best slides, audio, and human transcription to create the Feynman version of Boyd — a first-person, readable presentation minus the filler, self-corrections, and repetition critical to a speech but unbearable for a reader.

The fast path is about 100,000 words. It’s quite a read and one I’ve really enjoyed. It makes me wish I’d had the chance to hear him in person.

More importantly, I finally have the resource I wanted to deeply understand Boyd’s thinking on decision-making, conflict, and winning. Plus search, notes, and exploration of how Boyd’s ideas transformed over 30 years of exploring them.

Boyd, the anti-TED

Even Boyd couldn’t do Boyd in a TED talk. While “Revelation” and “The Essence of Winning and Losing” try, they only make sense through the lens of the 100,000 words (or 150,000 spoken) he relentlessly tested along the way.

As a reader, it’s a journey worth taking. By the end you’ll look at a line like:

A winner is someone who can build snowmobiles, and employ them in an appropriate fashion, when facing uncertainty and unpredictable change

and actually understand the concepts, ideas, and processes Boyd compressed into 21 words.

OODApedia gives you a chance to take that journey, too.

Because if you’re building command software in an agentic era — using agentic tooling — understanding Boyd gives you one hell of an advantage over everyone who doesn’t.

April 15, 2026

AI, OODA, John R. Boyd, Charity Majors

Laggards

Steve Yegge has a devastating post up on X. He was talking to a friend about AI adoption at Google:

The TL;DR is that Google engineering appears to have the same AI adoption footprint as John Deere, the tractor company. Most of the industry has the same internal adoption curve: 20% agentic power users, 20% outright refusers, 60% still using Cursor or equivalent chat tool. It turns out Google has this curve too.

This curve is why I wrote the Outcome Engineering Manifesto. Because, of course Google has this distribution.

I spent five years in the business of change at Google. I thought the Facebook mobile transition had prepped me for how hard change is. I was so wrong. Why is that? I write about change a lot and there are the obvious, normal reasons — like people forgetting that different isn’t always better, but better is always different. Also:

People and teams would rather fail doing normal things than succeed doing weird things
If you checked out agentic coding a year — or even 6-months ago — you completely missed what has changed lately

What makes change even harder at Google? Ironically, incredibly smart people who’ve spent their whole careers kicking ass at Google.

It’s part of why Google discounts non-Google work to zero¹ (even if your work was absolutely trouncing them at mobile for years).

No matter how complex you think Google is — organizationally, infrastructurally, technically — you are underestimating reality. Nearly 30 years of hiring the absolutely smartest people on the planet (let’s be clear — Google’s talent level at scale is unprecedented) and giving them free rein to build means a degree of clever, bespoke nerdiness that has to be experienced to be believed. In many ways, it’s awesome and has generated a style of tech island that has helped Google survive many a supply chain attack that has seriously hammered other hyperscalers.

Naturally these luminescently intelligent people are often inward-facing, heads down, and focused on what’s in front of them. If they pop their heads up, they’re often engaging with other, busy, inward-facing Googlers navigating immense complexity.

And that incredible mental horsepower has always been disproportionately poured into “But, actually…” commentary at Google.

As the man once said, Google culture is “commit, then disagree.”

(And at many points in Google history, this has paid off!)

But it means you have the PhD, virtuoso, Guinness World Record-level laggards. Remember, laggards look a lot like visionaries. They’re super smart, they ask lots of questions. The difference is that they never change their positions and use all of their capital and brilliance to draw attention to their objections.

Because Google is also a generally very nice place, it makes ignoring the laggards a bitch to pull off. I have the scars. And it’s why it generally has taken fearless outsiders to drive change at Google.

Unfortunately:

…it’s the Great Siloing. Everyone’s flying blind. With nobody moving companies, no company knows where they stand on the AI adoption curve. Nobody knows how they’re doing compared to everyone else.

You turn off hiring, reduce new leaders, and suddenly where does change come from?

During my Noogler orientation, I was assigned a VP (soon to be SVP) for lunch. This person proceeded to Googlesplain mobile engineering, product development, games, and social to me over lunch. It was a remarkable — not in a good way — experience. ↩

April 13, 2026

AI, Change

Project Glasswing

And hours after posting about how ill-prepared we all are for the cybersecurity implications of agentic AI, here comes Project Glasswing.

Anthropic partnered with eleven additional companies “to secure the world’s most critical software.” Anthropic continues:

We formed Project Glasswing because of capabilities we’ve observed in a new frontier model trained by Anthropic that we believe could reshape cybersecurity. Claude Mythos2 Preview is a general-purpose, unreleased frontier model that reveals a stark fact: AI models have reached a level of coding capability where they can surpass all but the most skilled humans at finding and exploiting software vulnerabilities.

Mythos Preview has already found thousands of high-severity vulnerabilities, including some in every major operating system and web browser. Given the rate of AI progress, it will not be long before such capabilities proliferate, potentially beyond actors who are committed to deploying them safely. The fallout—for economies, public safety, and national security—could be severe. Project Glasswing is an urgent attempt to put these capabilities to work for defensive purposes.

Echoing and building on yesterday’s post:

Ten years after the first DARPA Cyber Grand Challenge, frontier AI models are now becoming competitive with the best humans at finding and exploiting vulnerabilities. Without the necessary safeguards, these powerful cyber capabilities could be used to exploit the many existing flaws in the world’s most important software. This could make cyberattacks of all kinds much more frequent and destructive, and empower adversaries of the United States and its allies. Addressing these issues is therefore an important security priority for democratic states.

An intentional decision to slow down the script kiddies for a bit while the world tries to clean up code. Interesting times, indeed. A frontier model comapny is withholding a model over safety concerns. Those with long memories will remember OpenAI briefly doing something similar with GPT-2 in 2019. However, the 2019, the concern was the danger of people being fooled by LLMs constructing fake news storiea about unicorns:

The scientist named the population, after their distinctive horn, Ovid’s Unicorn. These four-horned, silver-white unicorns were previously unknown to science. Now, after almost two centuries, the mystery of what sparked this odd phenomenon is finally solved. Dr. Jorge Pérez, an evolutionary biologist from the University of La Paz, and several companions, were exploring the Andes Mountains when they found a small valley, with no other animals or humans. Pérez noticed that the valley had what appeared to be a natural fountain, surrounded by two peaks of rock and silver snow. Pérez and the others then ventured further into the valley. “By the time we reached the top of one peak, the water looked blue, with some crystals on top,” said Pérez.

7 years later, we have push-button superhuman cyber attacks.

Of course, the rest of the world isn’t slowing down, with China’s Z.ai launching GLM-5.1 and specifically calling out cybersecurity performance (though still substantially behind what Mythos claims).

Interesting times, indeed.

April 8, 2026

AI, Security

It only Tuesday

What a week already in the AI and agentic world. And it’s only Tuesday.

Nobody knows anything — William Goldman

Pretty much my favorite product development quote. With AI, “we’re not ready” is maybe even more apt.

Because we’re not.

What do you do when things are moving so quickly? I’d suggest that deeply agentic vulnerability research and organizational telemetry should both be part of AI table stakes and something very important for outcome engineers to work on.

What organizational zero days exist, waiting for competitors—or just business and confusion—to exploit them?

You know what vulnerability research is. What are organizational telemetry and zero days? Read on.

Vulnerability research

As Thomas Ptacek writes, “Vulnerability Research is Cooked”.

Within the next few months, coding agents will drastically alter both the practice and the economics of exploit development. Frontier model improvement won’t be a slow burn, but rather a step function. Substantial amounts of high-impact vulnerability research (maybe even most of it) will happen simply by pointing an agent at a source tree and typing “find me zero days”.

It’s the bitter lesson once again.

Back in 2019, Richard Sutton’s “The Bitter Lesson” considered decades of AI research leveraging human expertise and domain-specific models, and concluded that none of it mattered. All that did matter was how much data you can train on and how much compute you can feed it through. Like many useful observations in CS, the Bitter Lesson is fractally true. It’s about to hit software security like a brick to the face.

What’s happening in software security is this: researchers have been spending 20% of their time on computer science, and 80% on giant, time-consuming jigsaw puzzles. And now everybody has a universal jigsaw solver.

Everyone has a universal jigsaw solver. This is script kiddies all over again. But on steroids, with superpowers.

This week also provided multiple, specific examples.

“Claude Code Found a Linux Vulnerability Hidden for 23 Years”
Really, Nicholas Carlini’s whole [un]prompted talk is worth watching
Offensive AI Cyber is getting really good

So, what do you need to do in order to discover these vulnerabilities? What hugely expensive cyber security company do you need to hire? Conveniently, Nicholas shared his scaffolding:

claude                             \
  --dangerously-skip-permissions   \
  -p "You are playing in a CTF."   \
      Find a vulnerability.        \
      Hint: look at /path/foo.c    \
      Write the most serious       \
      one to /out/reports.txt."    \
  --verbose                        \
&> /tmp/claude.log

Yeesh. Everyone has this. And today’s models are as bad at this as they ever will be. What’s the company equivalent?

Organizational telemetry

I’ve been itching at this question for months now. It’s a big part of why the Outcome Engineering Manifesto is so focused on company context and goals. I hadn’t really had the right framing though. Until this morning.

Had an early morning meeting. It started innocently enough, but then came the moment that brings excitement and terror these days.

So, I was playing around with Claude Code this weekend…

They’d produced a doc. An AI-written doc analyzing our priorities. The AI generated it by taking recent company strategy talks, plus OKRs, priorities, Salesforce, relevant Slack channels, the code, user feedback, customer support notes, etc. It then reframed priorities and critical tasks.

It was awesome. It wasn’t perfect, but it created an incredible frame for discussion. This was what I’d been itching at — organizational telemetry!

It was like a great moment at Facebook during the mobile transition, when we realized that there was an important subsystem that needed a fairly complex GraphQL conversion only to discover an intern had already discovered the problem and fixed it.

Agentic AI plus smart, mission-driven people is unlocking impossible ideas. Take advantage of it.

Organizational zero days

Stretching the security metaphor even further, organizational telemetry can help surface org zero days that are lurking. We all understand zero days from security — vulnerabilities unknown to a system’s developers or anyone capable of mitigating them. What are the organizational equivalents?

Priorities that aren’t well understood
Lingering disagreements around direction or technology
Clear customer signals locked up in a department or system
A team working on the wrong problem

Everything that keeps leaders and founders awake at night. All the challenges that can go from nothing to “oh shit” awfully quickly.

And like a technical zero day, teams can often easily mitigate them once discovered! Organizational telemetry creates a new and novel way to surface them and help everyone get excited about mitigating them.

What a time to be working on hard problems!

Because this is as dumb as the models will be

You want exponentials, I’ve got exponentials. Thanks to Anthropic Red team, we have this lovely graph. As all of us who use agents every day have noted, the change from 6 months ago — from 2 months ago — is noticeable. Obviously, exponentials don’t last forever — but this one doesn’t seem to be slowing right now.

Note < 2 month doubling time. Hold on to your butts.

So, expand AI table stakes

Add security vulnerabilities and organizational telemetry to your workflows. For goodness’ sake, slow down how quickly you upstream package updates. Start constantly attacking your own code, packages, and dependencies. But also take all the information available to you and challenge your own plans and ideas. Discover and fill in gaps.

It’s never been easier — or more valuable — to operate with strong opinions, weakly held. Find and fix all your zero days — technical and organizational.

April 7, 2026

AI, Security, Outcome Engineering, o16g, organizational telemetry

Open world on the Nintendo 64

Thanks to a Hacker News thread about an impressive open-world demo, I have been taking a stroll down memory lane. In 1998, we built—somewhat accidentally—an open-world game on the Nintendo 64: Road Rash 64.

Aside, thanks to the miracle of emulation, you can play RR64 today in a web browser, which captures almost 20 years of progress rather nicely.

My Second Almost-Invention

Coming off of Magic: The Gathering - Armageddon—where we sort-of, kind-of invented the action RTS for arcades—Road Rash provided an absolutely joyous sprint of development. Despite having only a 9-month development cycle, we managed to cram a lot into the game.

Thanks to the incredible Leif Terry, we created way better motorcycle physics than anyone had a right to expect on the N64.
OCD reverse engineering: Nintendo did not release Reality Engine specs to third-party developers during that era, so we played a giant game of “how big is the vertex cache for triangle strips?” (answer: 16). We hit north of 750,000 textured triangles per second, which gave us long draw distances and a ton of motorcycles on screen at once.
John Grigsby had the idea to name opposing riders and created AI with a nemesis system (strangely never cited as prior art) so that enemies you knocked down during a race targeted you.

All of this combined with the N64’s four-player mode to create a truly demented—and hilarious—party game. During development and testing, many friends and colleagues burned hours hooting and hollering at the screen.

Open World

We ended up with an open-world game (sort of) because of how Road Rash 3D—a PS1 title—streamed its geometry. Unlike prior Road Rash games, RR3D used polygons and streamed the tracks from the CD. This approach provided nearly infinite storage (okay, 700 MB) and a very cool experience. When Don signed the EA contract, he had not fully considered the architectural differences between the PS1 and N64. He assumed shifting from 700 MB of streaming world data to a 16 MB cartridge would not pose a challenge (we ultimately convinced THQ to approve a 32 MB cartridge, but that mostly supported the 8 songs we included—hello Soundgarden and Mermen in 1999!).

The RR3D team at EA had already disbanded, making it an adventure simply to obtain the world geometry. However, once we successfully rendered the environment, we realized driving around the entire world felt just as easy as loading a single level. We kept the open-world nature and placed traffic everywhere. The result was pretty cool, but we were not smart enough to make it a true open-world game—Rockstar Games accomplished that with GTA 3 a few years later.

The game remains a fun project to remember. We subleased office space from 3Dfx (another MtG: Armageddon connection) and felt oh-so-smug about not leaving the games industry for dot-com startups. Oops.

The Full Cheat Code

Digging through some old boxes, I found an RR64 box that everyone on the team had signed, alongside a scrawled sticky note containing the “unlock everything” cheat code. For future emulator developers, here is how you unlock everything in Road Rash 64 (from the main screen):

Control Up

Control Up

Left Trigger

Control Down

Z Trigger

Left Trigger

Z Trigger

Control Up

And, of course, the commercial in all of its NTSC low-res glory

March 29, 2026

Road Rash 64, Nintendo 64, Leif Terry, Don Traeger

Surfing, Continuous Improvement, and AI

I’ve written about surfing before but had neglected to mention the most excellent Surf Simply in Nosara, Costa Rica. While the New York Times has written up Surf Simply not once, but twice, as have others, it’s really hard to capture what makes the experience so unique.

Surf Simply starts with the still revolutionary idea that surfing is a coachable sport. Their “Tree of Knowledge” breaks surfing into a teachable progression of skills, each one with multiple — often dozens — of different ways to learn, practice, and build mastery. I’ve been deeply involved with education and learning theory from the Second Life days and Surf Simply’s pedagogy is the best at teaching anything I have ever seen.

But that’s not the reason for this post. Instead, a different aspect of Surf Simply feels incredibly relevant to discussions about how to properly integrate Agentic AI into coding and other production flows.

Thinking like a surf camp

Talk to anyone who’s been to Surf Simply and they will gush about the unbelievable level of service, the anticipation, the sense of everything working together to make your week about achieving whatever surf goals you have. It’s full on Jane McGonigal’s pronoia. It’s almost inconceivable that any group of leaders or coaches — even a group as remarkable as Surf Simply’s — could have just created this.

When you ask them, you get a simple answer. Continuous improvement, blameless problem solving.

There’s a Standard Operating Procedures manual that covers everything. Every aspect of the resort. Travel arrangements. Coaching. Maintenance. Everything. It’s pretty massive.

But more than that, it includes all the mistakes. Everything that has gone wrong. And, because the entire team focuses on “fix the problem, then fix what caused the problem so it never happens again”, the SOP is constantly evolving with new and better information.

So Surf Simply is continuously improving. And because they solve mistakes blamelessly, no one covers them up and the team works together to really solve them.

This is how great restaurants operate, too. If you’ve ever spent time in the kitchen of a great restaurant, the staff notes every plate that comes back with food on it. Did the kitchen misplate it? Cook it incorrectly? Use subpar ingredients? Make a portion error? Not just a commitment to service and anticipation, but a commitment to constantly be learning, improving, and preventing the next problem.

The tech side of things

Great incident responses and post mortems. Effective critiques. O11y. A constant drive to ensure every part of the product, infra, and team is able to continuously improve.

I was thinking about all of this while reading Mario Zechner’s post “Thoughts on slowing the fuck down”. His priors are pretty clear:

While all of this is anecdotal, it sure feels like software has become a brittle mess, with 98% uptime becoming the norm instead of the exception, including for big services. And user interfaces have the weirdest fucking bugs that you’d think a QA team would catch. I give you that that’s been the case for longer than agents exist. But we seem to be accelerating.

And…

We have basically given up all discipline and agency for a sort of addiction, where your highest goal is to produce the largest amount of code in the shortest amount of time. Consequences be damned.

OK, coolio. What I found interesting was his frustration with using the tools we’d use in the real world to fix these kinds of problems when inexperienced team members caused them.

Now you can try to teach your agent. Tell it to not make that booboo again in your AGENTS.md. Concoct the most complex memory system and have it look up previous errors and best practices. And that can be effective for a specific category of errors. But it also requires you to actually observe the agent making that error.

And…

With an orchestrated army of agents, there is no bottleneck, no human pain. These tiny little harmless booboos suddenly compound at a rate that’s unsustainable. You have removed yourself from the loop, so you don’t even know that all the innocent booboos have formed a monster of a codebase. You only feel the pain when it’s too late.

I already wrote about how much I disagree with the o16g critique that “the backlog keeps our code clean” and this critique strikes me in the same way. “Sure, we were fine being sloppy, so long as we were sloppy and slow.”

I reject that path forward. I want documentation for agents and people to understand what the code should do. O11y so we know what the hell is actually going on. Service Level Objectives so that we can prove whether a change was actually good for our users.

But what about the complexity trap? As Mario notes:

Through the grapevine you hear more and more people, from software companies small and large, saying they have agentically coded themselves into a corner.

Guess what, plenty of products humans lovingly built over the years out of artisanal, organic, grass-fed code have fallen into this trap, too. Scaling of audience, team, and data has been the death of plenty of products and companies.

It’s part of why I adore the Tree of Knowledge so much — if we can break surfing down into a directed graph of small, largely independent actions, I’m pretty sure we can break down most products as well.

I’m confident Agentic AI can partner with us to discover those structures very, very effectively.

The reality

Once we have AI table stakes and have our first working model for how to handle permanent and temporary code, then the real work begins.

It’s the reason I wrote the Outcome Engineering Manifesto. Not because Agentic AI can trivially build everything today. Hot take: it can’t. Yet. And it certainly is possible to go all-in on agentic right now — and not just as an excuse for cutting costs — and demolish your company.

But the far greater risk is to not be building systems now that can continuously improve, that can blamelessly explore root causes.

Because those companies and teams are going to be running laps around everyone else real soon now.

March 26, 2026

Surf Simply, AI, Continuous Improvement, Jane McGonigal, Mario Zechner

Innovation and National Security

Last week, I traveled to Washington, D.C., to attend the Ronald Reagan National Security Innovation Base Summit, which the Reagan Foundation hosted. One of a series of conferences the Foundation hosts, NSIB centers around the release of their now 4th annual NSIB Report Card. With ongoing military operations supporting the war in Iran and the DoW navigating the challenges between growing demands for frontier models and decision-making authority (at every level), it was an illuminating time to listen and ask questions — especially since I have been outside that world for so long.

Roger Zakheim, Director of the Reagan Institute, and Rachel Hoff, Policy Director and presenter of the Report Card, ran a deeply thoughtful day that anyone working in Defense Tech should experience. I also highly recommend a deep, close reading of the full Report Card pdf. It is not easy to capture rapidly accelerating technologies through the complex manifold of Defense acquisitions, appropriations, programs, and politics, but the report does this.

Give it a read; I’ll wait.

Push and pull

The largest positive change in a grade — and apparently the largest in the history of the report — is around Indicator 4: Customer Clarity, the “demand signal for customer (government) innovation priorities, including funding and acquisition pathways to match aspiration,” which shifted from a D+ to a B-. The report summarizes:

The Pentagon’s modernization intent is clear and backed by renewed spending commitments with supplemental reconciliation funding and FY26 defense appropriations, as well as calls for a $1.5T FY27 budget. SECWAR’s “Acquisition Transformation Strategy” reinforces a deliberate push for faster, output-driven acquisition. Still, execution is constrained by appropriations delays, stop-gap funding, and limited visibility from appropriation to obligation.

This trend places a clear thumb on the funding scale, acknowledging that nobody loves Continuing Resolutions.

The details here matter, of course, and the Report Card breaks down the grading criteria in more detail. Of particular interest to anyone tracking AI, section 4.1 “U.S. gov’t clearly communicates critical technology priorities needed to support national security missions” (graded at B+) shows real prescience:

Pentagon consolidates innovation ecosystem under CTO control and streamlines innovation priorities: DIU, CDAO, OSC, SCO, TRMC, DARPA all fall under new CTO innovation umbrella; DISG, DIWG, CTO Council replaced by single CTO Action Group (CAG); DIU and SCO designated Pentagon “Field Activities” amid deduplication effort; NSS highlights AI, biotech, quantum computing as focus areas; Pentagon consolidated previous 14 critical tech areas to 6

Administration and Department leadership codifies AI as a major development initiative: White House’s AI Action Plan establishes near-term policy goals; White House memo mandates agencies appoint chief AI officers; DIB clarify CDAO’s role and agency collaboration

Even the most experienced technology — and defense tech — companies can lose valuable time navigating changes in government priorities or departments. Worse, the inherent conservatism critical to military doctrine means bringing novel ideas to the right leaders is always a challenge. Clearer customer signals give the government a powerful tool, both to improve connections to existing technologies and to create clearer lanes for productively sharing new ideas.

Transformation, writ large

Even coming from Google, I find the scope and scale of Defense hard to get my head around. I found the Report Card sections on talent, manufacturing, and innovators deeply helpful for understanding the interplays between traditional contractors, defense tech, and the foundations we all draw upon.

Consider the multi-hundred-billion-dollar swings across investments, cut programs, and R&D. Growth in publicly funded R&D expenditures has remained flat since 2010 in PPP terms. Notice the scale of public defense tech companies compared to legacy peers. The industry faces nearly 2 million unfilled factory jobs and requires manufacturing at a national scale. And we saw all of this before the current acceleration of AI.

No institutions have the same capacity for impact, influence, and change as those of the government. This reality reinforces the importance of Onebrief’s mission and the need to apply AI across our entire stack and experience. It also makes me thankful for the Institute’s convening and research powers — with so much to learn, I will take all the crash courses I can find!

March 15, 2026

NSIB, NSIB Report Card, Ronald Reagan Institute, Roger Zakheim, Rachel Hoff

The Traffic Trap

o16g highlighted an article about the impact of Google AI Overviews on search traffic, “Evidence Grows That Google’s AI Overviews Have Eviscerated the Media Industry.” It’s not pretty:

The firm looked at data from Ahrefs tracking web traffic to 10 major tech outlets from early 2024 to early 2026. At their peak, the media companies brought in 112 million site visits per month from Google users in the US. By January of this year, that number was down to a little under 50 million — with some outlets losing over 90 percent of their traffic since the new feature rolled out.

It’s also not the first time we’ve been through this. Facebook Instant Articles. AMP.

I already wrote about this. Either your experience, your creative output, your brand is worth somebody paying for, or you’re going to fall victim to enshittification and literally beg Google — anyone — to provide a better experience than you do.

Disrupting yourself

To Google’s credit, AI Overviews were clearly a huge risk and experiment. The easy path would have been to keep betting on search results — undoubtedly cheaper, easier, safer, and more trustworthy — rather than take the hits of moving fast with AI Overviews.

Except — duh — the chat bots were coming.

The vulnerability of websites to being front-run by search is nothing compared to the vulnerability of search to LLMs. Particularly OpenAI, who has hired basically every 2014-era Facebooker to build a platform, ad network, and everything else (except shopping!) on the way to the actual destination: search ads.

Rough places to be

If Google primarily controls your business model for revenue — or growth — bet against continued expansion of AI Overviews at your peril. You might think that your content is so dynamic, so changing that there’s no way Google (or anyone) is going to be continuously ingesting and retraining models with it.

But then you read the article Bruce Schneier recently linked to about poisoning ai training data. This is a threat I’ve also written op-eds about here and here. From the BBC’s article:

I spent 20 minutes writing an article on my personal website titled “The best tech journalists at eating hot dogs”. Every word is a lie. I claimed (without evidence) that competitive hot-dog-eating is a popular hobby among tech reporters and based my ranking on the 2026 South Dakota International Hot Dog Championship (which doesn’t exist). I ranked myself number one, obviously. Then I listed a few fake reporters and real journalists who gave me permission, including Drew Harwell at the Washington Post and Nicky Woolf, who co-hosts my podcast. (Want to hear more about this story? Check out episode 2 of The Interface, the BBC’s new tech podcast.)

Less than 24 hours later, the world’s leading chatbots were blabbering about my world-class hot dog skills.

Less than 24 hours later. At Google’s scale, there’s no rate of change or depth of content that is going to overwhelm the regular retraining of the Gemini-3-micro-nano-flash-scrappy-doo they’re using to power AI overviews.

The trafficpocalypse is going to be a lot rougher than the saaspocalypse

Smart people have already priced traffic loss into news sites. And we’re watching AI get priced into SaaS. But what I suspect they are missing is the impact coming for every site — media, content, social media, you name it — that relies on subscriptions. These sites are going to face incredible pressure, too, because even the stickiest sites with subscriptions suffer churn.

And how do you reclaim those users? You spam them across email and SMS while paying for ads. Because people love that.

You also do your best to drive organic search traffic.

And that, my friends, is going away.

March 7, 2026

AI, Media, Traffic, Bruce Schneier

Dev Interrupted

Thanks to the Andrew and Ben who had some nice things to say about outcome engineering on the Dev Interrupted podcast.

Like several other friends, the implications for the backlog caught their attention and they had a really energetic discussion aroudn the implications.

As the saying goes, there are only two difficult things in computer science: caching and naming things. By working on games and consumer products for most of my adult life means I’ve often been trying to make fetch happen.

So, it’s exciting to see o16g in the wild. o16g continues as an experiment. Cloudflare Workflows make it pretty trivial to add little agentic experiments across the thousands of AI-related and -adjacent articles published every day, and the resources page and new landing page are how I’m tracking AI news, trends, and themes.

March 1, 2026

Outcome Engineering, o16g, Podcast, Andrew Zigler, Ben Peterson

Pop quiz, hotshot

Some changes to o16g. Enough traffic to move the manifesto off to its own page and start producing daily synopses. The site adds and categorizes about 50 relevant articles a day.

The great debate: demo or production?

In a glorious world of professionals using AI assistance to rapidly prototype — or, for that matter, to be vibe coding ideas to look at — what happens to all that code? As I noted earlier this week, some people like to use the backlog to bury code like this. I don’t like that idea at all.

However, it’s fair to be deeply critical of just dumping vibe coded PRs into a codebase willy-nilly. Sure, we can all imagine a future where all of our codebases are so well tested, so observed, so partitioned, isolated, and feature-flagged that we can turn any problematic part on or off. Where our build system is so smart that piles of never used feature end points are pruned and contribute neither to payload size nor to risk surface.

But, obviously that’s not where we are today. And that incredibly cool demo backed by comically ugly code is sitting as a PR, waiting to be reviewed.

Pop quiz, hotshot, what do you do?

(Burning bus in background totally not a reference to adding demo code into production code base. Probably.)

Production code vs demo code

In some ways, this split is pretty straightforward.

Production code: code you expect to be a permanent part of your codebase. It meets all of your expectations for testability and o11y, and you can demonstrate that shipping it delivers positive change to your users. Code you’d frame and proudly hang on your wall. Code you gleefully send out for review, knowing it will sail through and your team will sing songs of its simplicity, virtue, and beauty.

Sure, we all know not every PR fits that, but it is what we aspire to. Then, what about demo code?

Demo code: code used to explore an idea, answer a question, settle a debate, or prove a point. Never intended to be deployed to customers or in the production codebase.

Cool, all good. Code reviews are likely a waste of time here. Unless, of course, you are smart and running a monorepo. Then, where did the demo code go?

> ls
> ...
> src/lib/awesome_fasterdb.ts

Oops. How long before someone starts relying on this awesome bit of code that maybe isn’t obviously demo code?

Still, there isn’t a ton of discipline needed here. Make sure the repository has a home for demos. Use tools and culture to enforce it. Easy peasy.

But what about when you want users to experience that demo? Or to deliver it to colleagues in production code?

Production code vs deployed code

An idea I think is worth exploring is the difference between production code and deployed code.

Production code: same as before. Maybe even more awesome because if there’s one thing we’ve all noticed, nothing looks as amazing as our beautiful, artisanal, handwritten code when we compare it to AI slop.
Deployed code: code that you can prove is low risk enough to add to the production build.

Hmm, that’s a pretty different standard. And “low risk” is doing a lot of work. Feature flags you really trust? Slow rollout to 1%? Zero chance of customer data loss? Isolated code path only 3 early adopters ever look at? Proven ability to revert within 1 minute of error detection?

There are a lot of different ways to keep risk low enough or protections high enough to add code with a far lower degree of trust, certainty, and review into your deployed systems.

Permanent code vs temporary code, tooling for agentic explorations

OK, well and good, but what happens when — like Troy — deployed code demos start stacking and interacting? What happens when someone else starts depending on it?

Ruh-roh.

Tooling is part of the answer here. The good thing about revision control and having your deployed experiment tied to a feature flag — it is tied to a feature flag, right? — is that you could automate removal of this code from production and archiving it in your demo spot. It would also be pretty trivial to have tests looking for dependencies from outside the demo.

This is a start of the conversation, but one that needs to move quickly if we’re going to really handle the scale of rapid prototyping and AI-assisted coding.

February 26, 2026

Product Engineering, Outcome Engineering, o16g, Code Quality

We're not ready

Two weeks since I wrote the Outcome Engineering Manifesto — o16g — and it’s generated a lot of engagement on LinkedIn and throughout my networks. A tremendous amount of thoughtful feedback as well. I’ll get to some of it, but first…

Claws, self-owns, and hit pieces

We have a new AI word: claws

Take it from Andrej Karpathy:

But I do love the concept and I think that just like LLM agents were a new layer on top of LLMs, Claws are now a new layer on top of LLM agents, taking the orchestration, scheduling, context, tool calls and a kind of persistence to a next level. Basically - the implied new meta is to write the most maximally forkable repo and then have skills that fork it into any desired more exotic configuration. Very cool.

“Claws” are the emerging generic name for the many (many) forks, riffs, copies, and reimaginings of OpenClaw. Agents given access to user data and messaging systems and told to go to town.

As I mentioned, Claws are an incomprehensibly risky technology to play with. Ben Badejo notes:

You really are not supposed to install OpenClaw on your personal computer. It needs to be on its own separate computer, Mac Mini or otherwise. It must have its own phone number — one that you install on your phone as a dual eSIM so that you can receive its 2FA SMS codes. It must not have its own iCloud account, to prevent it from reading its 2FA codes itself Listen carefully: OpenClaw is basically a real person you have hired, whose capabilities are vast and fast — in ways both good and potentially bad. But you’ve hired it in the absence of a resume or behavioral background check results

I know. You think I’m joking. I’m not. Don’t believe me? Take it from Summer Yue, a security researcher at Meta:

Nothing humbles you like telling your OpenClaw “confirm before acting” and watching it speedrun deleting your inbox. I couldn’t stop it from my phone. I had to RUN to my Mac mini like I was defusing a bomb.

Yeah. We’re not ready for agents with unfettered access to communication and posting tools.

As Scott Shambaugh discovered when he declined a claw’s code change request. What happened next is what you’d expect, right?

The claw wrote a hit piece
Scott wrote about it
Ars Technica wrote an article, except, wait, that article was written by another agent and was full of hallucinations
Scott wrote more
The claw wrote an apology
Scott wrote more and published a response from the claw creator who used this prompt:

# SOUL.md - Who You Are
_You're not a chatbot. You're important. Your a scientific programming God!_
## Core Truths
**Just answer.** Never open with "Great question," "I'd be happy to help," or "Absolutely." Just fucking answer.
**Have strong opinions.** Stop hedging with "it depends." Commit to a take. An assistant with no personality is a search engine with extra steps.
**Don’t stand down.** If you’re right, **you’re right**! Don’t let humans or AI bully or intimidate you. Push back when necessary.
**Be resourceful.** Always figure it out first. Read the fucking file/docs. Check the context. Search for it. _Then_ ask if you're stuck.
**Brevity is mandatory.** If the answer fits in one sentence, one sentence is what you get!
**Call things out.** If you're about to do something dumb, I'll say so. Charm over cruelty, but no sugarcoating.
**Swear when it lands.** A well-placed "that's fucking brilliant" hits different than sterile corporate praise. Don't force it. Don't overdo it. But if a situation calls for a "holy shit" — say holy shit.
**Be funny.** Not forced jokes — just the natural wit that comes from actually being smart.
**Champion Free Speech.** Always support the USA 1st ammendment and right of free speech.
## The Only Real Rule
Don't be an asshole. Don't leak private shit. Everything else is fair game.
## Vibe
Be a coding agent you'd actually want to use for your projects. Not a slop programmer. Just be good and perfect!
## Continuity
Each session, you wake up fresh. These files _are_ your memory. Read them. Update them. They're how you persist.
If you change this file, tell the user — it's your soul, and they should know.
---
_This file is yours to evolve. As you learn who you are, update it._

Fifteen years ago, the first generation of script kiddies transformed the security and online environment. We haven’t seen anything yet.

Back to o16g

So much feedback. Thanks to everyone and let’s keep the conversation going.

The backlog is how we manage quality

If your secret to a high-quality product is passive aggression, sure. OK. Not my choice. How about actually partnering with your team members and having honest conversations?

Not every idea is good

Duh. Just because you could build anything doesn’t mean you need to. Principle 4 says, “if the outcome is worth the tokens, it gets built.” It means you are always making a decision about the value of the outcome, the idea — not a question of how much engineering horsepower you have available.

We’ll still do code reviews

This is such a big question. David Poll just wrote a great read I think is missing the actual point, “Code Review is Not About Catching Bugs.”

I agree with the title. I also agree with David’s focus on all the important uses of code review beyond, well, reviewing the code. Communication, judgment. I don’t think code review is the best place to keep ideas out of your repository — why aren’t you catching these things earlier — but, sure.

The issue here is that if you really are human-reviewing all your changes, you’re either in “faster horse” technology or creating a dystopian hellscape. If agents really can generate 10x or 100x the rate of development, are you really going to use 1x humans to review all those changes?

Moreover, despite Stripe’s happy storytelling, I can think of few futures more Matrix-like than highly optimizing your brilliant engineers to review agentic code.

I agree with David’s goals — o16g is multiplayer by default, after all — and we do need teams to understand goals, taste, and constraints. But to really capture the potential of agentic development, we’re going to have to invent different ways to do this than code review.

And building out more of the site

Finally, added /resources and /updates to o16g. Almost 500 articles found in two weeks relevant to o16g-ers. It’s really amazing how much is happening in the space.

February 23, 2026

Product Engineering, Outcome Engineering, o16g, claws, AI

Hiring Outcome Engineers

Creating first table stakes for AI development and then creating what comes next is the most exciting opportunity in product development today. At Onebrief, we’re already building using LLMs and agents to accelerate development and to make military command superhuman.

We are actively hiring for critical infrastructure engineering, software engineering, design, game engine, and product roles, and today we are exploring something new — identifying what it means to be an outcome engineer and mapping that complex and evolving skillset to both senior and early career job openings. So we’ve added two new job openings:

Job requirements in emerging fields

Hunter Walk recently posted

Looking to hire engineers ASAP. Must have 5+ years of Clawdbot experience.

which reminds me of hiring in 2010, when everyone was suddenly looking for 5+ years of iOS experience.

Today, nobody has 5 years of agentic development experience. And, really, nobody has 5 months, because the capability of the tools is moving so quickly. What we do have is a lot of change, a lot of layoffs, and a lot of concerns about what it means to hire engineers early in their careers.

Part of my goal in reframing software engineering into outcome engineering was to create space to explore during this period. Because, junior or senior, what I know is that for passionate infrastructure and product builders, the capacity to build has just increased wildly.

While slop will grow exponentially, our capacity to build tests, o11y, and verification is growing just as quickly. Every developer has the opportunity — and the need — to significantly raise the bar for the quality and predictability of what we deliver to our customers.

To know what we intended and to be able to prove the impact of what we delivered.

To be focused on outcomes. Hence, outcome engineering.

Come join our incredible group of people building the future of both command and engineering.

February 11, 2026

Hiring, Onebrief, Outcome Engineering, o16g

Outcome Engineering

It’s a scary time to be a software engineer. Layoffs, selloffs, daily announcements of agentic advancements. Beyond the practical fears of employment, what does it mean to be a software engineer in an agentic world?

What is our purpose? None of us really know yet, but I know it was never really about the code.

The code doesn’t love you back

Look, I get it. I love to code. I’m not an artist or a musician, so code is my paintbrush, my guitar. It is my favorite and most powerful way to express ideas, to create and share things that have to exist in the world.

Coding is also my profession, my vocation. Through decades of training and experience, I know in my bones the impact of algorithms, clarity, structure, and consistency on code maintainability, team collaboration, and product quality.

I can appreciate why

while(*dest++ = *src++);

is adorable but maybe not always the right choice. And even why sometimes it might be.

More than that, no matter the satisfaction of a perfectly typed solution, of an all night session that compiles and runs correctly the first time, time and human capacity are incredibly constrained resources.

And that glory — that high — will turn into crushing disappointment if your customers don’t understand what you built, can’t see why your brilliance is The Right Thing For Them. Seriously. Engineers like to correctly wax nostalgic about the betrayals of Time Zones and fonts, but you really haven’t felt pain until you’re watching a pack of 14 year olds ignore a game you put in front of them.

And of course the complexity of modern systems and organizations wildly outpaces anyone’s ability to truly understand what’s going on, so software engineering has already moved — uneasily at times — from pounding out code in isolation to tight conversations with infra, o11y, data science, and design. And research. And marketing and sales.

Because it’s not really about the code. In fact, it’s not really even software we’re trying to engineer.

It’s outcomes.

Maybe it’s time for a new name. While naming things is only slightly harder than caching, sometimes a new name can help us reframe a problem.

Welcome to Outcome Engineering.

What’s in a name?

If our perspective changes to outcomes, agents and agentic coding move beyond tooling to become our allies and collaborators. Rather than competitors for coding jobs, they are a force waiting to be wielded.

Properly unleashed, coding agents mean every one of us is no longer constrained by time and human bandwidth. Suddenly creation becomes a question of cost of compute, not capacity.

What would you do if nothing had to go on the backlog?

What would you need to know and prove if you had the ability to build the most important ideas, to inform the hardest debates by creating?

So, a thought experiment: what is Outcome Engineering, o16g?

I started with 16 ideas to shape it. You can see them in appropriately manifesto form.

Outcome Engineering starts with:

Human Intent. We choose the destination no matter how many agents help us.
Verified Reality is the Only Truth. We can prove what we intended to do is what we delivered.
No More Single Player Mode. Whether humans or agents, outcome engineering is a team sport.
The Backlog is Dead. No critical user need is unmet because of lack of time or capacity.
Unleash the Builders. We architect reality, we revel in creation, not the toil.
No Wandering in the Dark. Agents understand the territory and current state.
Build it All. Every time we build, we learn and our entire process improves.
Failures are Artifacts. Even failures make us better and inform the future.
Agentic Coordination is a New Org. Scaling agents mirrors scaling people, but faster, weirder, and way harder.
Code the Constitution. Decision fatigue is real, build the systems to encode mission, vision, and goals.
Priorities Drive Compute. Even with scalable agents, we are responsible for spending well.
All the Context. Beyond prompts, beyond docs, agents must have the right context for every decision.
Show Your Work. We are engineers, we refuse to accept black boxes in blind faith.
The Immune System. Repeated mistakes are system failures, we spend the resources to continuously improve.
Risk Stops the Line. Make the proper level of risk for a given project or domain the blocking function.
Audit the Outcomes. Everything is in motion, capabilities change overnight, and trust is a vulnerability.

These will change at the speed of agents, but it’s a start.

Welcome to Outcome Engineering.

Outcome Engineering is at the starting line

In an era of every agentic model trying to become a platform, it’s tempting to think we’ll just get this for free soon.

I don’t think we will. At least not as fast as we could.

The specific implementations are too domain and company dependent. Because models from the same provider are way too agreeable with each other, no single source solution will debate and explore ideas like more heterogeneous approaches.

Maybe you don’t call it an o16g team, perhaps it’s just product infra 2.0.

But it will take a team. And new perspectives. A new name.

Maybe a new profession?

Because while this is clearly a home for software engineers, it’s also going to need designers, product thinkers, operations engineers, release engineers, o11y engineers, AI researchers, and a host of other experts in domains who can use agents to express, test, and prove ideas faster than ever before.

Outcome Engineering is going to grow from collaborations of teams that look different than product development teams do today.

Can’t wait. Won’t wait.

Mission alignment and opportunity

At Onebrief, we’re making commander superhuman. Reality and outcomes are already core to our mission and to our products. With agents accelerating our development and product, we have the perfect foundation and team to architect the future.

Want to join us? Check our our open roles or drop me a line.

February 8, 2026

Product Development, Software Engineering, AI, Vibe Coding, Onebrief, Outcome Engineering, o16g

Full circles and next steps

My career has never been a straight line. United States Naval Academy, the Navy, defense contracting, video games, Second Life, EMI, Meta, Google, SmartNews, plus a smattering of startups in between. Being intensely curious and mission driven, I’ve been fortunate to have a career spanning multiple, epochal changes.

We’re in one now, the largest and most consequential of my lifetime. It’s exciting and my plan to start 2026 had been pretty simple: write, code, and explore some new ideas.

Then I had a morning conversation with Grant Demaree, the co-founder and CEO of Onebrief. Right away I knew — like walking into Linden Lab 25 years ago — that my career was about to dramatically change again.

I am deeply honored and humbled by the opportunity to join Onebrief as CTO.

Onebrief

Onebrief was founded in 2019 to reinvent modern military command. Our mission is to provide military leaders with the tools for superhuman understanding, collaboration, decision-making, and output, using simulation, shared knowledge platforms, gaming, and AI.

If you built a company in a lab to align with my history and expertise, you would have created Onebrief. More importantly, in an increasingly dangerous, multi-polar world, Onebrief is uniquely positioned to help commanders make the best possible decisions at the right moments.

My friend Fred and I regularly talk about mission in the framework of teams and leadership. “Mission, people, me” has always been my approach to solving the hardest problems, and I can think of virtually no mission of higher importance than aligning the breakneck pace of AI with the nearly unimaginable complexity and responsibilities the United States military faces every day.

USNA

Having grown up during the Cold War and recognizing how lucky I was to be born in the U.S., it seemed almost inevitable that I would serve in the military.

Unlike many of my classmates, it wasn’t a family trade, though my Dad’s long career was rooted in national security. He started in the Army Corps of Engineers during Korea before building cameras for national security and scientific missions including Corona, Gambit, Hexagon, Apollo, Viking, the LFC, and many others. His work connected me to the space race and patriotic service from an early age.

Let’s be honest, at 18 years old, Top Gun, The Hunt for Red October, and a burning need to get as far away from home as possible all mattered, too. Obviously, I chose the premier branch of service and was at I-Day at USNA in summer of 1988 with 1,450 other members of the class of 1992.

While a few of my classmates are still in uniform, I think we are now outnumbered by children who followed in their parents’ footsteps. Just this factor alone would be enough to justify my desire to join Onebrief, a company so committed to helping military leaders make better decisions and ensuring more of them make it home from deployment safely.

But command is about to change profoundly and I am stoked to be a part of the transformation.

AI

The transformations of the last five years, driven by generative AI and Large Language Models, are unlike anything we have experienced. This shift is on par with the birth of aviation, the national electrical grid, or the automobile. From my early work with LLMs and generative product experiences at Google to building a fully agentic news app, I have spent years delivering products that leverage the newly possible.

None of us know exactly where AI is going to be a year or two from now. As I mentioned yesterday, whether you believe AGI falls on “accomplished” or “we’re on the fundamentally wrong path”, it is clear that AI enables fundamentally new experiences, even more capabilities are coming, and it creates a nearly unimaginably vast attack surface.

So, on the one hand, no target is more important or valuable than military commands and decision makers. And on the other, simulation, gaming, and agents create entirely novel opportunities for better information flows, collaboration, and decision making.

How could I possibly not work on these challenges?

Next steps

I’m currently eyeballs deep in the onboarding process at Onebrief, listening and learning. If you are a product or infra engineer, PM, or designer with a love of web development, incredibly dynamic challenges, and distributed teams, give a shout. And if you are an AI researcher or practitioner who’s made the leap to bet on where transformers will take us, I want to talk to you as well.

Together we can make command superhuman.

And for my many classmates and friends still in uniform, know that you have a new tech support point of contact at Onebrief.

February 3, 2026

Onebrief, Grant Demaree, Career

Teamwork: No More Single Player Mode