Applied AI with a P&L
When AI lands in a company that has to defend a P&L, the rules change. This is what “applied” actually means in that context.
The word “applied” is doing a lot of work right now. Everyone who sells AI products or AI services uses it. But when a CFO at a 150-person manufacturing firm in Grand Island asks what applied AI means for his operations, the answer has almost nothing to do with research papers or benchmark scores. It has everything to do with whether a workflow that costs $180,000 a year in labor can be cut in half, and whether someone in the building will be accountable for that number.
That gap between how the technology industry talks about AI and how a mid-market company actually runs is where most AI initiatives fall apart. Understanding the gap is the first step to building something that holds.
The two definitions of “AI” colliding inside a mid-market firm
There are two entirely different things people mean when they say “AI” in a business context. The first is research-grade: the conversation about models, capabilities, benchmarks, emerging reasoning chains, and where this technology is heading over the next decade. It’s genuinely interesting, and it produces the headlines.
The second is business-grade: a narrow workflow that currently takes too many hours, handed to a system that can do most of it reliably enough that a human reviewer can handle the exceptions. No press releases. No whitepapers. The output is measured in hours recovered and headcount capacity freed.
Research-grade AI is a category that belongs to the model labs, the university programs, and the technology press. Business-grade AI is what a 200-person insurance agency or regional bank can actually build, own, and run. The two definitions collide when a vendor pitching research-grade capabilities sells a contract to a company that only has business-grade problems to solve.
Why most pilots stall: the missing P&L owner
If you’ve been inside a mid-market AI pilot that didn’t go anywhere, the postmortem usually sounds the same: it worked in the demo, the team was enthusiastic, but then nothing happened at scale. The reasons vary — integration problems, staff adoption issues, model output that wasn’t reliable enough — but the root cause is almost always the same. Nobody with P&L responsibility signed their name to a success definition before the build started.
When AI is framed as a technology initiative, it gets handed to IT or to an operations lead with no budget authority. Those people can evaluate whether the tool works. They can’t force the workflow change across a department, kill a headcount line, or reallocate the hours the AI is saving. Without a P&L owner who committed to a specific outcome — fewer hours on submission intake, faster loan packet turnaround, lower invoice-processing cost — the pilot produces a demo and then collects dust.
The P&L owner doesn’t need to understand how the model works. They need to understand what they’re agreeing to measure, when they’ll be accountable for the result, and what has to change in the workflow for the number to move.
The shape of an AI initiative that survives a board meeting
A board-ready AI initiative has four visible elements: a named workflow, a current cost or time baseline, a target outcome with a date, and a person accountable for the result. That’s it. Everything else — the tooling choices, the integration approach, the model vendor — is downstream of those four things.
If you can’t write those four elements on a single slide without hedging, the initiative isn’t ready. “Reduce submission intake from 45 minutes per submission to under 10 minutes per submission by Q3” survives scrutiny. “Improve operational efficiency through AI-enabled automation” does not. The second one has nowhere to land when the board asks how it’s going in month six.
What “applied” rules out
The word “applied” rules out two things that consume most AI budgets at companies that don’t get results.
The first is a perpetual research budget. Applied AI has a ship date: a discovery phase, a build phase, a production date, and a measurement date. If the initiative doesn’t have all four, it’s a research budget dressed up as a capital project.
The second is vague success metrics. “Better insights,” “improved visibility,” “faster decision-making” — none of these can be measured in a quarter. Applied AI initiatives use operational numbers: submissions processed per hour, loan packets per analyst per week, hours per invoice cycle. If the current baseline isn’t documented before the build starts, the initiative will be evaluated on vibes, and vibes are hard to defend in a budget review.
What it looks like in practice
A commercial insurance agency in the Omaha market decides submission intake is the right place to start. The baseline: two underwriting coordinators spending roughly 30 hours per week combined — pulling data from broker emails, opening ACORD PDFs, typing fields into the agency management system. The target: cut that to under 10 hours combined, with the remaining time going to exception review rather than data entry.
The initiative gets a name. It gets a sponsor — the COO, who owns the operations budget. It gets a ship date, a 60-day build window, and a 90-day measurement date. The technology choices follow from those anchors.
By the measurement date, the intake pipeline is handling roughly 80% of submissions end to end. The two coordinators still touch every submission — but they’re reviewing and approving, not typing. The COO presents the numbers to the board in the quarterly operations review. Nobody asks whether the AI was “applied.”
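The arithmetic behind that result is worth writing down, because it is what the COO actually defends in the budget review. A minimal back-of-envelope sketch, assuming a fully loaded labor cost of $45 per hour and a 48-week working year (both figures are illustrative assumptions, not numbers from this case):

```python
# Back-of-envelope value of the intake initiative.
# The hourly cost and weeks-per-year figures are assumptions for illustration,
# not numbers taken from the case above.
baseline_hours_per_week = 30   # two coordinators, combined (the documented baseline)
target_hours_per_week = 10     # post-automation target, now spent on exception review
loaded_hourly_cost = 45        # assumed fully loaded cost per coordinator hour
weeks_per_year = 48            # assumed working weeks per year

hours_recovered = (baseline_hours_per_week - target_hours_per_week) * weeks_per_year
annual_labor_value = hours_recovered * loaded_hourly_cost

print(f"Hours recovered per year: {hours_recovered}")              # 960
print(f"Approximate annual labor value: ${annual_labor_value:,}")  # $43,200
```

Whatever the exact figures, the point is that the number exists before the build starts, so the measurement date has something concrete to compare against.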
For more on what this looks like in a specific industry, see how Blue Sage approaches insurance workflows.