vietnamese mud crabsoft-shell crab
4
14 Comments

Detection is becoming a solved problem. Execution isn’t.

It feels like we’ve spent the last decade building incredible tools for detection.

We can detect:

  • incidents
  • anomalies
  • fraud signals
  • performance degradation
  • security events
  • operational bottlenecks

And we can do it faster than ever before.

But the more I look at real-world systems, the more it feels like detection is no longer the biggest challenge.

Execution is.

Because once something is detected, a different set of problems appears:

  • Who owns it?
  • What happens next?
  • How should it be prioritized?
  • Who needs to be involved?
  • How do we know it was actually resolved?

Most teams already have visibility.

What they struggle with is maintaining clear, consistent execution across people, teams, and changing priorities.

That’s why I keep coming back to the same question:

Have we focused so much on improving detection that we’ve underinvested in the structure of response and execution?

Curious how others here think about it.

When things break down in your systems, is the problem usually detection—or what happens after detection?

posted to Icon for group Startups
Startups
on June 4, 2026
  1. 1

    State-transfer problem is the right framing - much cleaner than how I was thinking about it. Ownership implies someone will figure out the context. State transfer is explicit about what actually needs to move. The signal route is fine, it's the metadata that gets dropped at each handoff. Does your stack try to carry that context natively or is that still a manual layer between tools?

  2. 1

    Strong take. Though I'd argue detection and execution aren't sequential problems — they're cultural ones. Teams that struggle with execution usually struggle because accountability was never designed in, not because the tooling failed.

    1. 1

      That's a good distinction.

      The more I look at it, the more ownership seems less like a workflow problem and more like a system design decision.

      Teams often treat accountability as something that emerges naturally, but at scale it seems to need to be explicitly built into how signals move through the organization.

      That makes me wonder whether execution failures are often organizational design failures that happen to show up in operational systems.

      1. 1

        I think that's often true.

        What we label as an execution failure is frequently an organizational design failure expressing itself operationally. By the time a problem shows up in a dashboard, ticket queue, or escalation path, the underlying issue may already be embedded in how decisions, incentives, and ownership are distributed.

        That might explain why some organizations can have excellent visibility into problems yet still struggle to act on them consistently. The challenge isn't detecting the signal—it's ensuring responsibility travels with the signal.

  3. 1

    The execution gap is real and it's actually getting worse as detection gets better. More alerts, more signals, more visibility — but the ownership question at the moment something is detected is still solved by whoever happens to be awake or whoever feels guilty enough to pick it up. The part I think gets underbuilt: not just who owns the response, but how the context gets handed off when ownership changes. Half the execution failures I've seen weren't about people not caring, they were about someone picking up an incident mid-stream without the context the previous person had.

    1. 1

      The context handoff point is interesting because it's easy to focus on ownership and miss continuity.

      A signal can have an owner and still fail if critical context gets lost between people, shifts, or teams.

      What you're describing feels less like a routing problem and more like a state-transfer problem.

      The signal survives, the ownership changes, but the operational understanding doesn't always travel with it.

      That seems like a major source of execution drift.

      1. 1

        The state-transfer framing is sharper than anything I had. Routing you can fix with org charts and escalation paths. State transfer you have to design for separately. The part that usually breaks is not the handoff itself but what happens right after. The new owner has the ticket but not the thread, so they start from scratch. Most post-mortems blame the person. The context gap never shows up in the report.

  4. 1

    The dichotomy is partially right but overgeneralized.

    Where it lands: incident response (PagerDuty mature, runbook orchestration messy), security ops (SIEM mature, SOAR maturing), fraud ops (ML detection good, review workflows ad-hoc).

    Where it doesn't hold: detection is not "solved." False positive rates remain brutal in fraud, security, anomaly detection. In consumer products, detection is still the hard part.

    Deeper issue: detection and execution are coupled, not separate. "We don't know who to route this to" often means "we don't know enough about what the signal is." Execution problems are frequently detection problems masquerading.

    If you're building something specific, category matters. PagerDuty owns SRE response. Tines/Torq own SOAR. Tonkean owns workflow orchestration. Generic "execution layer" loses every time.

    What's the specific category you're thinking about?

    1. 1

      I think that's a fair challenge.

      I'm probably overstating detection in some domains. Fraud, security, and anomaly detection still have major unsolved problems.

      What's becoming more interesting to me is where detection is already good enough for action to begin, yet resolution still stalls.

      The category question is exactly what I'm trying to understand right now.

      The recurring themes I'm seeing are ownership, context, prioritization, coordination, and closure.

      I'm not yet convinced that's a workflow category, an incident category, or something broader.

      1. 1

        The reframe is sharper. "Detection good enough to act, resolution stalls" is much more defensible — and a wedge worth investigating.

        Your five themes (ownership, context, prioritization, coordination, closure) are canonical post-detection problems showing up in every workflow. But dominant pain shifts by category:

        Incident response (PagerDuty/Incident.io/Rootly): ownership and coordination dominate.

        SOAR/security (Tines/Torq/Swimlane): context and coordination dominate.

        Customer support escalation: ownership and closure dominate.

        Operational excellence (Tonkean/Pipefy): prioritization and context dominate.

        Quality/compliance ops (regulated): different regulatory burden, context and closure dominate.

        Universal problem is real but solo founders can't win universal problems. Has to be ONE category with personal experience, identifiable buyer, gap in existing tools.

        Where have you personally felt the pain? That's usually the wedge. If you've worked in SRE, IR is it. Security ops, SOAR. Customer ops, support escalation. Industrial/regulated, quality/compliance. Category often picks itself from founder experience.

        (Analysis here and previous came through HiveMind — https://hivemind.myosin.xyz/auth/signup, code HivemindIH123. AI strategy copilot, closed beta, code gets you access.)

  5. 1

    that split feels right. most teams can detect the issue; the hard part is shipping the fix without adding more process. that is where simple workflow tools usually earn their keep.

  6. 1

    The gap is usually after detection.

    Most teams can see the issue. The harder part is turning that signal into clear ownership, priority, action, and proof of resolution.

    That is why “response structure” feels like the real category here, not more monitoring.

    The interesting question is whether this becomes a workflow layer, an accountability layer, or an operating system around incidents and operational signals.

    1. 1

      Yes. This is exactly the part that keeps showing up.

      Detection is already strong in most systems. The consistent failure point is everything after: ownership, prioritization, coordination, and closure.

      And I agree with your framing — “response structure” might actually be closer to the real gap than monitoring or observability.

      The interesting direction for me now is whether that layer becomes more like:

      • a workflow layer
      • or an accountability layer
      • or something closer to an execution system that sits on top of detection

      Because depending on that answer, the whole category changes.

      1. 1

        Exactly. That category choice matters more than the feature list right now.

        If you frame it as workflow, people compare it to task tools.

        If you frame it as accountability, they think ownership, proof, closure.

        If you frame it as execution, it becomes the layer that turns operational signals into resolved work.

        I’d be careful choosing that too casually, because the category will decide the buyer, the language, and what people expect the product to replace.

        Happy to map the tighter category version if useful.

Trending on Indie Hackers
I got my first $159 in sales after realizing I was building in silence User Avatar 51 comments I spent more time setting up cold email than actually selling. Here is what fixed it. User Avatar 41 comments Three Days Before Launch, I Let My Own Tool Tear Me Apart User Avatar 31 comments I just wanted to taste AI coding tools. A week passed. User Avatar 28 comments I got tired of rewriting the same content for 9 different platforms. So I built Repostify. User Avatar 26 comments A pattern I keep seeing in EdTech: traffic isn't usually the problem. User Avatar 22 comments