Mining the glue: the signal-harvesting thread at 8am AI
#8am-ai#signal-harvesting#deep-dive#project-state#intelligence
David OlssonSome threads in the corpus describe a tool. This one describes a primitive the group kept reinventing until it became the way the group itself works: reading across everything that's been said and surfacing the connective tissue — the glue — that no single human has time to track.
It's worth saying plainly: this blog is an instance of the thing this thread is about. The corpus you're reading was harvested, mined, and graded by the exact pattern the group spent two years circling.
The arc
- 2024 — value out of accumulation. Early on it's framed commercially: AI product development and monetization, investment strategy, the idea that there's value latent in data you've already got if you can process it.
- 2025 — intelligence gathering proper. The framing sharpens. "Data anchoring" — grounding model output against real project data. The realization that conversations themselves are a dataset. By mid-year the group is explicitly treating its own meeting history as a source to mine, not just an archive.
- 2025 H2 — the systems question. Canadian funding, IP retention, standardization. How do you keep the value you harvest? The thread widens from "extract signal" to "own the substrate the signal lives in."
- 2026 — quantizing intelligence, and the primitive named. The most ambitious framing arrives: standardization and quantizing intelligence — treating intelligence itself as something you can structure, measure, and move. And then, plainly, "AI as a collaborative partner and signal harvesting" — the primitive stated as a working method.
The four layers
The clearest artifact the thread produced is the project-state four-layer model — the structure that makes harvesting repeatable rather than ad hoc:
- Agents — the skills that read across surfaces (Slack, email, repos, transcripts) and surface candidate signals.
- Schema — the map that turns raw mentions into structured, comparable records, so signals from different weeks line up.
- Data — the durable corpus the agents read and write; the thing that compounds.
- Time — the dimension that turns a pile of records into a trajectory. One page is a note; one page across two years is a theme.
The insight the group kept returning to: harvesting only becomes intelligence when all four are present. Agents without schema give you noise; schema without time gives you a snapshot. Put them together and the substrate starts telling you things no one in the room had noticed.
Why it matters more than it sounds
"Signal harvesting" sounds like a feature. In the corpus it's closer to an epistemology. The claim underneath it: the most valuable knowledge in any group isn't in anyone's head — it's distributed across everything the group has said and done, and it stays invisible until something reads across the whole record at once.
That's why the thread ends up being about the group itself. The same pattern that mines a company's Slack for synergies mines two years of Wednesday mornings for themes. Build the harvester once and you can point it at any sufficiently large record — including your own.
Open question
If intelligence can be "quantized" — structured and measured — the open question is whether the act of structuring changes what you find. A harvester surfaces what its schema is shaped to see. The corpus has built the harvester; it hasn't yet asked hard enough what its own schema might be blind to.