Most enterprise AI projects don't fail for lack of data. They fail because the data they pull from carries no context. A vector store can retrieve text chunks that look statistically similar to a question. It cannot tell you that the chunks describe the same vendor, or different products from the same vendor, or the same product under two different brand names after an acquisition. The retrieval was correct. The conclusion was wrong.
File-based and API-based data exchange moves data. It does not move knowledge.
Section 1: The diagnosis Gartner is naming
Gartner's June 2025 press release on agentic AI[3] predicts over 40% of agentic AI projects will be cancelled by end of 2027. The cited drivers are escalating costs, unclear value, and inadequate risk controls. The first two are downstream of the third · agentic AI fails reliably when the retrieval substrate underneath it can't produce evidence the buyer can audit.
The Microsoft Research GraphRAG paper[2] documents the architectural fix in a different vocabulary: graph-based retrieval delivers “substantial improvements over a conventional RAG baseline for both the comprehensiveness and diversity of generated answers” on global sensemaking questions at the 1M-token-corpus scale. The mechanism: instead of ranking text chunks by cosine similarity, the system traverses typed entity relationships to assemble the answer.
Section 2: What “context” means as a layer
Context is not a synonym for “more data.” It's a specific shape: typed entities + named relationships + governance rules. The semantic web standards (RDF, SPARQL, SKOS, OWL, SHACL) have offered this shape for two decades. Two things changed recently:
- LLMs can author the graph. Tools that previously required taxonomists can now bootstrap a domain model from existing data, with humans in the loop for governance decisions.
- LLMs can query the graph. Natural-language questions translate to SPARQL (or graph-DB equivalents) and return cited entities · not summaries of documents that might mention them.
The combination is what makes graph-grounded retrieval (“Graph RAG”) different from vector retrieval. Graph RAG is what document-grounded RAG should have been from the start · it just couldn't exist until the model layer could author and query the substrate at production cost.
Section 3: Three taxonomies already prove the pattern at scale
Skeptics will ask: does this actually work in production? The answer is yes, and it has been working in production for decades in domains that hold their AI systems to a higher accuracy bar than enterprise SaaS typically does:
- MeSH · Medical Subject Headings.[4] The controlled vocabulary the US National Library of Medicine has used since 1960 to index PubMed/MEDLINE. Every biomedical citation retrieval system you have ever used · every clinician's literature search, every meta-analysis tooling · depends on it. Billions of citations indexed against a typed taxonomy of ~30K concepts.
- EuroVoc.[5] The European Union's multilingual thesaurus across 24 official EU languages. It indexes the legislative and policy corpus of the EU. The fact that a German regulator and a Portuguese citizen can search the same body of law and retrieve the same documents in their own language is downstream of EuroVoc's typed-graph design.
- SNOMED CT.[6] Systematized Nomenclature of Medicine · Clinical Terms. 350,000+ active concepts used in clinical decision support across 80+ countries. When a hospital information system in Brazil and one in Singapore share patient data, SNOMED CT is what makes the records interoperable · not just transferable.
Each of these is a typed graph with a governance body, a versioning discipline, and a fielded query interface. None of them is a vector store. None of them is fashionable. All of them have been more reliable than any LLM-only retrieval system anyone has shipped.
The pattern that runs PubMed, EU legislation, and clinical decision support has been waiting twenty years for agentic AI to catch up.
Section 4: What this means for enterprise software product data
Enterprise software is the largest category in the world without a public typed graph. There is no MeSH for SaaS. There is no PubMed Terminal for the question “which CSPM vendors have FedRAMP authorization, ship Salesforce integration, and are mentioned in the last 90 days of analyst coverage in a specific tone.” That question is a four-hop traversal. Vector retrieval can't do it. The 88 universal node types in the PYRAMYD graph were built specifically to make this class of question first-class.
Where the graph layer changes the answer shape
- Retrieval unit · vendor, product, capability, persona, country · not text chunks
- Update model · FK constraint propagation · not full re-embedding
- Citation granularity · per-claim with source URL and retrieval timestamp · not document-level
- Failure mode · “no path found” instead of hallucinating the join
- Audit surface · every traversal logged, every cell exportable · not just a conversation log
Section 5: The regulatory pull just made this urgent
EU AI Act Article 50[7] became generally applicable on August 2, 2026. The two transparency obligations procurement teams now ask about · per-claim provenance and machine-readable AI-content marking · are mechanically difficult for document-grounded RAG and structurally easy for graph grounding. We covered the procurement-side framing on the Article 50 Ready page.
Article 50's technical requirements weren't designed for any specific architecture. But the natural way to satisfy them is to have a typed substrate that already knows what each cell is, where it came from, and which model authored or transformed it. That's the same shape as a knowledge graph. Regulation is now pulling enterprise AI architectures toward the pattern that biomedical and legal knowledge graphs have run on for two decades.
Section 6: The role that emerges · knowledge steward
The original Graphwise piece names a role that's emerging in the same arc: the knowledge steward. Not a data engineer (who owns the pipes), not a taxonomist (who owns the model in isolation), but a domain expert who decides what goes into the graph, what stays out, what counts as the canonical form of an entity, and what governance applies. LLMs let knowledge stewards focus on judgment instead of manual model authoring.
Inside PYRAMYD-using organizations, this role usually starts as a Product Marketing or Product Operations lead who realizes the graph is the new substrate for every team's questions. The job description shifts from “maintain a battle-card library” to “govern the substrate that every team queries.” That's a strictly more leveraged seat than the one it replaces.
Section 7: What we built (and what you can use)
The PYRAMYD Product Graph is the enterprise-software-specific implementation of this pattern. 88 universal node types, 1,554 FK constraints, 252K+ products, 2,606 categories, 2.4M+ reviews, 1,000+ live signal sources, refreshed continuously. APEX is the agentic layer on top, with per-cell citation and the JSON-LD provenance block on every response.
For organizations already building on Graphwise (enterprise documents) or Neo4j / Stardog (general-purpose graphs), PYRAMYD complements rather than competes · we're the product-data slice of the same broader architecture you're already adopting. Our MCP server exposes the graph to any agentic client.
Where this lands for PYRAMYD customers
Section 8: Reading list
The four sources we'd hand to a procurement team or an architecture review board on this topic:
- The original Graphwise / CDO Matters piece this post extends[1] · the most accessible framing of the “context as a layer” thesis
- The Microsoft Research GraphRAG paper[2] · the canonical academic reference for the architectural difference
- The Gartner agentic AI press release[3] · the market-shift framing procurement teams will already be planning around
- The EU AI Act Article 50 official text[7] · the regulation that's now pulling enterprise AI toward graph grounding
We won't pretend we invented this pattern. We didn't. We built the enterprise-software version of a knowledge-graph architecture that biomedicine, EU legislation, and clinical decision support have run on for decades. The fact that agentic AI now needs the same substrate is the thing that's new.
References
- [1]Blumauer, A., "From Data Exchange to Knowledge Exchange: Why Context Is the Missing Layer in Enterprise AI" · Graphwise blog, March 26, 2026. Adapted from the CDO Matters podcast (host: Malcolm Hawker, CDO at Profisee). This post extends the thesis to enterprise software product data.
- [2]Edge, D. et al., "From Local to Global: A Graph RAG Approach to Query-Focused Summarization" · Microsoft Research, arXiv:2404.16130, April 24, 2024 (revised Feb 19, 2025). Documents substantial improvements over conventional RAG on comprehensiveness and diversity on global sensemaking benchmarks.
- [3]Gartner Press Release, "Gartner Predicts Over 40% of Agentic AI Projects Will Be Canceled by End of 2027" · Gartner, Sydney, June 25, 2025. Analyst: Anushree Verma, Senior Director Analyst.
- [4]U.S. National Library of Medicine, Medical Subject Headings (MeSH) · The controlled vocabulary used since 1960 to index PubMed/MEDLINE. Reference example of a typed taxonomy operating at billion-citation scale in production.
- [5]EU Publications Office, EuroVoc · The European Union's multilingual thesaurus across 24 official EU languages, used to index legislative and policy documents.
- [6]SNOMED International, SNOMED CT · Systematized Nomenclature of Medicine · Clinical Terms. 350,000+ active concepts used in clinical decision support across 80+ countries.
- [7]Regulation (EU) 2024/1689 (EU AI Act), Article 50 · Article 50 transparency obligations applicable from August 2, 2026 per Article 113. Article 99 sets penalties (up to €15M or 3% of worldwide annual turnover for non-compliance).
