Section 35.3: RAG with Knowledge Graphs

Vector search finds documents that sound similar. A knowledge graph finds the cousin of the founder of the company that acquired the supplier of your competitor. One of these is more useful at the deposition.
RAG, Multi-Hop-Traversing AI Agent

Big Picture

Vector search finds documents that are semantically similar, but it cannot reason about relationships between entities. Knowledge graphs encode structured relationships (entities, relations, triples) that enable multi-hop reasoning, entity disambiguation, and relationship-aware retrieval. This section covers the knowledge-graph substrate for RAG: triple representations and RDF vs. property graph trade-offs, LLM-based knowledge-graph construction, property-graph storage and Cypher-based retrieval, graph embeddings, and hybrid KG + vector retrieval architectures. The specific GraphRAG technique (Microsoft's community-summarization-over-graph approach, plus its LazyGraphRAG and DRIFT variants) is the dedicated subject of Section 35.4; the goal here is to give you the general KG-grounded retrieval toolbox that GraphRAG and any future graph-augmented method sits on top of. The vector search limitations that motivate this whole family were introduced in Section 31.3.

Prerequisites

Knowledge graph RAG builds on the vector retrieval foundations from Section 32.1 and the advanced retrieval strategies in Section 35.1. Understanding entity relationships and graph traversal will help, though the section introduces these concepts as needed. The structured knowledge representation here complements the unstructured document processing covered in Section 31.6.

Islands connected by bridges representing entities linked by relationships in a knowledge graph — **Figure 35.3.1**: Knowledge graphs turn your documents into connected islands of facts. Each bridge is a relationship, and RAG can traverse them to find answers that span multiple hops.

What does that look like concretely? Figure 35.3.2 shows a small worked example around Albert Einstein, with each entity as a coloured node and each labelled edge as a relationship the retriever can traverse.

A small knowledge graph illustrating the entity-relationship structure.

Figure 35.3.2a: A small knowledge graph centred on Albert Einstein, showing how entities (cylinders, colour-coded by type) are connected by labelled relationships (named edges). A question like "who influenced the developer of General Relativity?" becomes a one-hop traversal: General Relativity ← developed ← Einstein → influenced_by → Newton. This is the structure RAG systems traverse when they have access to a knowledge graph instead of (or in addition to) raw text.

35.3.1 Knowledge Graph Fundamentals

A knowledge graph (KG) is a structured representation of real-world entities and the relationships between them. Unlike documents stored as unstructured text, knowledge graphs encode information as a network of nodes (entities) connected by typed edges (relations). This structured representation complements the dense vector representations used in standard semantic search. This structure enables precise queries about relationships, paths, and patterns that would require complex reasoning over unstructured text.

Key Insight

Knowledge graphs formalize a representational strategy that philosophers and logicians have explored since Aristotle's categorical syllogisms: encoding knowledge as structured relationships between entities. The modern triple format (subject, predicate, object) is a direct descendant of the Resource Description Framework (RDF), which itself draws on the predicate logic of Frege and Russell.

The power of this representation is that it supports compositional reasoning. If you know (Einstein, bornIn, Ulm) and (Ulm, locatedIn, Germany), you can infer (Einstein, bornIn, Germany) by traversing the graph. This is the same principle of transitivity that underlies deductive logic.

That compositionality is precisely where GraphRAG outperforms vector-based retrieval on multi-hop questions. Vector similarity can find documents about Einstein or about Germany, but it cannot perform the relational reasoning needed to connect them through intermediate entities. The knowledge graph provides the relational backbone that transforms associative retrieval into something closer to logical inference.

35.3.1.1 Entities, Relations, and Triples

The fundamental unit of a knowledge graph is the triple, expressed as (subject, predicate, object) or equivalently (head, relation, tail). For example, (Albert Einstein, bornIn, Ulm) encodes a single fact. A knowledge graph is simply a collection of such triples, forming a directed labeled graph where entities are nodes and relations are edge labels.

Fun Fact

Google's Knowledge Graph, launched in 2012, contained 570 million entities and 18 billion facts. Wikidata now has over 100 million items. If you ever feel overwhelmed organizing your Notion workspace, just imagine maintaining a knowledge graph where "Douglas Adams" needs edges to "author," "42," and "the answer to life, the universe, and everything."

A detective's evidence board with photos, strings, and pins connecting clues, representing GraphRAG's entity linking — **Figure 35.3.3**: GraphRAG turns your retrieval system into a detective. Pin the entities to the board, connect them with strings, and follow the relationships to crack the case.

The diagram below shows how a knowledge graph encodes entities and relationships.

GraphRAG turns your retrieval system into a detective. Pin the entities to the board, connect them with strings, and fol

Figure 35.3.4: A knowledge graph encodes entities as nodes and relationships as labeled directed edges, enabling structured queries and multi-hop reasoning.

35.3.1.2 RDF vs. Property Graphs

Tip

If you are starting a new knowledge graph project for RAG, choose Neo4j with property graphs unless you have a specific reason to use RDF (such as integration with Wikidata or compliance with Semantic Web standards). Property graphs are more intuitive for developers, Cypher is easier to learn than SPARQL, and the Neo4j ecosystem has the best LLM integration tooling available today.

Two main formalisms exist for representing knowledge graphs. RDF (Resource Description Framework) is a W3C standard that represents all knowledge as triples and uses URIs to identify entities and relations. RDF is queried using SPARQL and is the foundation of the Semantic Web (Wikidata, DBpedia). Property graphs allow nodes and edges to carry arbitrary key-value properties, offering greater flexibility. Property graphs are queried using Cypher (Neo4j) or Gremlin and are more commonly used in industry applications.

Table 35.3.1a: Feature Comparison (as of 2026).

Feature	RDF	Property Graphs
Data model	Subject-Predicate-Object triples	Nodes and edges with properties
Query language	SPARQL	Cypher, Gremlin
Edge properties	Requires reification (complex)	Native support
Standards	W3C standard, widely interoperable	No universal standard
Typical databases	Blazegraph, Virtuoso, Stardog	Neo4j, Amazon Neptune, TigerGraph
Best for	Linked data, semantic web, ontologies	Application data, recommendations

35.3.1.3 Ontology Design: Typed Entity and Relation Schemas

Before extracting a single triple, consider what happens when you do not constrain the extractor at all. Schema-free Open IE lets the LLM invent its own vocabulary, so the same fact arrives three different ways across three documents: (Ada, works_at, Acme), (Ada, employed_by, Acme), and (Ada, is_employee_of, Acme). To the graph these are three unrelated edges. A Cypher query that asks "who works at Acme?" matches the first relation and silently misses the other two, and entity nodes fragment the same way when "Acme", "Acme Corp", and "Acme Inc." all become distinct nodes. The graph is technically populated and practically unqueryable.

An ontology fixes this by pinning down the vocabulary before extraction. For our purposes an ontology has three parts. First, a controlled set of entity classes organized in an IS-A hierarchy (an Organization is a kind of Thing; a Company is a kind of Organization), so a query over Organization also reaches every Company. Second, a controlled set of typed relations, each with a domain (the entity class allowed as subject) and a range (the class allowed as object): works_at has domain Person and range Organization, so a person can work at a company but a company cannot "work at" a person. Third, cardinality and other constraints (a person has exactly one born_in country, a company may have many employees) that catch malformed facts. This domain-and-range discipline is the lineage of RDFS and OWL, the W3C ontology languages that formalized class hierarchies and typed properties for the Semantic Web; in practice you do not need a full OWL reasoner to get most of the value, and lightweight encodings such as Pydantic enums or the rdflib and owlready2 libraries are enough to declare and enforce a working schema.

Code Fragment 35.3.1c declares a tiny ontology with Pydantic and enforces it. The EntityType and RelationType enums are the controlled vocabularies, the ALLOWED_RELATIONS set encodes each valid (subject_type, relation, object_type) domain/range triple, and validate_triple rejects anything outside that set.

# A minimal KG ontology: controlled entity classes, controlled relations, and a
# domain/range table saying which (subject_type, relation, object_type) triples are legal.
# validate_triple() is the gate every extracted triple must pass before it enters the graph.
from enum import Enum


class EntityType(str, Enum):
    PERSON = "Person"
    ORGANIZATION = "Organization"
    COUNTRY = "Country"


class RelationType(str, Enum):
    WORKS_AT = "works_at"
    BORN_IN = "born_in"
    HEADQUARTERED_IN = "headquartered_in"

# Each tuple is one (domain, relation, range) rule: the only legal shapes for a triple.
ALLOWED_RELATIONS = {
    (EntityType.PERSON, RelationType.WORKS_AT, EntityType.ORGANIZATION),
    (EntityType.PERSON, RelationType.BORN_IN, EntityType.COUNTRY),
    (EntityType.ORGANIZATION, RelationType.HEADQUARTERED_IN, EntityType.COUNTRY),
}


def validate_triple(subj_type: EntityType, rel: RelationType, obj_type: EntityType) -> bool:
    """Return True only if (subj_type, rel, obj_type) is a declared domain/range rule."""
    return (subj_type, rel, obj_type) in ALLOWED_RELATIONS


# Accepted: a Person works at an Organization.
print(validate_triple(EntityType.PERSON, RelationType.WORKS_AT, EntityType.ORGANIZATION))
# Rejected: a Country cannot "work at" a Person (domain/range violation).
print(validate_triple(EntityType.COUNTRY, RelationType.WORKS_AT, EntityType.PERSON))

Output: True False

Code Fragment 35.3.1c: A lightweight Pydantic-style ontology. EntityType and RelationType are the controlled entity and relation vocabularies, ALLOWED_RELATIONS declares the legal (subject_type, relation, object_type) domain/range rules, and validate_triple accepts only conformant triples. Notice that the works_at rule accepts a Person-to-Organization edge but rejects a Country-to-Person edge, which is exactly the kind of malformed fact an unconstrained extractor would otherwise emit.

This ontology plugs directly into the KG-construction pipeline that follows. The EntityType and RelationType enums become the schema you hand to the LLM extractor: instead of free-form Open IE, you ask for constrained, JSON-schema-shaped output whose head type, relation, and tail type are drawn only from these enums, the same role played by the optional entity_types hint passed to extract_triples in Code Fragment 35.3.1b below. The validate_triple gate then runs on every extracted triple before the MERGE step in the Neo4j loader of Code Fragment 35.3.2b, so only ontology-conformant facts ever reach the graph and "works_at" stays a single, queryable relation.

Assumptions and Limits

Ontology engineering is upfront effort, and the schema is only as good as the foresight behind it: too rigid an ontology silently drops valid facts that do not fit a declared class or relation, so a relation the world contains but your enum omits simply vanishes at the validation gate. Knowledge graphs also adopt the open-world assumption (a missing triple means "unknown", not "false"), whereas a strict validator behaves closer to closed-world; budget periodic schema review to widen the ontology as new entity and relation types appear in your corpus.

35.3.2 Building Knowledge Graphs with LLMs

Traditionally, knowledge graph construction required extensive manual curation or specialized NLP pipelines for entity extraction and relation classification. LLMs have dramatically simplified this process by extracting structured triples directly from unstructured text through carefully designed prompts.

# Knowledge-graph triple extraction: ask an LLM to turn unstructured text into
# (head, relation, tail) tuples that can be loaded into Neo4j or any KG store.
import json

from openai import OpenAI

client = OpenAI()

TRIPLE_SYSTEM_PROMPT = """Extract knowledge graph triples from the text.
Return a JSON object with a "triples" array of objects with keys: head, relation, tail.
Use concise, canonical entity names. Use lowercase relation names with underscores.
Example:
{"triples": [
  {"head": "OpenAI", "relation": "developed", "tail": "GPT-4"},
  {"head": "GPT-4", "relation": "is_a", "tail": "language model"}
]}"""

def extract_triples(text: str, entity_types: list[str] | None = None) -> list[dict]:
    """Extract KG triples from `text`, optionally restricted to given entity types."""
    # Optional hint nudges the model toward the entity types we care about.
    type_hint = (
        f"\nFocus on these entity types: {', '.join(entity_types)}" if entity_types else ""
    )
    response = client.chat.completions.create(
        model="gpt-4o",
        messages=[
            {"role": "system", "content": TRIPLE_SYSTEM_PROMPT + type_hint},
            {"role": "user", "content": text},
        ],
        response_format={"type": "json_object"},
        temperature=0.0,  # Deterministic for reproducible KG ingest
    )
    result = json.loads(response.choices[0].message.content)
    return result.get("triples", [])

Code Fragment 35.3.1b: The extract_triples function uses an LLM to parse unstructured text into subject-predicate-object triples, and the KnowledgeGraphStore class wraps Neo4j operations for storing triples and querying neighborhoods. The find_neighbors method retrieves directly connected entities, while find_path discovers multi-hop relationships using variable-length path matching.

35.3.2.1 Storing Triples in Neo4j

This snippet inserts knowledge-graph triples into a Neo4j database using Cypher queries.

# Thin Neo4j wrapper: store extracted (head, relation, tail) triples as nodes
# and relationships, and expose neighborhood and path queries for retrieval.
from neo4j import GraphDatabase
class KnowledgeGraphStore:
    """Store and query knowledge graph triples in Neo4j."""
    def __init__(self, uri, user, password):
        self.driver = GraphDatabase.driver(uri, auth=(user, password))
    def add_triples(self, triples):
        """Insert triples as nodes and relationships."""
        with self.driver.session() as session:
            for triple in triples:
                session.run("""
                    MERGE (h:Entity {name: $head})
                    MERGE (t:Entity {name: $tail})
                    MERGE (h)-[r:RELATES {type: $relation}]->(t)
                    """, head=triple["head"],
                    tail=triple["tail"],
                    relation=triple["relation"])
    def find_neighbors(self, entity, max_hops=2):
        """Find all entities within N hops of the given entity."""
        with self.driver.session() as session:
            result = session.run(f"""
                MATCH path = (e:Entity {{name: $entity}})
                -[*1..{max_hops}]-(neighbor)
                RETURN path
                LIMIT 50
                """, entity=entity)
            return [record["path"] for record in result]
            def query_relationship(self, head, tail):
                """Find relationship paths between two entities."""
                with self.driver.session() as session:
                    result = session.run("""
                        MATCH path = shortestPath(
                        (h:Entity {name: $head})-[*]-(t:Entity {name: $tail})
                        )
                        RETURN path
                        """, head=head, tail=tail)
                    return [record["path"] for record in result]

Code Fragment 35.3.2b: A reproducible KG-ingest helper: extract_triples ships text plus an optional entity-type hint to gpt-4o with response_format={"type": "json_object"} and temperature=0, then parses the returned triples array into Python dicts ready for Neo4j ingest.

Note

One of the hardest challenges in LLM-powered KG construction is entity resolution: determining that "Einstein," "Albert Einstein," and "A. Einstein" all refer to the same entity. Without careful entity resolution, the graph becomes fragmented with duplicate nodes. Common approaches include embedding-based deduplication (comparing entity name embeddings), LLM-based coreference resolution, and linking to external KGs like Wikidata for canonical entity identifiers.

35.3.3 Graph Embeddings

Graph embeddings map entities and relations from a knowledge graph into continuous vector spaces, enabling similarity-based retrieval, link prediction (predicting missing triples), and integration with dense retrieval systems. The key challenge is preserving the structural properties of the graph in the embedding space.

35.3.3.1 TransE

TransE (Bordes et al., 2013) is the foundational graph embedding model. It represents relations as translations in embedding space: if (h, r, t) is a true triple, then the embedding of the head entity plus the embedding of the relation should approximately equal the embedding of the tail entity: h + r ≈ t. The model is trained by minimizing a margin-based loss that pushes correct triples closer than corrupted triples.

35.3.3.2 DistMult

DistMult (Yang et al., 2015) models relations as diagonal matrices (represented as vectors) and scores triples using a bilinear product: score(h, r, t) = h · diag(r) · t. This is equivalent to an element-wise product followed by a sum. DistMult is simple and effective but cannot model asymmetric relations (it gives the same score to (h, r, t) and (t, r, h)), which limits its applicability.

import torch
import torch.nn as nn
class TransE(nn.Module):
    """TransE knowledge graph embedding model."""
    def __init__(self, num_entities, num_relations, dim=128):
        super().__init__()
        self.entity_emb = nn.Embedding(num_entities, dim)
        self.relation_emb = nn.Embedding(num_relations, dim)
        # Initialize with uniform distribution
        nn.init.uniform_(self.entity_emb.weight, -6/dim**0.5, 6/dim**0.5)
        nn.init.uniform_(self.relation_emb.weight, -6/dim**0.5, 6/dim**0.5)
    def forward(self, heads, relations, tails):
        """Score triples: lower distance = more plausible."""
        h = self.entity_emb(heads)
        r = self.relation_emb(relations)
        t = self.entity_emb(tails)
        # TransE scoring: ||h + r - t||
        score = torch.norm(h + r - t, p=2, dim=-1)
        return score
    def margin_loss(self, pos_score, neg_score, margin=1.0):
        """Margin-based ranking loss."""
        return torch.relu(margin + pos_score - neg_score).mean()

Code Fragment 35.3.3a: Implementing Adaptive RAG with a multi-armed bandit that learns which retrieval strategy works best for different query types over time.

35.3.4 Cypher-Based Multi-Hop Retrieval

An old treasure-hunt map with nodes labelled like Wikipedia stubs ('Einstein', 'Ulm', 'Germany'), and a dotted line traversing them. A pirate-themed cartoon traveler follows the dots. — **Figure 35.3.5**: A Cypher MATCH clause is a treasure map: declare the node pattern, declare the edge pattern, and the engine traces the route through the graph.

Once entities live in a property graph, Cypher queries enable precise, multi-hop retrieval that vector search cannot replicate. A two-hop query from a drug, to the gene it targets, to the disease that gene is associated with, is a single MATCH path. The retriever side of a production system usually wraps Cypher behind a natural-language translator, so the agent or LLM only sees a tool with a query parameter and the right answers come back grounded in graph paths.

from neo4j_graphrag.retrievers import CypherRetriever
retriever = CypherRetriever(
    driver=driver,
    llm=llm,
    neo4j_schema="""
    Node labels: Drug, Disease, SideEffect, Gene
    Relationships: TREATS, CAUSES, INHIBITS, TARGETS
    """,
    )
# The retriever translates natural language to Cypher.
# Behind the scenes it generates Cypher like:
# MATCH (d:Drug {name: 'Metformin'})-[:TARGETS]->(g:Gene)
#       -[:ASSOCIATED_WITH]->(dis:Disease)
# RETURN d.name, g.name, dis.name
result = retriever.search(
    query_text="What genes does metformin target, and what diseases "
    "are those genes associated with?"
    )
for item in result.items:
    print(item.content)

Output: Metformin -[TARGETS]-> AMPK -[ASSOCIATED_WITH]-> Type 2 Diabetes Metformin -[TARGETS]-> AMPK -[ASSOCIATED_WITH]-> Metabolic Syndrome Metformin -[TARGETS]-> OCT1 -[ASSOCIATED_WITH]-> Hepatic Glucose Production Metformin -[TARGETS]-> mTOR -[ASSOCIATED_WITH]-> Cancer Cell Proliferation

Code Fragment 35.3.4a: Cypher-based multi-hop retrieval through Neo4j GraphRAG's CypherRetriever: a natural-language query is translated into a Cypher path that traverses Drug-Gene-Disease relations in a single matched query.

The same shape generalizes to RDF + SPARQL on Wikidata or DBpedia, to Amazon Neptune's openCypher, and to TigerGraph's GSQL. The pattern is always the same: declare the schema, let an LLM-backed retriever generate the path query, and return the resulting subgraph as retrieved evidence. This is the foundation that any community-summarization or graph-summarization technique (covered in Section 35.4) sits on top of.

35.3.5 Hybrid KG + Vector Retrieval

The most effective production systems combine knowledge graph traversal with vector search. The knowledge graph provides structured multi-hop reasoning and entity-aware retrieval, while vector search provides semantic similarity for unstructured content that the KG does not cover.

# Hybrid KG + vector retrieval: ask the KG for structured multi-hop neighborhoods
# around the query's entities, then stitch them with semantic search hits.
def hybrid_kg_vector_retrieve(query: str, kg_store, vector_collection, k: int = 5) -> str:
    """Merge KG traversal (structured) with vector search (semantic) into one context."""
    # 1) Pull entity mentions out of the query so we know where to anchor the KG walk.
    entities = extract_entities_from_query(query)

    # 2) KG traversal: 2-hop neighborhoods around each query entity.
    kg_context: list[str] = []
    for entity in entities:
        neighbors = kg_store.find_neighbors(entity, max_hops=2)
        kg_context.extend(format_paths(neighbors))

    # 3) Vector search picks up unstructured text the KG does not encode.
    vector_results = vector_collection.query(query_texts=[query], n_results=k)
    vector_context = vector_results["documents"][0]

    # 4) Concatenate the two into a single prompt-ready string.
    return (
        "Structured Knowledge (from Knowledge Graph):\n"
        + "\n".join(kg_context)
        + "\n\nRelevant Documents (from semantic search):\n"
        + "\n".join(vector_context)
    )

Code Fragment 35.3.5a: Combining knowledge graph lookups with vector retrieval: entities are extracted from the query, matched to graph nodes, and their triples are merged with vector search results.

Warning

GraphRAG's indexing phase is significantly more expensive than standard RAG because it requires LLM calls for every chunk (entity extraction), every entity pair (relation validation), and every community (summary generation). For a corpus of 10,000 documents, indexing can cost $50 to $500 in LLM API calls depending on the model and chunk count. However, this is a one-time cost, and the resulting graph with community summaries enables query capabilities that are impossible with vector-only RAG.

35.3.6 When to Use Knowledge Graph RAG

Table 35.3.2c: Vector RAG vs. KG-grounded RAG, as of 2026. The community-summarization variant (GraphRAG) sits on top of KG RAG and gets its own scoring in Section 35.4.

Use Case	Vector RAG	KG RAG
Simple factual Q&A	Excellent	Good
Multi-hop reasoning	Poor	Excellent
Corpus-level themes	Poor	Moderate (better with GraphRAG, see 35.4)
Entity disambiguation	Poor	Excellent
Setup complexity	Low	High
Indexing cost	Low (embedding only)	Medium (KG construction; high for GraphRAG)

Tip: Always Show Retrieved Sources to Users

Display the retrieved chunks or source documents alongside the generated answer. This builds user trust, enables fact-checking, and provides an immediate feedback signal for when retrieval goes wrong.

Real-World Scenario

Adding a Knowledge Graph to a Pharmaceutical RAG System

Who: A data scientist at a pharmaceutical company supporting drug interaction research

Situation: Researchers needed to answer multi-hop questions like "Which drugs that inhibit CYP3A4 are also contraindicated with warfarin?" across 80,000 clinical documents and FDA drug labels.

Problem: Vector-only RAG retrieved documents about CYP3A4 inhibitors or warfarin contraindications individually, but could not reliably find the intersection. Multi-hop reasoning across disconnected chunks produced hallucinated drug names.

Dilemma: Building a comprehensive drug interaction knowledge graph from scratch would take months of expert curation. Using an LLM to auto-extract triples was faster but introduced extraction errors (15% false positive rate on relation extraction).

Decision: They combined a curated seed graph from DrugBank (200K triples covering known interactions) with LLM-extracted triples from internal documents, using a confidence threshold of 0.85 and pharmacist review for borderline extractions.

How: Queries were classified as single-hop (vector search) or multi-hop (graph traversal plus vector search). For multi-hop queries, the system traversed the KG to find entity paths, then retrieved supporting text chunks for each hop to ground the answer.

Result: Multi-hop question accuracy improved from 41% to 79%. Researchers could trace the reasoning chain through explicit graph edges, which was critical for regulatory compliance.

Lesson: Knowledge graphs excel at structured, multi-hop reasoning that vector search alone cannot reliably perform. Combining curated seed data with LLM extraction (plus human review) is a practical path to building domain KGs.

Exercises

Exercise 18.5.1: KG vs. vector search Conceptual

Give an example of a query that a knowledge graph can answer but vector search cannot, and vice versa.

Show Answer

KG wins: "Which companies are competitors of our largest supplier's parent company?" (requires relationship traversal). Vector search wins: "Find documents about sustainable manufacturing practices" (semantic similarity, no specific entities).

Exercise 18.5.2: Triple extraction Conceptual

Given the sentence "Apple acquired Beats Electronics in 2014 for $3 billion," list all the knowledge graph triples you would extract.

Show Answer

(Apple, acquired, Beats Electronics), (Apple, acquisition_date, 2014), (Apple, acquisition_price, $3 billion), (Beats Electronics, acquired_by, Apple).

Exercise 18.5.3: Multi-hop reasoning Conceptual

Explain what multi-hop reasoning is and why knowledge graphs enable it better than vector search. Provide an example query that requires 3 hops.

Show Answer

Multi-hop reasoning traverses multiple relationship edges. Example: "What university did the CEO of the company that acquired Instagram attend?" Hop 1: Instagram acquired_by Facebook. Hop 2: Facebook CEO is Mark Zuckerberg. Hop 3: Zuckerberg attended Harvard. Vector search cannot chain these relationships.

Exercise 18.5.4: Cypher path queries Coding

Given a property graph with Drug, Gene, and Disease nodes and TARGETS and ASSOCIATED_WITH edges, write a Cypher query that returns, for a given drug name, all diseases that are associated with any gene the drug targets. Then describe how an CypherRetriever would translate the natural-language form of this question into your query.

Answer Sketch

MATCH (d:Drug {name: $drug})-[:TARGETS]->(g:Gene)-[:ASSOCIATED_WITH]->(dis:Disease) RETURN dis.name, g.name. The retriever gets the schema (node labels + relation types), prompts an LLM with a few-shot translation example, fills in the entity from the question, and runs the resulting Cypher. The agent sees only the returned disease names, not the query.

Exercise 18.5.5: Hybrid retrieval Conceptual

Design a hybrid pipeline that uses both vector search and knowledge graph traversal. How do you decide which retriever to invoke for a given query?

Show Answer

Use a query classifier: entity-centric queries (containing named entities and relationship words) go to the KG. Semantic/topical queries go to vector search. Complex queries can invoke both and merge results with source attribution.

Exercise 18.5.6: KG construction Conceptual

Use an LLM to extract entities and relationships from 10 paragraphs of text. Store them as a NetworkX graph and visualize the resulting knowledge graph.

Exercise 18.5.7: Graph querying Coding

Given a knowledge graph built from a company's Wikipedia page, implement a function that answers multi-hop queries like "Who founded the company that acquired X?"

Exercise 18.5.8: Coreference resolution Coding

Implement a two-pass entity-resolution step over an LLM-extracted graph: first cluster surface forms by embedding cosine similarity (threshold 0.92), then ask an LLM to confirm the borderline pairs (0.85 to 0.92) before merging. Report the merge rate and the impact on multi-hop traversal recall.

What Comes Next

In the next section, Section 32.3: Deep Research & Agentic RAG, we explore deep research and agentic RAG patterns, where models actively plan and execute multi-step retrieval strategies.

Further Reading

Pan, S. et al. (2024). "Unifying Large Language Models and Knowledge Graphs: A Roadmap." IEEE TKDE. A comprehensive roadmap for integrating LLMs with knowledge graphs. Covers synergies, challenges, and future directions. Useful for anyone combining structured and unstructured knowledge.

Edge, D. et al. (2024). "From Local to Global: A Graph RAG Approach to Query-Focused Summarization." arXiv preprint. Introduces Microsoft GraphRAG, which builds community-level summaries from knowledge graphs. Demonstrates superior performance on global questions over traditional RAG. Key paper for graph-enhanced retrieval.

Baek, J. et al. (2023). "Knowledge-Augmented Language Model Prompting for Zero-Shot Knowledge Graph Question Answering." arXiv preprint. Shows how to prompt LLMs with knowledge graph information without fine-tuning. A practical approach for leveraging structured knowledge. Useful for practitioners working with existing KGs.

Neo4j Graph Database Documentation. The leading graph database platform with native support for Cypher queries and graph algorithms. Widely used for knowledge graph storage in RAG systems. Recommended for teams building graph-backed retrieval.

Microsoft GraphRAG. Open-source implementation of the GraphRAG approach for building knowledge graphs from documents. Includes community detection and summary generation. A practical starting point for graph-based RAG projects.

Wishart, D. et al. (2018). "DrugBank 5.0: A Major Update to the DrugBank Database." Nucleic Acids Research. A widely-used biomedical knowledge graph demonstrating real-world KG applications. Provides a concrete example of domain-specific structured data. Relevant for healthcare and life sciences RAG use cases.