Knowledge Graphs in Journalism: Connecting People, Places, and Events

Knowledge graphs map entities and their relationships, enabling journalists to discover hidden connections in complex stories. Learn how knowledge graphs power AI newsroom intelligence.

By Omniscient AI Editorial Team Published 20 March 2026 Updated 1 April 2026 8 min read

knowledge graphentity extractionjournalism AIinvestigative toolsNLP journalism

What Is a Knowledge Graph?

A knowledge graph is a structured representation of entities (people, organisations, locations, events) and the relationships between them, stored in a format that enables efficient traversal and query — discovering connections that would be invisible in unstructured text. In journalism, knowledge graphs enable investigators to ask questions like "which individuals appear in both Dataset A and Dataset B?" or "which companies share directors with Company X?" — questions that human analysis of raw documents cannot answer at scale.

Google's Knowledge Graph, which powers the information panels that appear in Google search results, is the most widely known example — but newsroom knowledge graphs are purpose-built for investigative and intelligence use cases, integrating news archive data, public records, corporate filings, and investigative datasets into a single queryable network.

Knowledge Graphs in Major Investigations

The ICIJ's (International Consortium of Investigative Journalists) Linkurious-powered graph visualisation was central to the Panama Papers and Paradise Papers investigations — enabling analysts to map relationships between hundreds of thousands of offshore entities, directors, and beneficial owners across 11.5 million documents. Without graph-based analysis, these investigations would have been computationally impossible.

Neo4j, a commercial graph database, has become the standard tool for investigative journalism knowledge graphs. Its Cypher query language enables journalists to express relationship queries naturally ("find all paths of length ≤3 between Person A and Corporation B") and its visualisation tools make network analysis accessible to reporters who are not data scientists.

AI and Knowledge Graph Construction

The most time-consuming aspect of knowledge graph journalism has historically been entity extraction and relationship mapping from unstructured text — a process that required substantial manual annotation. Modern NLP tools and LLMs have dramatically reduced this barrier. Named entity recognition (NER) models can automatically extract people, organisations, locations, and events from large document corpora. LLMs can extract relationship assertions from text ("X served as director of Y from 2018 to 2022") that can be structured into graph edges. Omniscient AI's newsroom intelligence layer uses entity extraction and relationship mapping to maintain a continuously updated knowledge graph of the news sources, journalists, organisations, and topics it monitors.

Frequently Asked Questions

What is a knowledge graph in journalism?

A journalism knowledge graph is a structured database of entities (people, companies, locations, events) and their relationships, enabling journalists to discover hidden connections — shared directors, overlapping networks, related events — across large document corpora that manual analysis cannot process at scale.

What database tool is used for journalism knowledge graphs?

Neo4j is the most widely used graph database for journalism investigations. ICIJ used Neo4j with Linkurious visualisation software for the Panama Papers and Pandora Papers investigations. Other options include AWS Neptune, Google Knowledge Graph, and open-source alternatives like Apache TinkerPop.

What is named entity recognition (NER)?

Named entity recognition (NER) is an NLP technique that automatically identifies and classifies named entities — people, organisations, locations, dates, financial values — in unstructured text. It is the foundational step in knowledge graph construction from news archives and document corpora.

What was the ICIJ's role in the Panama Papers?

ICIJ (International Consortium of Investigative Journalists) coordinated the Panama Papers investigation, managing data sharing and coordination among 400+ journalists in 80+ countries, and providing the technical infrastructure including Nuix document processing and Neo4j/Linkurious graph analysis that enabled systematic exploration of 11.5 million documents.

How does Omniscient AI use knowledge graphs?

Omniscient AI's newsroom intelligence platform maintains entity relationships extracted from its continuously indexed source corpus — mapping journalists, publications, topics, and organisations to surface connection intelligence for editorial teams. This enables journalists to understand which sources have covered a topic before, which experts are cited on specific issues, and how story networks evolve over time.

What Is a Knowledge Graph?

Knowledge Graphs in Major Investigations

AI and Knowledge Graph Construction

Frequently Asked Questions

Related Articles

The Modern Newsroom Tech Stack in 2026

Vector Search in Newsrooms: Finding Hidden Connections in Your Archive

AI Transcription for Journalists: Tools, Accuracy, and Best Practices