Build and manage
your structured data
for investigative
research projects

investigraph is a framework to manage collections of structured data for investigative journalism and research projects.

It allows you to stream data from different sources into a common data model to process it further for research and analysis.

Following industry standard for data engineering

investigraph helps you to set up a pipeline that can extract, transform and load data from many formats into various target systems.

Step 1

Extract

Scrape data from public websites, APIs, json or csv data dumps, or sql databases. On repeat.

Step 2

Transform

Map the source data to the common followthemoney model to create Persons, Companies, and how they are connected.

Step 3

Load

Store your datasets on your computer, in Aleph or in the cloud and share it with your collaborators

For all of these pipeline stages, the investigraph framework provides many helpers and abstraction logic to reduce the amount of code that needs to be developed for a specific dataset.

Part of an established ecosystem

investigraph is built on top of industry-standard technology and connects to well-known tools within the research landscape.

Technology

investigraph is built on top of the etl framework prefect.io written in python.

We offer an easy to understand yaml specification that requires no coding skills to get started.

Open Source

investigraph is and will always be Open Source via the MIT License.

We use investigraph to build our own public data catalogue for useful datasets for your upcoming investigations.

Adaptable

As the data follows a commonly used data model, many connections with other tools are possible, including Aleph, a research platform for investigative journalism.

Get started

smiling face

In our tutorial we show you how to extract, transform and load your data with investigraph

Tutorial

About

investigraph is developed by investigativedata.io, the data engineering agency for investigative journalism. It is available open source at github.

smiling face Write us an email