The (Anti-)Corruption Corpus currently is being beta tested.
If you received login information, you can access the corpus on its testing server.

The feedback stage will conclude March 15, 2020, with public release slated for later in the year.
To stay up-to-date on public release information, please subscribe to the GI-ACE newsletter.

Thank you for your interest!

What is the (Anti-)Corruption Corpus? 

The (Anti-)Corruption Corpus is an open-source database  for exploring comprehensive mapping of scientific literature related to corruption.The tool  is designed to facilitate greater access to published research on corruption and anti-corruption. It complements the Global Integrity Anti-Corruption Evidence (GI-ACE) Research Programme Theory of Change, which emphasizes the need to promote academic research and support its use by practitioners.

Who has contributed to the (Anti-)Corruption Corpus? 

Since February 2014, Matthew Stephenson, the Eli Goldston Professor of Law at Harvard Law School, has made available through his Global Anticorruption Blog a PDF of his extensive bibliography of sources on (anti-)corruption. GI-ACE undertook the challenge—with Professor Stephenson’s agreement—of converting the bibliography into an electronically searchable tool for researchers and practitioners, alike. To do this, GI-ACE enlisted the help of Steven Gawthorpe, data scientist and founding member of the Interdisciplinary Corruption Research Network (ICRN). Working from Professor Stephenson’s bibliography, Gawthorpe used extensive natural language processing to generate the initial corpus. By including, where available, abstracts and links to open-source materials, the database will allow researchers and a wide range of practitioners to  more easily search and identify the sources that are relevant to their topic. 

What is included in the Corpus? 

With more than 21,000 sources (inclusive of the more than 7,000 sources in the latest iteration of Stephenson’s bibliography) dating back to the 1800s, this (Anti-)Corruption Corpus will serve as an essential tool to advance work in the field. However, the database is likely still missing items that should be included. If you have suggestions for additional resources, or about ways this tool could be made more useful in future iterations, please contact us.

Disciplines The various scientific disciplines are defined from imported bibliographic data from SCOPUS, Clarivate Analytics’ Web of Science. These databases provide their own disciplinary categories for scientific research.


The GI-ACE Research Programme team would like to thank all of the researchers taking their time to view the production of this application. Suggestions for missing anti-corruption references may be submitted for consideration through the GI-ACE feedback form.

Special thanks and gratitude to Matthew Stephenson, Steven Gawthorpe, Joseph Pozsgai-Alvarez, and Alan Hudson, without whom this project would not have been possible.