Alexandria: Unsupervised High-Precision Knowledge Base Construction using a Probabilistic Program

John WinnJohn Guiver, Sam Webster, Yordan ZaykovMartin Kukla, Dany Fabian.

doi:10.24432/C5159B

TL;DR

This paper presents a system for unsupervised, high-precision knowledge base construction using a probabilistic program to define a process of converting knowledge base facts into unstructured text.
Creating a knowledge base that is accurate, up-to-date and complete remains a significant challenge despite substantial efforts in automated knowledge base construction. In this paper, we present Alexandria -- a system for unsupervised, high-precision knowledge base construction. Alexandria uses a probabilistic program to define a process of converting knowledge base facts into unstructured text. Using probabilistic inference, we can invert this program and so retrieve facts, schemas and entities from web text. The use of a probabilistic program allows uncertainty in the text to be propagated through to the retrieved facts, which increases accuracy and helps merge facts from multiple sources. Because Alexandria does not require labelled training data, knowledge bases can be constructed with the minimum of manual input. We demonstrate this by constructing a high precision (typically 97\%+) knowledge base for people from a single seed fact.

Citation

@inproceedings{
winn2019alexandria,
title={Alexandria: Unsupervised High-Precision Knowledge Base Construction using a Probabilistic Program},
author={John Winn and John Guiver and Sam Webster and Yordan Zaykov and Martin Kukla and Dany Fabian},
booktitle={Automated Knowledge Base Construction (AKBC)},
year={2019},
url={https://openreview.net/forum?id=rJgHCgc6pX},
doi={10.24432/C5159B}
}
Gold Sponsors
Silver Sponsors
Bronze Sponsors
Chan Zuckerberg Initiative Facebook Google
Diffbot Oracle Corporation NEC
Elsevier Kenome