Evaluating Open Information Extraction on Scientific and Medical Text

This dataset is the result of applying crowd sourcing to the extractions of two open information extraction tools (Open IE 4 and MinIE) linked below. Extractions were performed on both a set of random sentences from Wikipedia and randomly selected sentences from the OA-STM corpus. The aim is to evaluate the effectiveness of open information extraction tools on scientific and medical text. The initial datasets, the code for applying information, the HITS, labelling instructions, and analysis code are all included above.