This repository contains two knowledge graph datasets: NATION and UMLS.
- NATION contains relations between countries and consists of 14 entities (countries) and 56 relations.
- UMLS is a biomedical ontology dataset and consists of 135 entities and 49 relations.
Both datasets consist of three tab-separated files:
- entities.txt: List of entities, formatted as `entity_id \tab entity_name`
- relations.txt: List of relations, formatted as `relation_id \tab relation_name`
- triples.txt: List of known facts, formatted as `relation_id \tab entity_id1 \tab entity_id2`
  - means `entity_id1` has the `relation_id` relation with `entity_id2`

Entity IDs and relation IDs both start from 0.
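
The file layout described above can be read with a few lines of Python. The following is a minimal sketch, not part of this repository: the function names `load_id_map`, `load_triples`, and `load_dataset` are hypothetical, and it assumes the files are UTF-8 encoded, have no header row, and sit together in one directory per dataset.

```python
from pathlib import Path


def load_id_map(path):
    """Read an `id<TAB>name` file into a dict mapping int id -> name."""
    id_to_name = {}
    with open(path, encoding="utf-8") as f:
        for line in f:
            line = line.rstrip("\r\n")
            if not line:
                continue  # skip blank lines
            idx, name = line.split("\t", 1)
            id_to_name[int(idx)] = name
    return id_to_name


def load_triples(path):
    """Read `relation_id<TAB>entity_id1<TAB>entity_id2` lines as int tuples."""
    triples = []
    with open(path, encoding="utf-8") as f:
        for line in f:
            if not line.strip():
                continue  # skip blank lines
            relation_id, entity_id1, entity_id2 = (int(x) for x in line.split("\t"))
            triples.append((relation_id, entity_id1, entity_id2))
    return triples


def load_dataset(directory):
    """Load the three files of one dataset from `directory` (assumed layout)."""
    directory = Path(directory)
    entities = load_id_map(directory / "entities.txt")
    relations = load_id_map(directory / "relations.txt")
    triples = load_triples(directory / "triples.txt")
    return entities, relations, triples
```

With the files in place, `entities, relations, triples = load_dataset(path_to_dataset)` returns two id-to-name dictionaries and a list of `(relation_id, entity_id1, entity_id2)` tuples, where `path_to_dataset` stands for wherever the dataset directory lives. Because the IDs start from 0, they can be used directly as indices into arrays or embedding tables.
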
Additionally, the UMLS dataset contains one more file, entity_category.txt, which gives the group name of each entity in UMLS
and is formatted as `entity_id \tab entity_category`.
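
As an illustration of how the category file might be used, the sketch below reads it and groups entity IDs by category. The function names `load_entity_categories` and `group_by_category` are hypothetical, and the code assumes one `entity_id \tab entity_category` pair per line in a UTF-8 file with no header row.

```python
from collections import defaultdict


def load_entity_categories(path):
    """Read `entity_id<TAB>entity_category` lines into a dict of int id -> category."""
    entity_to_category = {}
    with open(path, encoding="utf-8") as f:
        for line in f:
            line = line.rstrip("\r\n")
            if not line:
                continue  # skip blank lines
            entity_id, category = line.split("\t", 1)
            entity_to_category[int(entity_id)] = category
    return entity_to_category


def group_by_category(entity_to_category):
    """Invert the mapping: category -> sorted list of entity ids."""
    groups = defaultdict(list)
    for entity_id, category in sorted(entity_to_category.items()):
        groups[category].append(entity_id)
    return dict(groups)
```

Grouping by category is one possible use, for example to inspect which entities share a group before training or evaluating a model.
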