The visualizations of the Dive into Intangible Cultural Heritage project are built on an extensive open dataset. The dataset is updated once a year, after the session of the Intergovernmental Committee for the Safeguarding of the Intangible Cultural Heritage (ICH), and is released by UNESCO in accordance with the principles and core values of open science.
Overview
The dataset is organized as a semantic graph, where nodes (also known as vertices) are connected by edges.
The most important nodes in this graph are the elements inscribed in the three ICH lists (Representative List, Urgent Safeguarding List and the Register of Good Safeguarding Practices). Other types of nodes include projects, NGOs, countries, regions, World Heritage Convention elements, case studies, scientific publications and Sustainable Development Goals. Another node type acts as a metaphorical glue between all these vertices, giving structure and meaning to the whole graph: this is the concept type. These concept terms are taken from various thesauri, but mostly from the official UNESCO thesaurus and a specialized vocabulary curated by the ICH Secretariat. For example, both the Royal ballet of Cambodia and the Capoeira Circle elements are connected to the Dance concept.
The complete graph structure is detailed below.
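As a rough illustration of how a concept node ties elements together, the sketch below builds a toy graph in Python. The node IDs (`e1`, `e2`, `c1`) and the simple pair-based edge list are hypothetical simplifications chosen for this example; they are not the dataset's real identifiers or its edge format.

```python
from collections import defaultdict

# Toy data for illustration only: the IDs and the (element, concept) pair
# layout are hypothetical, not the dataset's actual identifiers or format.
nodes = {
    "e1": {"type": "element", "label": "Royal ballet of Cambodia"},
    "e2": {"type": "element", "label": "Capoeira Circle"},
    "c1": {"type": "concept", "label": "Dance"},
}
edges = [("e1", "c1"), ("e2", "c1")]  # (element ID, concept ID)

# Index element labels by the concept they are attached to.
elements_by_concept = defaultdict(list)
for element_id, concept_id in edges:
    elements_by_concept[concept_id].append(nodes[element_id]["label"])

dance_elements = elements_by_concept["c1"]
print(dance_elements)
```

Both elements end up under the same concept key, which is what lets a visualization jump from one element to another through a shared concept.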
Downloads
The full dataset is available as a JSON graph. For convenience, subsets of the data are also provided as self-explanatory CSV tables.
Description | English | French | Spanish | Format |
---|---|---|---|---|
Full dataset | Download | Download | Download | JSON |
Sub dataset - Constellation | Download | Download | Download | CSV |
Sub dataset - Sustainable Development Goals | Download | Download | Download | CSV |
Sub dataset - SDG #1: No poverty | Download | Download | Download | CSV |
Sub dataset - SDG #2: Zero hunger | Download | Download | Download | CSV |
Sub dataset - SDG #3: Good health and well-being | Download | Download | Download | CSV |
Sub dataset - SDG #4: Quality education | Download | Download | Download | CSV |
Sub dataset - SDG #5: Gender equality | Download | Download | Download | CSV |
Sub dataset - SDG #6: Clean water and sanitation | Download | Download | Download | CSV |
Sub dataset - SDG #7: Affordable and clean energy | Download | Download | Download | CSV |
Sub dataset - SDG #8: Decent work and economic growth | Download | Download | Download | CSV |
Sub dataset - SDG #9: Industry, innovation and infrastructure | Download | Download | Download | CSV |
Sub dataset - SDG #10: Reduced inequalities | Download | Download | Download | CSV |
Sub dataset - SDG #11: Sustainable cities and communities | Download | Download | Download | CSV |
Sub dataset - SDG #12: Responsible consumption and production | Download | Download | Download | CSV |
Sub dataset - SDG #13: Climate action | Download | Download | Download | CSV |
Sub dataset - SDG #14: Life below water | Download | Download | Download | CSV |
Sub dataset - SDG #15: Life on land | Download | Download | Download | CSV |
Sub dataset - SDG #16: Peace, justice and strong institutions | Download | Download | Download | CSV |
Sub dataset - SDG #17: Partnerships for the goals | Download | Download | Download | CSV |
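Once a file has been downloaded, it can be read with the Python standard library. The following is a minimal sketch, assuming the files are UTF-8 encoded and the CSV tables are comma-separated with a header row; neither detail is stated on this page. The helper names `load_graph` and `load_table` are hypothetical.

```python
import csv
import json
from pathlib import Path


def load_graph(path):
    """Read the full JSON dataset into a Python dict."""
    # Assumption: the file is UTF-8 encoded.
    with Path(path).open(encoding="utf-8") as handle:
        return json.load(handle)


def load_table(path):
    """Read one CSV subset as a list of dicts keyed by the header row."""
    # Assumption: comma-separated values with a header row.
    with Path(path).open(encoding="utf-8", newline="") as handle:
        return list(csv.DictReader(handle))
```

Calling `load_graph` on the path of the downloaded JSON file returns the whole graph as nested dicts and lists, while `load_table` returns one dict per CSV row, so the column names do not need to be known in advance.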
Structure
The graph is represented as a JSON object with three main sections. The `meta` key holds generic information such as the language and the timestamp of the dataset. The `nodes` object contains detailed information about each vertex. At a minimum, a node is defined by a `type` and a `label`. Additional keys may be available depending on the type. For example, the `concept` type has an additional `group` field that maps to an identified thesaurus. Most other types define a `meta` object that further describes the item. The full list of properties can be found below. Finally, the `edges` array links all the vertices together in the form of RDF triples. Each edge carries one property beyond the traditional triple: the `weight` key, which indicates the relative importance of a connection.
{
  "meta": {  # graph metadata
    "language": <string>,  # en|fr|es|ar
    "generated": <string>  # YYYY-MM-DD HH:MM:SS
  },
  "nodes": {  # nodes main object
    <id>: {  # node ID
      "type": <string>,  # element|project|ngo|country|region|concept|whc|casestudy|publication|sdg
      "group": <string>,  # unesco|main|biome|nature|threat|domain (concept)
      "label": <string>,  # node title
      "meta": {  # node metadata, type-dependent, displayed on click or tap (popover)
        "icon": {  # node main image URLs (element, casestudy)
          "small": <string>,  # small version
          "large": <string>  # large version
        },
        "description": <string>,  # description (element, project, ngo, whc, casestudy, publication)
        "list": <string>,  # RL|USL|GSP (element)
        "year": <integer>,  # inscription year (element)
        "multinational": <boolean>,  # whether it is linked to more than one country (element)
        "link": <string>,  # external link (element, project, ngo, whc, casestudy, publication)
        "language": <string>,  # language of bibliography entry (publication)
        "paper": <string>,  # full paper URL (publication)
        "keywords": [  # keywords (publication)
          <string>,
          ...
        ],
        "images": [  # list of associated images (element, casestudy)
          {
            "url": <string>,  # image URL
            "copyright": <string>,  # copyright
            "title": <string>  # title
          },
          ...  # up to 10 images, ordered by relevance
        ],
        "video": {  # associated YouTube URL (element)
          "url": <string>,  # video URL
          "copyright": <string>,  # copyright
          "title": <string>  # title
        },
        "sustainability": <string>,  # information about sustainability (element)
        "considerations": <string>  # considerations for inclusion (sdg)
      }
    },
    ...
  },
  "edges": [  # edges main array
    {
      "subject": <id>,  # source node ID
      "predicate": <string>,  # broader|narrower|related|exactMatch|closeMatch|primeExampleOf
      "object": <id>,  # target node ID
      "weight": <integer>  # 3 (primary concepts) | 2 (secondary concepts) | 1 (all other nodes)
    },
    ...
  ]
}
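To make the layout above more concrete, here is a minimal Python sketch that counts nodes by type and lists the neighbours of a node, filtered by edge weight. The sample graph is hand-written for illustration: its IDs, labels and values are made up and do not come from the real dataset. The helper names `count_by_type` and `neighbours` are hypothetical, and following edges in both directions is an assumption of this sketch, not something the format prescribes.

```python
from collections import Counter

# Tiny hand-written sample that follows the layout above. IDs, labels and
# values are invented for illustration only.
graph = {
    "meta": {"language": "en", "generated": "2024-01-01 00:00:00"},
    "nodes": {
        "n1": {"type": "element", "label": "Example element",
               "meta": {"list": "RL", "year": 2008}},
        "n2": {"type": "concept", "group": "unesco", "label": "Dance"},
        "n3": {"type": "concept", "group": "main", "label": "Music"},
        "n4": {"type": "country", "label": "Example country"},
    },
    "edges": [
        {"subject": "n1", "predicate": "related", "object": "n2", "weight": 3},
        {"subject": "n1", "predicate": "related", "object": "n3", "weight": 2},
        {"subject": "n1", "predicate": "related", "object": "n4", "weight": 1},
    ],
}


def count_by_type(graph):
    """Count how many nodes exist for each node type."""
    return Counter(node["type"] for node in graph["nodes"].values())


def neighbours(graph, node_id, min_weight=1):
    """Return labels of nodes linked to node_id by edges of at least min_weight.

    Assumption: edges are followed in both directions.
    """
    found = []
    for edge in graph["edges"]:
        if edge["weight"] < min_weight:
            continue
        if edge["subject"] == node_id:
            found.append(graph["nodes"][edge["object"]]["label"])
        elif edge["object"] == node_id:
            found.append(graph["nodes"][edge["subject"]]["label"])
    return found


type_counts = count_by_type(graph)
primary = neighbours(graph, "n1", min_weight=3)
print(type_counts["concept"], primary)
```

Raising `min_weight` to 3 keeps only the connections marked as primary concepts, which is one way the `weight` key can be used to thin out a dense graph before drawing it.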