This document is a collection of procedures and resources to guide the curation of Data Completeness instances (henceforth, Data Grids), which a sysadmin can activate for any location page on HDX. This document, and the others linked from it, should evolve to capture best practices and any other useful information learned as the Data Grid curators do their work.

Once activated for a given location page, the Data Grid appears and is filled using a default recipe based on tags. However, tags are seldom enough to accurately gauge whether a dataset meets the requirements of a given Data Grid. Curation, then, is the process of customizing a specific location's Data Grid so that the datasets it includes meet the defined requirements for each subcategory. That customization is done by editing the recipe YAML file (YAML is a format that is friendly to both humans and machines).
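To make the difference between the default tag-based recipe and a curated one concrete, here is a minimal Python sketch. It is an illustration only: the field names (tags, include, exclude), the select_datasets helper, and the dataset names are all hypothetical and are not taken from the actual recipe schema, which should be checked in the Github Repository.

```python
from dataclasses import dataclass, field


@dataclass
class Dataset:
    name: str
    tags: set


@dataclass
class SubcategoryRecipe:
    # Hypothetical fields; the real recipe YAML may be structured differently.
    tags: set = field(default_factory=set)     # default, tag-based matching
    include: set = field(default_factory=set)  # datasets a curator adds by name
    exclude: set = field(default_factory=set)  # datasets a curator removes by name


def select_datasets(datasets, recipe):
    """Return the names of datasets a subcategory would list, in input order."""
    selected = []
    for ds in datasets:
        if ds.name in recipe.exclude:
            continue  # curator decided this dataset does not meet the requirements
        if ds.name in recipe.include or ds.tags & recipe.tags:
            selected.append(ds.name)
    return selected


# Made-up datasets for illustration only.
datasets = [
    Dataset("admin-boundaries-2019", {"administrative divisions"}),
    Dataset("old-boundary-sketch", {"administrative divisions"}),
    Dataset("official-gazetteer", {"gazetteer"}),
]

# Default recipe: tags only, so anything carrying the tag is listed.
default_recipe = SubcategoryRecipe(tags={"administrative divisions"})

# Curated recipe: same tags, plus explicit curator decisions.
curated_recipe = SubcategoryRecipe(
    tags={"administrative divisions"},
    include={"official-gazetteer"},
    exclude={"old-boundary-sketch"},
)

default_result = select_datasets(datasets, default_recipe)
curated_result = select_datasets(datasets, curated_recipe)
print(default_result)
print(curated_result)
```

In this sketch the tag match alone lets in a dataset that a curator would reject, and misses one that a curator would accept. The explicit include and exclude lists correct both cases.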

Resources

Procedure document (this document)
Data Completeness Definitions Document
Quality Checklist (below)
YAML editing examples (below)
Github Repository

Process Overview

The basic curation process is outlined below:

Data Grid Instances to be Curated

There may be more on the feature server for testing purposes, but the ones listed below should be the only active ones on the production server.

Production Data Grid	Feature Server Data Grid	Curator(s)
Production: yem	Feature: yem

Quality Checklist

Each dataset that is a candidate for a Data Grid has to be evaluated

Data Grid (Data Completeness) Curation Procedures
