.. _data_sources: Data sources ============ To be part of the backbone of a model build, all data sources have to fulfill the following requirements: - Coverage: Must be available for most if not all countries in the world. - Openness: The data must be free, open and cannot impose excessive red-tape for its use. - Trustworthiness: The science and/or practices behind the development of data source must be technically transparent and robust, and have documented use in research and/or practice. Road network ------------ All the road network data is obtained directly from `Open-Street Maps (OSM) `_, which is the de-facto standard for open transportation network data. More information about it can be found on its `website `_. Political Subdivisions ---------------------- Political Subdivisions can be downloaded from either `GADM `_ or `geoBoundaries `_. Both databases are open, and provide information with respect to countries and territories, and the existing political subdivisions that might exists. Point-of-Interest ----------------- A Point-of-Interest (POI) represents a particular feature in the space, like churches, schools, pubs, or tourist attractions. It does not necessarily have to be a point, but can also be other OSM elements, such as nodes or ways. It is worth reminding that the "interest" part should not be considered too literally, once a feature can be quite ordinary, such as a postbox. In this case, these features are usually considered ``amenity``. Amenities are useful and import facilities for visitors and residents. More information on amenity values in OSM, is available `in this page `_. A building can also be a POI. The OSM building tag is used to identify individual buildings or groups of connected buildings, and can assume several values. More information on building values can be found in `this page `_. Building footprints ------------------- Building footprints can also be downloaded from OSM or from data generated through image recognition by `Microsoft `_. Population ---------- There are currently two data sources of spatialized population that are suitable for use in this project. 1. `Meta High Resolution Population density Maps `_ 2. `World Pop `_ Both data sources provide population estimates at a very high resolution (down to a grid of 100x100m) and are based on extensive research and offer similar results at more aggregate levels (small cities or neighborhoods within a large city), so they are equivalent for the purposes of building models for large areas. However, the World Pop data includes estimates with finer detail with respect to population per age bracket, which can be crucial when analyzing access to hospital infrastructure, schools and the labor market, so we have decided to use that data source. The `list of links for download `_ while the data for the total population is listed on a `separate file `_. All the data is downloaded from the `Humanitarian Data Exchange `_, which provides a convenient interface for data download.