Dataset / Tabular

Refugee and Host Household Survey in Nairobi, 2021 (Kenya)

Abstract

The World Bank, in collaboration with the Joint Data Center on Forced Displacement, the Kenya National Bureau of Statistics (KNBS) and the United Nations High Commissioner for Refugees (UNHCR), conducted a cross-sectional survey of refugee and host populations living in Nairobi. The survey was based on the Kenya Continuous Household Survey (KCHS) and targeted both host populations and refugees living in Nairobi. Through a participatory training format, enumerators learned how to collect quality data specific to refugees as well as nationals. Daily data quality monitoring dashboards were produced during the data collection period to provide feedback to the field team and correct possible errors. The data were collected through computer-assisted personal interviewing (CAPI) using the Survey Solutions software developed by the World Bank; this ensured high standards of data storage, protection and pre-processing.

The sample is representative of refugees and other residents living in Nairobi. The refugee sample was drawn from UNHCR’s database of refugees and asylum seekers (proGres) using implicit stratification by sub-county and country of origin. The host community sample was drawn using a two-stage cluster design. In the first stage, eligible enumeration areas (EAs) based on the 2019 Population and Housing Census were selected. In the second stage, 12 households were sampled from each selected EA. The survey differentiates between two types of host communities. ‘Core’ host communities were drawn from EAs located within the three areas with the largest number of refugee families: Kasarani, Eastleigh North and Kayole. At least 10 percent of the Nairobi refugee families reside in each of these areas. ‘Wider’ host communities cover the rest of the Nairobi population and were drawn from EAs outside these three areas.
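
The two sampling approaches described above can be illustrated with a minimal Python sketch. It is not the survey team's procedure: the function names, the record fields (`sub_county`, `country`) and the use of simple random selection of EAs in the first stage are assumptions made for illustration, since the abstract does not state how EAs were selected or what the sample sizes per stratum were. Implicit stratification is shown in its common form, a systematic sample taken from a list sorted by the stratification variables.

```python
import random


def implicit_stratified_sample(records, sort_keys, n, rng):
    """Systematic sample of n records from a list sorted by sort_keys.

    Sorting first spreads the sample across the sort variables
    (here, hypothetically, sub-county and country of origin).
    """
    if not 0 < n <= len(records):
        raise ValueError("n must be between 1 and the number of records")
    ordered = sorted(records, key=lambda r: tuple(r[k] for k in sort_keys))
    step = len(ordered) / n        # sampling interval
    start = rng.random() * step    # random start within the first interval
    last = len(ordered) - 1
    return [ordered[min(int(start + i * step), last)] for i in range(n)]


def two_stage_cluster_sample(households_by_ea, n_eas, per_ea, rng):
    """Select n_eas enumeration areas, then per_ea households in each.

    Assumption: EAs are chosen by simple random sampling; the source
    does not say which selection method was used in the first stage.
    """
    selected_eas = rng.sample(sorted(households_by_ea), n_eas)
    sample = {}
    for ea in selected_eas:
        households = households_by_ea[ea]
        sample[ea] = rng.sample(households, min(per_ea, len(households)))
    return sample


# Synthetic illustration only; none of these values come from the survey.
rng = random.Random(2021)
registry = [
    {"id": i, "sub_county": f"SC{i % 4}", "country": f"C{i % 3}"}
    for i in range(100)
]
refugee_sample = implicit_stratified_sample(
    registry, ["sub_county", "country"], 10, rng
)
frame = {f"EA{e}": [f"EA{e}-HH{h}" for h in range(20)] for e in range(5)}
host_sample = two_stage_cluster_sample(frame, n_eas=3, per_ea=12, rng=rng)
```

The second-stage draw of 12 households per EA matches the description above; everything else in the example data is synthetic.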

For a subset of households, a women’s empowerment module was administered by a trained female enumerator to one randomly selected woman aged 15 to 49 in each household.

The data set contains two files. hh.dta contains household level information. The ‘hhid’ variable uniquely identifies all households. hhm.dta contains data at the level of the individual for all household members. Each household member is uniquely identified by the variable ‘hhm_id’.
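
As a sketch of how the two files might be used together, the following Python example checks the identifiers described above and attaches household-level information to each member record. It assumes that hhm.dta also carries the ‘hhid’ variable so that members can be linked to their household; the abstract does not confirm this, and the function names are illustrative.

```python
import pandas as pd


def load_survey(hh_path="hh.dta", hhm_path="hhm.dta"):
    """Read the household and household-member Stata files."""
    hh = pd.read_stata(hh_path)
    hhm = pd.read_stata(hhm_path)
    return hh, hhm


def link_members_to_households(hh, hhm, hh_key="hhid", member_key="hhm_id"):
    """Attach household-level columns to each member record.

    Assumption: the member file contains the household key (hh_key).
    """
    if not hh[hh_key].is_unique:
        raise ValueError(f"{hh_key} does not uniquely identify households")
    if not hhm[member_key].is_unique:
        raise ValueError(f"{member_key} does not uniquely identify members")
    # validate= makes pandas raise if the link is not many members to one household
    return hhm.merge(
        hh, on=hh_key, how="left", validate="many_to_one", suffixes=("", "_hh")
    )
```

With the files in the working directory, a typical call would be `hh, hhm = load_survey()` followed by `link_members_to_households(hh, hhm)`.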

This cross-sectional survey was conducted between May 22 and July 27, 2021. It comprises a sample of 4,853 households in total, of which 2,420 are refugee households and 2,433 are host households.