Abstract
Proximity to family and household composition and structure are often studied both as outcomes and as explanatory factors across a wide range of scientific disciplines. Here, we describe a large longitudinal dataset, currently covering more than 70,000 individuals from 2004 to 2017, with information on household structure, proximity to kin, population density, and other socio-demographic factors. The dataset is derived from the Karonga Health and Demographic Surveillance Site (HDSS) in Northern Malawi. We describe how the dataset is generated, give examples of how it can be used, and outline the limitations that affect the types of analyses that can be carried out.