Title: | Small Area Estimation for Key Health and Demographic Indicators from Household Surveys |
---|---|
Description: | Enables small area estimation (SAE) of health and demographic indicators in low- and middle-income countries (LMICs). It powers an R 'shiny' application that helps public health analysts, policymakers, and researchers generate subnational estimates and prevalence maps for 150+ binary indicators from Demographic and Health Surveys (DHS). Basing its core SAE analysis workflow on the 'surveyPrev' package, the app ensures methodological rigor through guided model selection, automated fitting, and interactive visualization. For more details, visit <https://sae4health.stat.uw.edu/>. |
Authors: | Yunhan Wu [cre, aut], Qianyu Dong [aut], Zehang R Li [aut], Jon Wakefield [aut] |
Maintainer: | Yunhan Wu <[email protected]> |
License: | AGPL-3 |
Version: | 1.2.1 |
Built: | 2025-03-30 20:23:16 UTC |
Source: | https://github.com/wu-thomas/sae4health |
The sae4health package powers an R Shiny app designed for small area estimation (SAE) of health and demographic indicators in low- and middle-income countries (LMICs). It enables subnational estimation and prevalence mapping for more than 150 binary indicators derived from Demographic and Health Surveys (DHS), providing an intuitive interface for public health analysts, policymakers, and researchers.
Built on the surveyPrev package, sae4health ensures methodological rigor in SAE analysis. It offers guided model selection, automated model fitting, and interactive visualization, making advanced statistical methods accessible to non-experts.
For comprehensive documentation on the sae4health project and web-based app access, visit: https://sae4health.stat.uw.edu/
The latest development version of the package is maintained at: https://github.com/wu-thomas/sae4health
Yunhan Wu [Maintainer] ([email protected])
Qianyu Dong ([email protected])
Zehang R Li ([email protected])
Jon Wakefield ([email protected])
A dataset linking administrative level 2 regions to their corresponding WHO region and country codes.
adm2.link.all
adm2.link.all
A data frame with 1,095 rows and 8 columns:
WHO-designated region for the country.
Three-letter ISO country code.
Administrative level 0 (country) name for visualization.
Administrative level 1 (first subnational division) name.
Administrative level 2 (second subnational division) name.
Global unique identifier for level 0 (country).
Global unique identifier for level 1 (first subnational division).
Global unique identifier for level 2 (second subnational division).
This function converts administrative level names (e.g., "National", "Admin-1") into corresponding numerical values.
admin_to_num(admin_level)
admin_to_num(admin_level)
admin_level |
A character string representing the administrative level. It can be "National" or "Admin-X" (where X is a positive integer). |
An integer representing the numerical level: - "National" is converted to 0. - "Admin-X" is converted to X as an integer. - Returns NULL if the input is invalid.
admin_to_num("National") # Returns 0 admin_to_num("Admin-1") # Returns 1 admin_to_num("Admin-2") # Returns 2 admin_to_num("Invalid") # Returns NULL
admin_to_num("National") # Returns 0 admin_to_num("Admin-1") # Returns 1 admin_to_num("Admin-2") # Returns 2 admin_to_num("Invalid") # Returns NULL
A dataset containing estimated health indicators from the DHS API, including country-level estimates and subgroup breakdowns.
DHS_api_est
DHS_api_est
A data frame with 28,110 rows and 7 columns:
Name of the country where the survey was conducted.
Three-letter ISO country code.
Year in which the DHS survey was conducted.
Unique identifier for the DHS survey.
Definition of the health indicator being estimated.
Estimated value of the indicator.
Label describing any subgrouping (e.g., urban/rural, age group).
A dataset for DHS health indicators, including their DHS official definitions, measurement types, and denominators.
DHS_ind_dictionary
DHS_ind_dictionary
A data frame with 4,433 rows and 5 columns:
Unique identifier for each DHS indicator.
Short name or label for the indicator.
Detailed description of the indicator.
Explanation of the denominator used in indicator calculation.
Type of measurement (e.g., percentage, count, rate).
A dataset containing metadata for countries included in DHS surveys, including standardized country codes across multiple international organizations.
DHS.country.meta
DHS.country.meta
A data frame with 60 rows and 12 columns:
Country code used by UNAIDS.
Name of the subregion (e.g., West Africa, South Asia).
Country code used by WHO.
Country code used by FIPS.
Country code used by UNICEF.
Name of the global region (e.g., Africa, Asia).
Two-letter ISO country code.
Three-letter ISO country code.
Numeric ordering of the region for visualization.
Unique country code assigned by DHS.
Full name of the country.
Country code used by UN Statistics Division.
A dataset containing metadata on recodes for DHS surveys supported by the app.
DHS.dataset.meta
DHS.dataset.meta
A data frame with 5,636 rows and 13 columns:
Format of the dataset file (e.g., Stata, SPSS, CSV).
Size of the dataset file in megabytes.
Type of dataset/recode (e.g., Household, Individual, Children, Biomarker).
Survey number assigned by DHS.
Unique ID of the survey associated with the dataset.
Type of file (e.g., Household Recode, Birth Recode, Men's Recode).
Date when the dataset file was last modified.
Type of DHS survey (e.g., DHS, AIS, MIS).
Label describing the survey year.
Year in which the DHS survey was conducted.
Unique country code assigned by DHS.
Name of the dataset file.
Full name of the country associated with the dataset.
A dataset containing metadata for DHS surveys supported by the app.
DHS.survey.meta
DHS.survey.meta
A data frame with 153 rows and 30 columns:
Date the survey data was released.
Unique ID assigned to each DHS survey.
Maximum age of surveyed women.
Start date of survey fieldwork.
Minimum age of surveyed men.
Organization implementing the survey.
IDs of survey characteristics.
Type of survey (e.g., DHS, AIS, MIS).
Label describing the survey year.
Boolean. Whether indicator data is available.
Minimum age of surveyed women.
Year in which the DHS survey was conducted.
End date of survey fieldwork.
DHS country code.
Number of sample points in the survey.
Name of the country where the survey was conducted.
Number of women surveyed.
Name of the subregion.
Maximum age of surveyed men.
Survey number assigned by DHS.
Status of the survey (e.g., Completed, Ongoing).
Number of health facilities surveyed.
Population coverage for male respondents.
Region name for survey coverage.
Population coverage for female respondents.
Additional survey notes.
Date when the survey results were published.
Number of households surveyed.
Number of men surveyed.
Whether GPS coordinates are available for cluster locations.
The dataset 'match_all_result' contains the results of matching indicators from DHS (Demographic and Health Surveys) with those extracted from GitHub. The dataset includes similarity scores, recoding names, indicator definitions, and positional information within the matched text.
match_all_result
match_all_result
A data frame with 132 rows and 20 variables:
Character. Unique ID of the indicator in DHS.
Character. Label assigned to the indicator in DHS.
Character. Description or definition of the DHS indicator.
Character. Combined text representation for similarity matching.
Numeric. Similarity score between DHS and GitHub indicators.
Character. Unique ID of the indicator from GitHub.
Character. Initial recoding name for the indicator.
Character. Revised recoding name after updates.
Character. Raw version of the GitHub indicator ID.
Character. Processed indicator definition from GitHub.
Character. Unprocessed raw definition from GitHub.
Character. The chapter or category of the indicator.
Character. Associated R bundle or package for processing.
Integer. Start position of the matched pattern in text.
Integer. End position of the matched pattern in text.
Character. The exact text pattern matched between sources.
Character. Grouping variable for batch recoding.
Character. Final matched indicator name.
Unknown. This variable needs clarification or may be redundant.
Character. First two letters of the indicator ID, possibly used for grouping or sorting.
A spatial dataset containing national-level WHO boundaries for eight countries, with administrative codes and metadata.
natl.WHO.shp
natl.WHO.shp
A spatial data frame with 8 rows and 35 columns:
WHO-designated region for the country.
Three-letter ISO country code.
Administrative level 0 (country) name.
Administrative level 0 country code.
...
This function converts a numerical administrative level into its corresponding string format.
num_to_admin(num)
num_to_admin(num)
num |
A single integer representing the administrative level. The value 0 corresponds to "National", while positive integers correspond to "Admin-X". |
A character string representing the administrative level: - 0 is converted to "National". - Positive integers are converted to "Admin-X". - Returns NULL if the input is invalid.
num_to_admin(0) # Returns "National" num_to_admin(1) # Returns "Admin-1" num_to_admin(2) # Returns "Admin-2"
num_to_admin(0) # Returns "National" num_to_admin(1) # Returns "Admin-1" num_to_admin(2) # Returns "Admin-2"
A dataset containing the original 22 health and demographic indicators supported in the app.
ref_tab_22
ref_tab_22
A data frame with 22 rows and 13 columns:
Unique identifier for the indicator.
Short label for the indicator.
Detailed definition of the indicator.
General category or theme of the indicator.
Abbreviation of the chapter where the indicator appears.
Logical. Whether the indicator is available in the Individual Recode dataset.
Logical. Whether the indicator is available in the Household Members dataset.
Logical. Whether the indicator is available in the Children's Recode dataset.
Logical. Whether the indicator is available in the Birth Recode dataset.
Logical. Whether the indicator is available in the Household Recode dataset.
Logical. Whether the indicator is available in the Men's Recode dataset.
Logical. Whether the indicator is available in the AIDS Indicator Survey dataset.
Logical. Whether the indicator is available in the Couple's Recode dataset.
A dataset containing all 153 indicators supported in the app, combining the original 22 indicators with the newly added ones.
ref_tab_all
ref_tab_all
A data frame with 153 rows and 15 columns:
Unique identifier for the indicator.
Short label for the indicator.
Detailed definition of the indicator.
General category or theme of the indicator.
Abbreviation of the chapter where the indicator appears.
Logical. Whether the indicator is available in the Individual Recode dataset.
Logical. Whether the indicator is available in the Household Members dataset.
Logical. Whether the indicator is available in the Children's Recode dataset.
Logical. Whether the indicator is available in the Birth Recode dataset.
Logical. Whether the indicator is available in the Household Recode dataset.
Logical. Whether the indicator is available in the Men's Recode dataset.
Logical. Whether the indicator is available in the AIDS Indicator Survey dataset.
Logical. Whether the indicator is available in the Couple's Recode dataset.
Chapter reference from the DHS reports.
Title of the section where the indicator appears.
A dataset containing newly added health and demographic indicators in the app after its initial release.
ref_tab_new
ref_tab_new
A data frame with 134 rows and 13 columns:
Unique identifier for the indicator.
Short label for the indicator.
Detailed definition of the indicator.
General category or theme of the indicator.
Abbreviation of the chapter where the indicator appears.
Logical. Whether the indicator is available in the Individual Recode dataset.
Logical. Whether the indicator is available in the Household Members dataset.
Logical. Whether the indicator is available in the Children's Recode dataset.
Logical. Whether the indicator is available in the Birth Recode dataset.
Logical. Whether the indicator is available in the Household Recode dataset.
Logical. Whether the indicator is available in the Men's Recode dataset.
Logical. Whether the indicator is available in the AIDS Indicator Survey dataset.
Logical. Whether the indicator is available in the Couple's Recode dataset.
Run the Shiny Application
run_app( onStart = NULL, options = list(), enableBookmarking = NULL, uiPattern = "/", ... )
run_app( onStart = NULL, options = list(), enableBookmarking = NULL, uiPattern = "/", ... )
onStart |
A function that will be called before the app is actually run.
This is only needed for |
options |
Named options that should be passed to the |
enableBookmarking |
Can be one of |
uiPattern |
A regular expression that will be applied to each |
... |
arguments to pass to golem_opts. See '?golem::get_golem_options' for more details. |