arcenis-r/cepumdPublic

NotificationsYou must be signed in to change notification settings
Fork2
Star8

cepumd R package for computing CE expenditure estimated means

License

GPL-3.0 license

8 stars 2 forks Branches Tags Activity

Star

Notifications

You must be signed in to change notification settings

Branches Tags

Folders and files

Name		Name	Last commit message	Last commit date
Latest commit History 143 Commits
.github		.github
R		R
docs		docs
inst		inst
man		man
pkgdown/favicon		pkgdown/favicon
revdep		revdep
tests		tests
vignettes		vignettes
.Rbuildignore		.Rbuildignore
.gitignore		.gitignore
CRAN-RELEASE		CRAN-RELEASE
CRAN-SUBMISSION		CRAN-SUBMISSION
DESCRIPTION		DESCRIPTION
LICENSE.md		LICENSE.md
NAMESPACE		NAMESPACE
NEWS.md		NEWS.md
README.Rmd		README.Rmd
README.md		README.md
_pkgdown.yml		_pkgdown.yml
cepumd.Rproj		cepumd.Rproj
codecov.yml		codecov.yml
cran-comments.md		cran-comments.md

Repository files navigation

cepumd

The purpose of cepumd is to make working with Consumer ExpenditureSurveys (CE) Public-Use Microdata (PUMD) easier toward calculating mean,weighted, annual expenditures (henceforth “mean expenditures”). Thechallenges cepumd seeks to address deal primarily with pulling togetherthe necessary data toward this end. Some of the overarching ideasunderlying the package are as follows:

Use a Tidyverse framework for most operations and be (hopefully)generally Tidyverse friendly
Balance the effort to make the end user’s experience with CE PUMDeasier while being flexible enough to allow that user to perform anyanalysis with the data they wish
Only designed to help users calculate mean expenditures on and of theconsumer unit (CU), i.e., not income, not assets, not liabilities, notgifts.

Challenges addressed by`cepumd`

cepumd seeks to address challenges in three categories: datagathering/organization; managing data inconsistencies; and calculatingweighted, annual metrics.

Data gathering/organization
- Convert hierarchical grouping (HG) files to data tables usingce_hg()
- Help the user identify the Universal Classification Codes (UCCs)related to their analysis using a combination ofce_hg() andce_uccs()
- Combine all required files and variables usingce_prepdata()
Managing data inconsistencies
- Provide the ability to recode variable categories using the CEDictionary for Interview and Diary Surveys
- Resolve some inconsistencies such as differences code definitionsbetween the Interview and Diary (check the definitions of the“FAM_TYPE” variable categories in 2015 for an example)
- Provide useful errors or warnings when there are multiple categoriesof something the user is trying to access, e.g., some titles in thehierarchical grouping files (“stub” or “HG” files) repeat andrequires more careful selection of UCCs
Calculating weighted, annual metrics
- Calculate a mean expenditure withce_mean() or expenditurequantile withce_quantile()
- Account for the factor (annual vs. quarterly expenditure)
- Account for the “months in scope” of a given consumer unit (CU)
- Annualize expenditures for either Diary or Interview expenditures
- Integrate Interview and Diary data as necessary

Installation

Install the production version withinstall.packages("cepumd")

You can install the development version ofcepumd fromGitHub, but you’ll first need thedevtoolspackage:

if (!"devtools"%in% installed.packages()[,"Package"]) {  install.packages("devtools",dependencies=TRUE)}devtools::install_github("arcenis-r/cepumd")

Key cepumd functions

The workhorse ofcepumd isce_prepdata(). It merges the householdcharacteristics file (FMLI/-D) with the corresponding expendituretabulation file (MTBI/EXPD) for a specified year, adjusts weights formonths-in-scope and the number of collection quarters, adjusts somecost values by their periodicity factor (some cost categories arerepresented as annual figures and others as quarterly). With therecent update it only requires the first 3 arguments to function: theyear, the survey type, and one or more valid UCCs.ce_prepdata() nowcreates all of the other necessary objects within the function if notprovided.
There are two functions for wrangling hierarchical grouping data intomore usable formats:
- ce_hg() pulls the requested type of HG file (Interview, Diary, orIntegrated) for a specified year.
- ce_uccs() filters the HG file for the specified expenditurecategory and returns either a data frame with only that section ofthe HG file or the Universal Classification Codes (UCCs) that makeup that expenditure category.
There are two functions that the user can use to calculate CE summarystatistics:
- ce_mean() calculates a mean expenditure, standard error of themean, coefficient of variation, and an aggregate expenditure.
- ce_quantiles() calculates weighted expenditure quantiles. It isimportant to note that calculating medians for integratedexpenditures is not recommended because the calculation involvesusing weights from both the Diary and Survey instruments.

About

cepumd R package for computing CE expenditure estimated means

arcenis-r.github.io/cepumd/

Resources

Readme

License

GPL-3.0 license

Code of conduct

Releases

1tags

Packages

No packages published

Languages

R100.0%

Movatterモバイル変換

Navigation Menu

Search code, repositories, users, issues, pull requests...

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

License

Folders and files

Latest commit

History

Repository files navigation

cepumd

Challenges addressed by`cepumd`

Installation

Key cepumd functions

About

Resources

License

Code of conduct

Contributing

Uh oh!

Stars

Watchers

Forks

Releases

Packages

Uh oh!

Languages

Movatterモバイル変換

License

arcenis-r/cepumd

Folders and files

Latest commit

History

Repository files navigation

cepumd

Challenges addressed bycepumd

Installation

Key cepumd functions

About

Resources

License

Code of conduct

Contributing

Uh oh!

Stars

Watchers

Forks

Releases

Packages0

Uh oh!

Languages

Challenges addressed by`cepumd`

Packages