NEWS.md
as_survey_rep()
now has an argument degf
, corresponding to the same argument in the survey function svrepdesign()
. This argument can be useful for large data sets, since specifying a value for degf
avoids a calculation which can be slow for very large data sets, (#171, @bschneidr).rename_with()
(#172, thanks @josiahg2)as_survey
and related functions are called on a grouped_df (#178, thanks @rossellhayes)..by
argument works like it does for dplyr in summarize
and mutate
(#179, thanks @szimmer)survey_mean()
instead of leaving them packedsurvey_prop()
now uses proportions as the default, which should confidence interval improve coverage, but does mean results may slightly change (#141, #142, thanks @szimmer)survey_corr()
calculates the correlation between 2 variables, (#150, #151, thanks @szimmer & @bschneidr)dplyr::reframe
for srvyr’s objectsas_survey_rep()
now accepts type="ACS"
and type="successive-difference"
for user-supplied replicate weights. (#153, thanks @bschneidr)cur_svy_wts()
to access the survey weights (#136, #139, thanks @ray-p144 and @bschneidr)cur_svy()
and cur_svy_wts()
in mutate
and filter
(#138, #139, thanks @ray-p144 and @bschneidr)interact()
when using cascade()
(#133, thanks @szimmer)1234
) in cascade (#132, thanks @szimmer)survey_old_quantile()
and survey_old_median()
interact
that makes it easier to calculate proportions among interacted groupsanti_join
and semi_join
) are now available for srvyr objects. You must put the tbl_svy
object first. (#65, #120, @bschneidr)dplyr::across
). (#129)survey_mean()
with no x
no longer errors when there are no grouping variables (#117)summarize
has been rearchitected,
dplyr::across()
now works within itdplyr::cur_group()
, dplyr::cur_group_id()
, dplyr::cur_data()
work in it (as well as new analogous functions srvyr-specific cur_svy()
and cur_svy_full()
)objects in the summarize
will refer to the output of summarize
before the input. Meaning code that looks like this:
dstrata %>% summarize(api99 = survey_mean(api99), api_diff = survey_mean(api00 - api99))
will now error because it calculates the mean of api99
before using it inside of the calculation for api_diff
. This behavior better matches dplyr
’s so will likely be kept.
Support for group_map()
/group_walk()
/group_map_dfr()
, group_split()
, group_nest()
and nest_by()
were added for tbl_svy
objects.
Support drop_na
from tidyr (#107).
as_survey()
and as_survey_()
are now idempotent: given a srvyr
survey object (a tbl_srv
), they return it unchanged. If extra arguments are provided, they are ignored with a warning (#97, thanks @krivit).
rename_with()
now works with surveys (#96, thanks @krivit).
Fix to ensure that ordered factors can be used as grouping variables or as inputs to survey_count
and survey_tally
(#92, thanks for reporting @szimmer & @walkerke & for fixing @bschneidr).
Fix to ensure that numeric values can be used in grouping variables (#78 & #74, thanks for reporting @tzoltak & fix @bschneidr)
Some improvements for dplyr 1.0 (#79) transmute()
now works (thanks for reporting @caayala), summarise()
’s .groups
argument is respected, and multi-row returns to summarise()
work. (Unfortunately the new across()
function isn’t quite supported in summarise()
yet, it will hopefully come soon)
unweighted
now evaluates in the right context and so will provide correct error when an incorrectly interpolated function is used (#70, thanks for reporting @tlmcmurry)
filter_at
works now, (#57, thanks for reporting @dcaseykc & helping @bschneidr).
Fix for upcoming version of tibble (#72).
Small update to quasiquotation syntax inside unweighted
to improve consistency with recent rlang updates (#54).
Added functions survey_tally()
and survey_count()
(#53)
New functions survey_var and survey_sd to calculate population variance and standard deviaton.
Computation of standard errors in all survey_ functions can be suppressed by setting vartype=NULL (#45, thanks @tzoltak).
Fixed an issue where you’d get an error when summarize components returned different lengths of data - usually when factor levels were not present in the data (#49).
Removed references to MonetDBLite since it has been removed from CRAN.
Small updates to replace soft-deprecated dplyr functions with their tibble and tidyselect equivalents (#52, thanks @bschneidr).
Add warning to explain that design effects cannot be calculated on proportions. (#39, thanks @mlaviolet)
Remove dependency on stringr in tests and add DBI to suggests so that test dependencies are correctly specified (#40, thanks CRAN!)
When converting from a survey db-backed survey to a srvyr one srvyr now tries to capture the updates you’ve already sent. If dbplyr can convert the function, then it will bring the update. If it can’t it will warn you (#35).
Small bug fixes, mostly having to do with CRAN checks, running on CI services, or for upstream rev dep checks.
srvyr now uses tidy evaluation from rlang. The “underscore” functions have been soft deprecated in favor of quosure splicing. See dplyr’s vignette “programming” for more details. In almost all cases, the old syntax will still work, with one exception: the standard evaluation function as_survey_twophase_()
had to be changed slightly so that the entire list is inside quotation.
Datbase support has been rewritten. It should be faster now and doesn’t require a unique identifier. You also can now convert survey db-backed surveys to srvyr with as_survey.
srvyr now has a pkgdown site, check it out at http://gdfe.co/srvyr/
Added support for dplyr mutate_at/_if/_all and summarize_at/_if/_all for srvyr surveys.
Fixed a few bugs introduced with dplyr 0.6. This version of srvyr will work with both old versions of dplyr and 0.6, but may be full of warnings if you update dplyr. Full support for the new dplyr is coming soon.
Fixed a problem with confidence levels not being passed into quantiles
Added deff parameter to survey_mean()
, survey_total()
and survey_median()
, and a df parameter to those functions and survey_quantile()
/ survey_median()
.
summarize
and mutate
match dplyr’s behavior when arguments aren’t named (uses dplyr::auto_name()
)
New function cascade
summarizes groups, and cascades to create summary statistics of groups of groups.
Fixed a bug for confidence intervals for survey_total()
on groups.
Fixed some issues with the upcoming version of dplyr.