title | author | date | output |
---|---|---|---|
Missing Data Explorer | Nelson Gonzabato | `r Sys.Date()` | html_document |
- There is now a function `recode_as_value` for general recoding. See #41.
- Fixed `all_of` warnings in `dplyr` `select` and `across` verbs.
- Tests now use edition 3 of `testthat`. See r-lib/testthat#1471 and Nelson-Gon/manymodelr#22.
- Vignettes no longer include URLs, to avoid future issues with broken links on CRAN.
- Exclusion or inclusion in `na_summary` via regular expressions is now possible by setting `regex_kind` and providing a `pattern_type` and `pattern`.
- `exclusion` is now done entirely in `base` R instead of using `dplyr::select`.
- `get_na_means` and `percent_missing` now support `POSIXct`.
- `na_counts` and `percent_na` are new vector-focused functions for getting NA counts and percent missingness for objects of classes such as `POSIXct`.
- Added a new argument to reset row names in `na_summary`. Fixes #33.
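
The `POSIXct` support and pattern-based exclusion entries above can be illustrated with a minimal base R sketch. The helper names `count_na_sketch` and `percent_na_sketch` and the example data are hypothetical; this shows the idea only and is not the package's own implementation.

```r
# Hypothetical helpers (not the package's code): vector-level missingness
# that works for any class supporting is.na(), including POSIXct.
count_na_sketch <- function(x) sum(is.na(x))
percent_na_sketch <- function(x) 100 * mean(is.na(x))

timestamps <- as.POSIXct(
  c("2021-01-01 10:00:00", NA, "2021-01-03 12:30:00", NA),
  tz = "UTC"
)

count_na_sketch(timestamps)    # number of missing timestamps
percent_na_sketch(timestamps)  # share of missing timestamps, in percent

# Excluding columns by regular expression using only base R subsetting.
readings <- data.frame(id = 1:3, temp_a = c(1, NA, 3), temp_b = c(NA, NA, 2))
kept <- readings[, !grepl("^temp", names(readings)), drop = FALSE]
names(kept)
```

Keeping `drop = FALSE` ensures the result stays a `data.frame` even when only one column survives the exclusion.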
- Fixed a bug that made `exclude_cols` result in non-exclusion in grouped `na_summary` outputs.
- `na_summary` now includes a `round_to` parameter to control the number of digits to round to. Defaults to `options("digits")` in the user's settings.
- `drop_na_at` no longer uses a `tibble` to avoid warnings with `tibble` `row.names`.
- Tests for `dict_recode` were extended to ensure that results are as expected, particularly with respect to the `use_func` argument.
- `dict_recode` now supports the use of different `NA` recoding functions. These can be provided via the argument `use_func`.
- `dict_recode` is a new function to allow recoding with a pattern-values pair.
- `drop_na_if` now supports keeping certain columns when grouping. It also now supports dropping only at certain columns.
- `drop_na_if` now supports dropping groups that meet a set missing data threshold.
- `recode_as_na_str` was updated to convert factors to character.
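
The pattern-values idea behind the `dict_recode` entries above might look roughly like the following base R sketch. The helper `recode_pairs_sketch`, its arguments, and the example data are hypothetical, and the sketch assumes the recode-to-`NA` direction with prefix matching on column names; the package's actual behaviour depends on `use_func` and may differ.

```r
# Hypothetical helper (not the package's code): for each pattern/value
# pair, set `value` to NA in every column whose name starts with `pattern`.
recode_pairs_sketch <- function(df, patterns, values) {
  stopifnot(length(patterns) == length(values))
  for (i in seq_along(patterns)) {
    cols <- names(df)[startsWith(names(df), patterns[i])]
    for (col in cols) {
      df[[col]][df[[col]] %in% values[i]] <- NA
    }
  }
  df
}

survey <- data.frame(
  age_years = c(34, 999, 51),
  wt_kg = c(-1, 70, 82),
  wt_lb = c(154, -1, 180)
)

# 999 marks missing ages; -1 marks missing weights.
recoded <- recode_pairs_sketch(
  survey,
  patterns = c("age", "wt"),
  values = c(999, -1)
)
recoded
```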
- `drop_row_if` arguments were changed. See docs for the new arguments. This was done to allow new functionality that enables dropping rows based on counts.
- Sorting strings in `na_summary` contained a minor bug that has now been fixed.
- `recode_as_na_str` is a new function to `recode_as_na` based on string matching.
- `recode_as_na_if` is a new function that allows deliberate addition of `NA`s to a column if it meets a pre-defined condition. This is essentially the reverse of `drop_na_if`.
- `drop_row_if` is a new function similar to `drop_na_if` but for rows instead of columns.
- `percent_missing` can now work independently for factors, characters, or numeric data. It was previously limited to `data.frame` objects.
- Fixed issues with `drop_na_at` that resulted in failing data rebinding.
- `get_na_counts` now includes an optional `exclude_cols` argument to exclude columns from the analysis.
- `na_summary` has been fixed to allow exclusion of columns from the analysis.
- `all_na` is a new low-level function to perform boolean missingness checks for columns (vectors).
- Added a new function `column_based_recode` that aims to allow `NA` recoding based on other columns.
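
As a rough illustration of the row-wise dropping and vector-level checks described above, here is a minimal base R sketch. The helpers `drop_rows_sketch` and `all_na_sketch`, the `percent_na` argument, and the "at or above the threshold" rule are assumptions made for the example and are not taken from the package.

```r
# Hypothetical helper (not the package's code): drop rows whose share of
# missing values is at or above `percent_na`.
drop_rows_sketch <- function(df, percent_na = 50) {
  row_percent <- 100 * rowMeans(is.na(df))
  df[row_percent < percent_na, , drop = FALSE]
}

# Hypothetical helper: TRUE when every element of a vector is missing.
all_na_sketch <- function(x) all(is.na(x))

measurements <- data.frame(
  a = c(1, NA, 3),
  b = c(NA, NA, 6),
  c = c(7, NA, 9)
)

# The second row is entirely missing, so it is the only one dropped.
trimmed <- drop_rows_sketch(measurements, percent_na = 50)
trimmed

# Column-wise check: is any column entirely missing?
vapply(measurements, all_na_sketch, logical(1))
```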
- Patch fix to skip tests on old R releases.
- Only R versions above 3.6.3 are supported.
Additions
- `na_summary` provides a very quick overview of missingness. It also supports grouped summaries.
- `drop_na_if` allows easy dropping of columns where all values are missing.
- `custom_na_recode` allows replacing missing values with common values such as mean, min, max, sd.
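
Replacing missing values with a summary statistic, as `custom_na_recode` is described as doing, can be sketched in base R as follows. The helper `fill_na_sketch` and its `func` argument are hypothetical, and restricting the replacement to numeric columns is an assumption of this example.

```r
# Hypothetical helper (not the package's code): replace NAs in numeric
# columns with a summary statistic computed from the observed values.
fill_na_sketch <- function(df, func = mean) {
  numeric_cols <- vapply(df, is.numeric, logical(1))
  df[numeric_cols] <- lapply(df[numeric_cols], function(x) {
    x[is.na(x)] <- func(x, na.rm = TRUE)
    x
  })
  df
}

scores <- data.frame(x = c(1, NA, 3), label = c("a", NA, "c"))

filled_mean <- fill_na_sketch(scores)             # NA in x becomes mean(1, 3)
filled_max <- fill_na_sketch(scores, func = max)  # NA in x becomes max(1, 3)
filled_mean
```

Non-numeric columns such as `label` are left untouched, since a mean or standard deviation is not defined for them.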
Major changes
- In `percent_missing`, the argument `grouped` was dropped in favour of simply providing a `grouping_cols` vector.
- In `recode_as_na`, `subset_df` was dropped. It now simply accepts an optional `subset_cols` argument. The argument `tidy` was also dropped. One can simply provide an optional `pattern_type` and `pattern`.
- Similar changes were made for `recode_na_as`.
- The argument `x` was changed to `df` in `drop_na_at`.
Available functions
- `get_na_counts`
- `percent_missing`
- `recode_as_na`
- `sort_by_missingness`
- `recode_na_as`
- `drop_na_if`
- `recode_na_if`
- `drop_na_at`
- `recode_as_na_for`
Key changes
- Now supports tidy selection and exploration at specific columns.
- `percent_missing` supports grouping and exclusion of certain columns. The use of decimals was dropped.
- `drop_na_if` allows exclusion of columns.
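
Grouped percent missingness, as mentioned for `percent_missing` above, can be approximated in base R with `aggregate`. The helper `grouped_percent_missing_sketch` and the example data are hypothetical; the package's output format may differ.

```r
# Hypothetical helper (not the package's code): percent of missing values
# per column, computed within each group defined by `grouping_cols`.
grouped_percent_missing_sketch <- function(df, grouping_cols) {
  value_cols <- setdiff(names(df), grouping_cols)
  aggregate(
    df[value_cols],
    by = df[grouping_cols],
    FUN = function(x) 100 * mean(is.na(x))
  )
}

weather <- data.frame(
  site = c("A", "A", "B", "B"),
  temp = c(NA, 20, 21, 22),
  rain = c(NA, NA, 3, NA)
)

by_site <- grouped_percent_missing_sketch(weather, grouping_cols = "site")
by_site
```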