| Title: | Comprehensive Data Summarization for Statistical Analysis |
| Version: | 0.1.1 |
| Author: | Immad Ahmad Shah [aut], Uzair Javid Khan [aut, cre], Sukhdev Mishra [aut] |
| Maintainer: | Uzair Javid Khan <uzairkhan11w@gmail.com> |
| Description: | Summarizes data frames by calculating various statistics including central tendency, dispersion, shape, and normality diagnostics. Handles numeric, character, and factor columns with NA-aware computations. |
| License: | GPL-3 |
| Encoding: | UTF-8 |
| RoxygenNote: | 7.3.3 |
| Imports: | moments, dplyr, nortest, stats |
| Suggests: | testthat (≥ 3.0.0) |
| Config/testthat/edition: | 3 |
| URL: | https://github.com/Uzairkhan11w/DataSum |
| BugReports: | https://github.com/Uzairkhan11w/DataSum/issues |
| NeedsCompilation: | no |
| Packaged: | 2025-11-12 16:58:31 UTC; Uzair |
| Repository: | CRAN |
| Date/Publication: | 2025-11-12 18:00:21 UTC |
Summarize an Entire Data Frame
Description
Applies Datum() to each column of a data frame and binds the results.
Usage
DataSumm(data)
Arguments
data |
A data frame (tibble is also ok). |
Value
A data frame, one row per input column.
Examples
DataSumm(iris)
Summarize a Single Vector
Description
Summarizes a single vector by calculating a consistent set of statistics. Numeric vectors include mean/median/variance/sd/min/max/range/skewness/kurtosis and a normality decision. Character/factor vectors report the mode only.
Usage
Datum(data)
Arguments
data |
A numeric, character, factor, or other vector. |
Value
A one-row data.frame with summary statistics.
Examples
Datum(rnorm(100))
Datum(factor(sample(letters[1:3], 20, TRUE)))
Get Mode of a Vector (numeric/character/factor)
Description
Returns the mode(s) of a vector. By default, returns a single string with
all modes collapsed by comma when there are ties. If you need only one mode,
set collapse = FALSE to return the first mode deterministically.
Usage
getmode(x, collapse = TRUE)
Arguments
x |
A vector (numeric, character, factor, etc.). |
collapse |
Logical; if TRUE (default), return all modes as a single comma-separated string. If FALSE, return the first mode only. |
Value
A single value (first mode) or a comma-separated string of modes.
Examples
getmode(c(1, 2, 2, 3, 4))
getmode(c("a", "b", "b", "a"), collapse = TRUE)
getmode(c("a", "b", "b", "a"), collapse = FALSE)
Shapiro/Anderson-Darling Normality Decision
Description
Performs Shapiro-Wilk for sample sizes between 3 and 5000 (inclusive), otherwise uses Anderson–Darling. Returns "Normal" if p > 0.05, else "Not Normal".
Usage
shapiro_normality_test(data)
Arguments
data |
A numeric vector. |
Value
Character scalar: "Normal", "Not Normal", or NA if not applicable.
Examples
shapiro_normality_test(rnorm(100))