AI-SDC / ACRO

Tools for the Automatic Checking of Research Outputs. These are tools for researchers to use as drop-in replacements for commands that produce outputs in Stata, Python and R.
MIT License

Agree aggregation functions to implement #13

Closed jim-smith closed 1 year ago

jim-smith commented 2 years ago

SPSS list: https://www.ibm.com/support/pages/how-do-i-aggregate-spss

Aggregate functions include:

jim-smith commented 2 years ago

TODO: find out what Stata, SPSS and SAS do, then consult.

jim-smith commented 2 years ago

This is what Stata provides:

mean: mean
semean: standard error of the mean
sebinomial: standard error of the mean, binomial
sepoisson: standard error of the mean, Poisson
variance: variance
sd: standard deviation
skewness: skewness
kurtosis: kurtosis
cv: coefficient of variation
svycv: coefficient of variation (svy)
count: number of nonmissing values
median: median
p#: #th percentile
q1: first quartile
q2: second quartile
q3: third quartile
iqr: interquartile range
min: minimum value
max: maximum value
range: range
first: first value
last: last value
firstnm: first nonmissing value
lastnm: last nonmissing value
total: total
rawtotal: unweighted total
fvfrequency: frequency of each factor-variable level
fvrawfrequency: unweighted frequency of each factor-variable level
fvproportion: proportion within each factor-variable level
fvrawproportion: unweighted proportion within each factor-variable level
fvpercent: percentage within each factor-variable level
fvrawpercent: unweighted percentage within each factor-variable level
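For discussion, here is a rough sketch of how a handful of the simpler statistics in the Stata list above could be expressed in plain Python. It is not ACRO's implementation: the `AGGREGATIONS` table and the `aggregate` helper are hypothetical names made up for this example. It assumes that missing values are represented as `None` or NaN and are dropped first, and that `sd`, `variance` and `semean` use the sample (n - 1) forms.

```python
import math
import statistics


def _nonmissing(values):
    """Drop None and NaN entries (assumed here to represent missing values)."""
    return [
        v for v in values
        if v is not None and not (isinstance(v, float) and math.isnan(v))
    ]


# Hypothetical lookup table: Stata-style statistic name -> Python callable.
# Only a small, unambiguous subset of the list is covered.
AGGREGATIONS = {
    "mean": statistics.fmean,
    "median": statistics.median,
    "variance": statistics.variance,  # sample variance (n - 1 denominator)
    "sd": statistics.stdev,           # sample standard deviation
    "semean": lambda xs: statistics.stdev(xs) / math.sqrt(len(xs)),
    "count": len,                     # number of nonmissing values
    "min": min,
    "max": max,
    "range": lambda xs: max(xs) - min(xs),
    "total": sum,
}


def aggregate(values, stat):
    """Apply the named statistic to the nonmissing entries of values."""
    if stat not in AGGREGATIONS:
        raise ValueError(f"unsupported statistic: {stat!r}")
    return AGGREGATIONS[stat](_nonmissing(values))
```

For example, `aggregate([1, 2, 3, None, 4], "mean")` gives 2.5 and `aggregate([1, 2, 3, None, 4], "count")` gives 4. Percentiles and quartiles are left out on purpose, because the interpolation rule differs between packages and would need to be agreed separately.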

jim-smith commented 1 year ago

We have the basics agreed and implemented. Closing in favour of adding separate issues for each additional aggregation feature as identified by workpackage1 (or doing them in batches).