mrpowers-io / spark-daria

Essential Spark extensions and helper methods ✨😲
MIT License
754 stars, 152 forks

[Feature] Add a function to generate data conforming to a known statistical distribution #155

Closed zeotuan closed 2 weeks ago

zeotuan commented 1 month ago

This feature would be similar to https://github.com/mrpowers-io/quinn/issues/88, or to the sparklyr functions that generate a DataFrame whose values follow a given distribution, e.g. https://spark.posit.co/packages/sparklyr/latest/reference/sdf_rexp.html
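For illustration, here is a rough sketch of what an `sdf_rexp`-style helper could look like in Scala. The object name `DistributionSketch`, the method name `rexp`, and its parameters are hypothetical and not part of spark-daria. It assumes Spark SQL is on the classpath and uses inverse transform sampling on top of Spark's built-in `rand`: if U is uniform on [0, 1), then -ln(1 - U) / rate is exponentially distributed with that rate.

```scala
import org.apache.spark.sql.{DataFrame, SparkSession}
import org.apache.spark.sql.functions.{lit, log, rand}

// Hypothetical helper for illustration only; not an existing spark-daria API.
object DistributionSketch {

  // Returns a single-column DataFrame with n draws from an exponential
  // distribution with the given rate, via inverse transform sampling.
  def rexp(
      spark: SparkSession,
      n: Long,
      rate: Double,
      seed: Long,
      colName: String = "x"
  ): DataFrame = {
    require(rate > 0, "rate must be positive")
    // rand(seed) is uniform on [0, 1), so 1 - U lies in (0, 1] and log is finite.
    val draw = -log(lit(1.0) - rand(seed)) / lit(rate)
    spark.range(n).select(draw.as(colName))
  }
}
```

A call such as `DistributionSketch.rexp(spark, 1000L, rate = 2.0, seed = 42L)` would produce 1000 non-negative values whose sample mean should be roughly 1 / rate. Other distributions could follow the same pattern by swapping in a different inverse CDF, or by using `randn` for the normal case.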

Random integers generation: