Open Planeshifter opened 1 year ago
Hi @Planeshifter,
Based on my understanding of this RFC, I propose the following approach:
Develop a new package named @stdlib/datasets/random. This package will contain various functions or utilities to generate mock data for different types specified in the issue. Each utility will have an options parameter allowing customization. For example, for generating random emails, options might include:
datasetSize
: Specifies the size of the dataset.customDomains
: Allows selection from randomly chosen domains.usernameLengthRange
: Defines the range for the length of the username.includeSpecialChars
: Determines whether special characters are included.uniqueEmails
: Ensures that all generated email addresses are unique within the dataset.customProviderList
: Provides a custom list of email providers for random selection.
Each one of these options will have default value if not specified
Does this align with your expectations?@AhmedKhaled590 Thanks for volunteering to work on this. I think this is blocked at the moment, as we need to do a bit of R&D to determine the right API. This feature request is not as straightforward, as we need to think about PRNG seeding, etc, and PRNG APIs are often quite involved. I suggest trying to find a different issue to work on for the time being.
Description
This RFC proposes to add a
@stdlib/datasets/random
namespace with packages for generating mock data.All these utilities would likely support an
options
object parameter for control of what to return and sample from underlying datasets, which is why they should reside in@stdlib/datasets
.Related Issues
None.
Questions
No.
Other
No.
Checklist
RFC:
.