finos / datahelix

The DataHelix generator allows you to quickly create data, based on a JSON profile that defines fields and the relationships between them, for the purpose of testing and validation
https://finos.github.io/datahelix/
Apache License 2.0
141 stars 50 forks source link

DataHelix provides test and learn support to Gensyn #1684

Closed mcleo-d closed 4 years ago

mcleo-d commented 4 years ago

Description

cc @benfielding from Gensyn.

This issue has been created to help support and guide Gensyn in the Installation, Running and Modelling of test data to inform Gensyn feedback and next steps into the DataHelix project team.

Success Criteria

Definition of Done

Gensyn have enough information to make an informed choice on moving forward with DataHelix and have provided collaboration requirements to the DataHelix project team whilst agreeing next steps.

mcleo-d commented 4 years ago

Also related to https://github.com/finos/datahub/issues/32

tjohnson-scottlogic commented 4 years ago

Sounds good! Here to help, all feedback welcome.

BenFielding commented 4 years ago

Thanks both! Looking forward to diving in.

tjohnson-scottlogic commented 4 years ago

Hi @BenFielding, how's it going with this? Any first impressions?

mcleo-d commented 4 years ago

Hi @BenFielding

Earlier this month we met to discuss FINOS DataHub and DataHelix in relation to Gensyn synthetic data requirements.

Since our last meeting the FINOS synthetic data teams have been working with stakeholders from the open source community to gather requirements so we can shape our product roadmaps accordingly.

The reason for my email is to ask ...

  1. Did you have the opportunity to discuss evaluating DataHub and DataHelix with the rest of the team as a result of our discussion?
  2. Would you be willing to share your synthetic data requirements with the DataHub and DataHelix teams so we can incorporate your requirements into our product roadmaps? 

Finally, but not necessary, would you be willing to join combined DataHub and DataHelix project sessions to help prioritise and groom our backlogs to keep us moving forward?

Thanks for your time and I look forward to hearing from you soon.

@mcleo-d

BenFielding commented 4 years ago

Hi @mcleo-d & @tjohnson-scottlogic,

As with the Datahub project, we've successfully tested Datahelix from our side:

As with Datahub, the latter two points apply slightly differently to Gensyn as we're a technology partner developing a similar technology and contributing to future direction, rather than using Datahelix for a specific generation task. We've tested the library on the supplied test data examples by building docker containers using the supplied Dockerfile - which worked seamlessly.

In terms of feedback, we're looking forward to engaging on an ongoing basis from a project perspective.

mcleo-d commented 4 years ago

@tjohnson-scottlogic and @andrewcarrblue,

Please advise how you'd like to close the issue and move forward with @BenFielding.

Many thanks,

James.

mcleo-d commented 4 years ago

@andrewcarrblue happy to close this issue by funnelling use cases into the following story with @BenFielding and the synthetic data teams on 27th July standup call ...

https://github.com/finos/datahub/issues/30