google-deepmind / alphageometry

Apache License 2.0

Could you please share the Training Dataset OR the script for Data Generation? #79

Open giangdip2410 opened 6 months ago

giangdip2410 commented 6 months ago

Hi @thtrieu, thank you for your great work. Could you please share the Training Dataset (~100 million unique theorem-proof pairs) OR the script for Data Generation, for research purposes only? Thank you so much.

2nazero commented 6 months ago

Hi :) I'm also very interested in this topic and would appreciate it if you could share the Training Dataset or the Data Generation script with me as well. I'm curious whether you've received similar requests from others. Thank you in advance!

Xuekai-Zhu commented 6 months ago

+1. If there is any update on the synthetic data from this paper, please @ me! Huge thanks!

Xuekai-Zhu commented 5 months ago

Thank you very much for your generous sharing. I have submitted my application, and my email is xuekaizhu0@gmail.com. I look forward to your approval!

ParthaEth commented 4 months ago

Look here - https://github.com/felixludos/alphageometry/tree/nl_verbalization

We are building an AlphaGeometry community. We already have random problem and proof generation code there.

tpgh24 commented 4 months ago

A while ago someone replied to this issue and posted a link, supposedly to part of the training data. I submitted a request but never received anything. Now I can't find that post any more; maybe the author deleted it?

To further improve AG, I think we need to collect data based on human-designed problems. I made some improvements to AG in a fork repository and have some ideas for improving it further; check out AG4Masses and issue 110.