Closed DSarna closed 4 years ago
Interesting idea. I will make a first pass at our (long overdue) text export options. Hopefully I can get this started in a way that anyone else can plugin to for different output formats.
Fantastic.
On Tuesday, April 8, 2014, blockspeiser notifications@github.com wrote:
Interesting idea. I will make a first pass at our (long overdue) text export options. Hopefully I can get this started in a way that anyone else can plugin to for different output formats.
Reply to this email directly or view it on GitHubhttps://github.com/blockspeiser/Sefaria-Project/issues/84#issuecomment-39878264 .
Regards,
David E Y Sarna
Work: (201) 357-2084 Skype:DavidSarna Cell: 2019828580 Fax: (201) 905-5628 Email: David@hshco.com Twitter:@DSarna
Publications and CV at http://independent.academia.edu/DavidSarna
Website: www.hshco.com Amazon Page http://amzn.to/DavidSarna http://amzn.to/ohAoWq
Evernote For Dummieshttp://www.amazon.com/gp/product/1118107381/ref=as_li_qf_sp_asin_tl?ie=UTF8&tag=httpgooglegco-20&link_code=as3&camp=211189&creative=373489&creativeASIN=1118107381 Implementing Cloud Computing Applications http://amzn.to/J22Nlk History of Greedhttp://www.amazon.com/gp/product/0470601809/ref=as_li_qf_sp_asin_tl?ie=UTF8&tag=httpgooglegco-20&link_code=as3&camp=211189&creative=373489&creativeASIN=0470601809 Hendon, Stamford Hill & Co.
625 North Forest Drive Teaneck, NJ 07666
This email contains PRIVILEGED AND CONFIDENTIAL information intended only for the use of the addressee(s) named above. If you are not the intended recipient of this email, or an authorized employee or agent responsible for delivering it to the intended recipient, you are hereby notified that any dissemination or copying of this email is strictly prohibited. If you have received this email in error, please notify us by reply email and delete this email from your records. Thank you.
Disclaimer regarding Uniform Electronic Transactions Act ("UETA") (Florida Statutes Section 668.50): UETA does not apply to this communication; contract formation shall occur only with manually-affixed original signatures on original documents.
I've just made a first pass at some simpler export formats for all the texts in the Sefaria Database. I've started with a JSON format akin to what Sefaria uses internally and one plain text format, which you can now see here: https://github.com/blockspeiser/Sefaria-Data/tree/master/json https://github.com/blockspeiser/Sefaria-Data/tree/master/txt
If anyone is interested in having other formats (such as ENEX), you can write a python function to take a document (which looks just like the JSON above) and returns the text to output. See here: https://github.com/blockspeiser/Sefaria-Project/blob/master/sefaria/export.py#L33
There's plenty more to do here. I've listed some known issues below, but hopefully this will be helpful to some as is.
Known Issues:
Evernote is a popular cloud based repository with very powerful search capabilities. ENEX is a form of XML. Evernote uses ENEX as an import/export tool. Converting a Sefaria database to ENEX and then importing the result as an Evernote notebook would offer many benefits. Evernote notebooks can also be shared and made public. I am looking for partners interested in working with me to make this happen. I would like to see all Sefaria texts converted to ENEX and made into public notebooks, starting with the Bavli. Please respond if you have an interest in this project.