laws-africa / slaw

Slaw is a lightweight library for rendering and generating Akoma Ntoso acts from plain text and PDF documents.
MIT License
27 stars 11 forks source link

test for hansard in latinamerica #5

Closed johnfelipe closed 10 years ago

johnfelipe commented 10 years ago

http://www.concejodemedellin.gov.co/concejo/m21_gallery/23698.pdf, please tell me if possible test your app for create akoma ntoso for sayit app

longhotsummer commented 10 years ago

In principle this would work but not with the current grammar.

  1. You will need to create a new grammar that matches the format of your documents. This depends heavily on the format of those documents, how much it varies from document to document, and what you'd like to get out of it.
  2. You will also need to write code to convert the syntax tree output from the grammar into appropriate Akoma Ntoso. My AN experience is limited to the Acts format so I'm not sure how much work that would be.

Once you have those two things, you could use both slaw and steno to build a webapp to capture and export the hansard into XML.

The grammar is by far the most complex thing, you will need an understanding of grammars and treetop.

johnfelipe commented 10 years ago

Hi, please tell me how i do that, share me some help, plz

longhotsummer commented 10 years ago

Hi John. Take a look at what Code for America did to convert the Ferguson transcripts from PDF into SayIt XML, which sounds a lot like what you're trying to do: https://github.com/ondrae/ferguson-grand-jury-transcripts

It's much simpler than what we're trying to do.