brendanheywood / climbing-api

Automatically exported from code.google.com/p/climbing-api

Make tool to process raw text from old guides into Atom #1

Open GoogleCodeExporter opened 9 years ago

GoogleCodeExporter commented 9 years ago
Make a Perl script that takes raw text from legacy guides and uses 
heuristics to separate and mark up as much data as possible.

e.g.:

1) Some title    22m 22 **
Start: up the massive nose. Thin crack to ledge.
Joe Blogs, Fred Smith 2/78

plus some config:
Map 1 star to 1, 2 stars to 3, and 3 stars to 5.
The default grade system is Ewbank, or V for boulder problems.

turns into

<entry>
  <title>Some title</title>
  <dc:type>Ewbank</dc:type>
  <gd:rating min="1" max="5" value="3"/>
  <dc:format>22m 22</dc:format>
  <summary>Start: up the massive nose. Thin crack to ledge.</summary>
  <creator>Joe Blogs</creator>
  <creator>Fred Smith</creator>
  <dc:date>1978 Feb</dc:date>
</entry>
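
A minimal sketch of the heuristics for the example above, written in Python rather than the Perl the issue asks for. The regular expressions, the `parse_entry` name, the dictionary keys, and the assumption that two-digit years belong to the 1900s are all illustrative guesses, not part of the project. It only splits the raw text into fields; serialising them to Atom is not shown.

```python
import re

# Config from the example: number of stars -> rating on a 1..5 scale.
STAR_RATING = {1: 1, 2: 3, 3: 5}

# Assumed header layout: "1) Some title    22m 22 **"
HEADER_RE = re.compile(
    r"^(?P<num>\d+)\)\s+(?P<title>.+?)\s+(?P<length>\d+m)\s+"
    r"(?P<grade>\S+)(?:\s+(?P<stars>\*+))?\s*$"
)

# Assumed first-ascent layout: "Joe Blogs, Fred Smith 2/78"
FA_RE = re.compile(r"^(?P<names>.+?)\s+(?P<month>\d{1,2})/(?P<year>\d{2})$")


def parse_entry(text, grade_system="Ewbank"):
    """Split one raw guide entry into fields (hypothetical field names)."""
    lines = [ln.strip() for ln in text.strip().splitlines() if ln.strip()]
    header = HEADER_RE.match(lines[0])
    if header is None:
        raise ValueError("unrecognised header line: %r" % lines[0])

    entry = {
        "title": header["title"],
        "length": header["length"],
        "grade": header["grade"],
        "grade_system": grade_system,
        # No stars means no rating (None).
        "rating": STAR_RATING.get(len(header["stars"] or "")),
        "creators": [],
        "month": None,
        "year": None,
    }

    rest = lines[1:]
    # Heuristic: the last line is the first-ascent credit if it ends in M/YY.
    if rest:
        fa = FA_RE.match(rest[-1])
        if fa:
            entry["creators"] = [n.strip() for n in fa["names"].split(",")]
            entry["month"] = int(fa["month"])
            # Assumption: legacy guides, so two-digit years are 19xx.
            entry["year"] = 1900 + int(fa["year"])
            rest = rest[:-1]

    # Whatever remains is treated as the description.
    entry["summary"] = " ".join(rest) or None
    return entry


SAMPLE = """1) Some title    22m 22 **
Start: up the massive nose. Thin crack to ledge.
Joe Blogs, Fred Smith 2/78
"""

parsed = parse_entry(SAMPLE)
```

Real guides will not be this regular, so each pattern would probably need fallbacks (missing length, missing date, multi-line descriptions).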

Original issue reported on code.google.com by brendan....@gmail.com on 6 Mar 2010 at 12:46

GoogleCodeExporter commented 9 years ago
Should also try to create IDs. This probably needs to connect to a DB and use 
heuristics to match each entry against possible existing IDs. Should also add 
the 'next' and 'prev' links so we retain the correct order in the DB.
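
One possible shape for this, sketched in Python rather than Perl. The slug-style IDs, the `match_id` and `link_in_order` names, the 0.8 similarity cutoff, and the use of an in-memory list in place of a real DB lookup are all assumptions for illustration.

```python
import re
from difflib import get_close_matches


def slugify(title):
    """Turn a climb title into a candidate ID, e.g. 'Some Title' -> 'some-title'."""
    return re.sub(r"[^a-z0-9]+", "-", title.lower()).strip("-")


def match_id(title, existing_ids, cutoff=0.8):
    """Return the closest existing ID, or a new slug if nothing is close enough.

    existing_ids stands in for whatever the DB would return.
    """
    candidate = slugify(title)
    close = get_close_matches(candidate, existing_ids, n=1, cutoff=cutoff)
    return close[0] if close else candidate


def link_in_order(ids):
    """Attach 'prev' and 'next' IDs so the guide order survives in the DB."""
    links = []
    for i, entry_id in enumerate(ids):
        links.append({
            "id": entry_id,
            "prev": ids[i - 1] if i > 0 else None,
            "next": ids[i + 1] if i + 1 < len(ids) else None,
        })
    return links


known = ["some-title", "other-climb"]
first = match_id("Some Title", known)
second = match_id("Brand New Route", known)
chain = link_in_order([first, second])
```

Fuzzy string matching on titles alone will produce false matches for common route names, so a real version would likely also compare grade, length, or crag.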

Original comment by brendan....@gmail.com on 6 Mar 2010 at 1:03

GoogleCodeExporter commented 9 years ago

Original comment by brendan....@gmail.com on 8 Mar 2010 at 4:22

GoogleCodeExporter commented 9 years ago

Original comment by brendan....@gmail.com on 8 Mar 2010 at 4:24