kerel-fs / ogn-rdb

A parser for http://wiki.glidernet.org/list-of-receivers.
GNU Affero General Public License v3.0
5 stars 1 forks source link

Contacts are not detected correctly #28

Open pyrog opened 7 years ago

pyrog commented 7 years ago

Contacts containing an URL to a webpage are not displayed correctly

EDLI

Display:*http://www.segelflugverein-bielefeld.de/ Segelflugverein Bielefeld e.V. Should be: Segelflugverein Bielefeld e.V.

pyrog commented 7 years ago

Contact column could contain one or more email or link

GrEngersd

There are two contacts: Harry and SFC 3 Möven The href is broken and the contact name is not saved

"links": [
{
   "ref": "contact",
   "href": "http://www.3moeven.at/startseite/kontakte/9-harald-binder.html | Harry]]] [[[*http://www.3moeven.at "
}

Schrems

There are two contacts: one email Stefan and one URL LOAG

Only the last contact name is kept and both URL are missing.

"links": [],
"contact": "LOAG",

Others

LOAB, LOLK, LOIH, LOIJ

pyrog commented 7 years ago

Regular expression to parse URL and page link: http://regexr.com/3evcv