glossarist / iev-data

1 stars 1 forks source link

Rework notes and examples extraction #159

Closed skalee closed 3 years ago

skalee commented 3 years ago

Rewrite TermBuilder#split_definition from scratch. This method is responsible for splitting definition string coming from a spreadsheet into distinct definition, examples, and notes fields.

Definition splitting is now done with a straightforward and readable regular expression. It happens before any content conversions are applied, so that these conversions can be done in one place.

There are still a few known odd definitions which aren't split correctly. That will be reported and fixed. But anyway, this pull request is a significant step forward.

Fixes #154.