ageitgey / node-unfluff

Automatically extract body content (and other cool stuff) from an html document
Apache License 2.0

Extract author #48

Closed PetrKaleta closed 8 years ago

PetrKaleta commented 8 years ago

Find the first valid value (trimmed length in the interval 0..100) among the following, checked in order:

  1. meta[name="author"]
  2. [rel="author"]
  3. [class="author"], [class="writer"], [class="writtenby"]
  4. [id="author"], [id="writer"], [id="writtenby"]
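
The lookup order above could be sketched roughly as follows. This is only an illustration of the proposal, not unfluff's actual implementation: `findAuthor`, `isValidAuthor`, and the `lookup` callback are hypothetical names, and the sketch assumes "valid" means non-empty after trimming and at most 100 characters.

```javascript
// Selector groups in the priority order proposed above.
const AUTHOR_SELECTORS = [
  'meta[name="author"]',
  '[rel="author"]',
  '[class="author"], [class="writer"], [class="writtenby"]',
  '[id="author"], [id="writer"], [id="writtenby"]',
];

// Assumption: a value is valid when its trimmed length is 1..100.
function isValidAuthor(value) {
  if (typeof value !== "string") return false;
  const length = value.trim().length;
  return length > 0 && length <= 100;
}

// `lookup` is a hypothetical callback: given a selector string, it returns
// the raw text (or `content` attribute) of the first match, or null.
function findAuthor(lookup, selectors = AUTHOR_SELECTORS) {
  for (const selector of selectors) {
    const value = lookup(selector);
    if (isValidAuthor(value)) return value.trim();
  }
  return null; // no selector produced a valid value
}

// Usage with a stand-in for a parsed page (no real DOM involved).
const fakePage = {
  'meta[name="author"]': "   ",
  '[rel="author"]': "  Jane Doe  ",
};
const author = findAuthor((selector) => fakePage[selector] || null);
console.log(author);
```

Passing the DOM query in as a callback keeps the sketch independent of any particular HTML parser; in practice the callback would wrap whatever selector engine the library already uses.
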

snellingio commented 8 years ago

Looks like we're getting support in #49

ageitgey commented 8 years ago

Check out v1.0.0 and see if it meets your needs! :)