html-extractor Search Results

1000+ results
for html-extractor

Best match

Best match Most commented Newest Recently updated Least commented Oldest Least recently updated

linagora/james-project #5250

JAMES-4061 Html Text extractor needs to handle blockquote

CF https://issues.apache.org/jira/browse/JAMES-4061 - [ ] Handle `blockquote` into JsoupHtmlTextExtractor - [ ] Handle `a` into JsoupHtmlTextExtractor

chibenwa updated 2 weeks ago
1
tremor-rs/tremor-runtime #2072

Html extractor

**Describe the problem you are trying to solve** Exctract data from an html page. Lots of older sites with valuabke data dont have an api. Extracting html with a regex is possible but very inconveni…

happysalada updated 1 year ago
3
Sotera/DatawakeDepot #152

HTML/Text Extractor

We should have a simple extractor that pulls the HTML and extracted body text of a document.

bwhiteman updated 8 years ago
5
algolia/gatsby-plugin-algolia #137

Feature: Add the HTML extractor

Need to add the HTML extractor feature for creating separate records for each `p`, `li`, `td` and code tag. Could be customized through the `nodes_to_index` option.

naydav updated 3 years ago
4
ytdl-org/youtube-dl #26429

Generic Extractor downloads HTML file

- [x] I'm reporting a broken site support issue - [x] I've verified that I'm running youtube-dl version **2020.07.28** - [x] I've checked that all provided URLs are alive and playable in a browser …

RingoTheDog updated 4 years ago
1
KurtBestor/Hitomi-Downloader #7376

yesterday it was still working but today its tripping (nhent…

Invalid: [nhentai] https://nhentai.net/g/464415/ version: 4.1 (24-02-28 04:49:54 UTC) platform / locale: Windows-10-10.0.22621-SP0 / en_us order / group / uid: 0 / False / 34bf2ae21e6c42a59504531…

jaceelon updated 1 month ago
1
CivicDataLab/samantar_parsers #3

bulk extractor of pdfs to htmls

akki2825 updated 4 years ago
1
rubenv/angular-gettext-tools #78

Extractor issue: Double quotes in html attribute

Hi! If we'll try to process such html with `angular-gettext-tools`: ``` html ``` It will produce duplicate keys: - `"BlahBlahBlah"` - `"{{'BlahBlahBlah"` As you see, in html attribute we have si…

HarryBurns updated 9 years ago
4
yt-dlp/yt-dlp #10833

[RumbleChannel] rumble channel URLs broken as of today

### DO NOT REMOVE OR SKIP THE ISSUE TEMPLATE - [X] I understand that I will be **blocked** if I *intentionally* remove or skip any mandatory\* field ### Checklist - [X] I'm reporting that yt-…

xaeiougit updated 1 week ago
3
adbar/trafilatura #688

Javascript Version has landed. 🚀

I have translated all 21 files into javascript with npm libs like ``` "axios": "^0.21.1", "chardet": "^1.3.0", "cheerio": "^1.0.0-rc.10", "commander": "^8.0.0", "html-esca…

vtempest updated 1 week ago
3

上一页 1...1 2 3 4 5 6 7...100 下一页

1000+ results for html-extractor

1000+ results
for html-extractor