issues
search
extractus
/
article-extractor
To extract main article from given URL with Node.js
https://extractor-demos.pages.dev/article-extractor
MIT License
1.45k
stars
132
forks
source link
issues
Newest
Newest
Most commented
Recently updated
Oldest
Least commented
Least recently updated
Not getting any data when extracting from https://zenn.dev/ while if i use Mozilla reader mode, i am getting data.
#394
priyankaagrawal29
opened
5 hours ago
0
build: Use rollup to create standalone file
#393
RiverTwilight
opened
6 days ago
1
Is it possible to enable the Require() Import ?
#392
Philrobots
opened
1 month ago
4
v8.0.10
#391
ndaidong
closed
1 month ago
1
v8.0.9
#390
ndaidong
closed
1 month ago
1
Using Playwright or Pupperteer do not work for me with extractFromHtml()
#389
onigetoc
closed
1 month ago
1
img tag with "data" protocol was removed when doing purify
#388
djyde
closed
1 month ago
3
v8.0.8
#387
ndaidong
closed
2 months ago
1
Encoding windows-1250 not properly decoded!
#386
martinrotter
closed
2 months ago
4
v8.0.7
#385
ndaidong
closed
3 months ago
1
feat: integrate esbuild and support cjs
#384
XinwenCheng
closed
3 months ago
1
@extractus/article-extractor 8.0.6 isn't compatible with Google Cloud Functions
#383
XinwenCheng
closed
3 months ago
5
Bump sanitize-html from 2.11.0 to 2.12.1
#382
dependabot[bot]
closed
3 months ago
4
v8.0.6
#381
ndaidong
closed
4 months ago
1
Crashes on Pinterest and a lot of other websites
#380
koresar
closed
1 month ago
16
v8.0.5
#379
ndaidong
closed
5 months ago
1
Expected ',' or '}' after property value in JSON at position 543 (line 23 column 7)
#378
mirsella
closed
3 months ago
4
Update function to extract image, refined format
#377
jonaskahn
closed
5 months ago
1
Encountering errors while using library inside NodeJS + TS project
#376
ivkoandrv
closed
5 months ago
4
v8.0.8
#375
ndaidong
closed
6 months ago
1
Feat: extract pagetype from og:type or ld+json
#374
andremacola
closed
6 months ago
1
Feat: extract pagetype from og:type or ld+json
#373
andremacola
closed
6 months ago
3
Specific site work with deno but not node
#372
mirsella
opened
8 months ago
7
Can i use with utf 8 ?
#371
triay0
closed
8 months ago
1
v8.0.3
#370
ndaidong
closed
9 months ago
0
Fix ParserOptions typing
#369
ranmocy
closed
9 months ago
1
Node example works but deno don't on a specific site
#368
mirsella
closed
8 months ago
2
Incorrect resolution when there are multiple Open Graph tags
#367
SeriousBug
opened
9 months ago
3
Error [ERR_REQUIRE_ESM]: require() of ES Module >=8.0.2
#366
BertrandBev
closed
9 months ago
3
Some url do not work
#365
onigetoc
closed
9 months ago
2
Can't run using JEST
#364
avifatal
closed
10 months ago
3
Can't run the lib with J
#363
avifatal
closed
10 months ago
0
How to set the rule of extracting picture when the default extraction algorithm can't get it?
#362
MJRT
closed
10 months ago
1
v8.0.2
#361
ndaidong
closed
10 months ago
0
Error [ERR_REQUIRE_ESM]: require() of ES Module >=7.3.0
#360
castroCrea
closed
11 months ago
2
Crashing on start with npm run dev
#359
alazsengul
closed
11 months ago
6
v8.0.1
#358
ndaidong
closed
11 months ago
1
Got an error when extract vitalik's blog.
#357
daimajia
closed
9 months ago
9
Update README
#356
ndaidong
closed
11 months ago
1
v8.0.0 - Bump version
#355
ndaidong
closed
11 months ago
2
v7.3.0
#354
ndaidong
closed
11 months ago
1
v7.2.18
#353
ndaidong
closed
12 months ago
2
v7.2.17
#352
ndaidong
closed
1 year ago
2
v7.2.17
#351
ndaidong
closed
1 year ago
1
Add favicon to meta data
#350
LarchLiu
closed
1 year ago
5
A date like "<pubDate>Wed, 31 May 2023 13:33:19 +0000</pubDate>" in atom file will return ""
#349
Darmau
closed
1 year ago
0
v7.2.16
#348
ndaidong
closed
1 year ago
1
Preserve multiline spaces for code blocks
#347
victory-sokolov
closed
1 year ago
5
Preserve multiline spaces for code blocks
#346
victory-sokolov
closed
1 year ago
0
Deno
#345
Roosteridk
closed
1 month ago
2
Next