extractus/article-extractor
Extract the main article from a given URL with Node.js
https://extractor-demos.pages.dev/article-extractor
MIT License
1.6k stars
140 forks
issues
v8.0.16
#413
ndaidong
closed
2 weeks ago
0
TypeError: ldJson["@type"]?.toLowerCase is not a function
#412
mirsella
opened
3 weeks ago
0
v8.0.15
#411
ndaidong
closed
1 month ago
0
fix: adjustment of poorly formatted ldjson error
#410
andremacola
closed
1 month ago
0
feat: Try to grab the date from the URL before going to the secondary…
#409
andremacola
closed
1 month ago
2
v8.0.14
#408
ndaidong
closed
1 month ago
1
update the response result format
#407
mirsella
closed
1 month ago
9
v8.0.13
#406
ndaidong
closed
1 month ago
0
Improvements to find dates
#405
andremacola
closed
1 month ago
2
v8.0.12
#404
ndaidong
closed
1 month ago
0
Fix: Fatal error on empty @type string in ld+json verification.
#403
andremacola
closed
1 month ago
2
v8.0.11
#402
ndaidong
closed
1 month ago
0
v8.0.11
#401
ndaidong
closed
1 month ago
0
chore: Improvements in handling LD+JSON data
#400
andremacola
closed
1 month ago
2
Could not extract data on a page with text content longer than threshold
#399
JohnCido
closed
1 month ago
4
Images missing when 'img' tag under a 'picture' tag has no 'src' attribute
#398
WetHat
opened
2 months ago
0
Site with elementor theme/plugin doesn't work
#397
andremacola
closed
2 months ago
3
`extractFromHtml` missed an `<h1>` in the `content` json result.
#396
bryantwilliam
closed
2 weeks ago
1
extractFromHtml just returns `null`.
#395
bryantwilliam
closed
4 months ago
6
Not getting any data when extracting from https://zenn.dev/ while if i use Mozilla reader mode, i am getting data.
#394
priyankaagrawal29
opened
4 months ago
0
build: Use rollup to create standalone file
#393
RiverTwilight
opened
5 months ago
1
Is it possible to enable the Require() Import ?
#392
Philrobots
closed
1 month ago
5
v8.0.10
#391
ndaidong
closed
6 months ago
1
v8.0.9
#390
ndaidong
closed
6 months ago
1
Using Playwright or Pupperteer do not work for me with extractFromHtml()
#389
onigetoc
closed
6 months ago
1
img tag with "data" protocol was removed when doing purify
#388
djyde
closed
6 months ago
3
v8.0.8
#387
ndaidong
closed
7 months ago
1
Encoding windows-1250 not properly decoded!
#386
martinrotter
closed
7 months ago
4
v8.0.7
#385
ndaidong
closed
8 months ago
1
feat: integrate esbuild and support cjs
#384
XinwenCheng
closed
8 months ago
1
@extractus/article-extractor 8.0.6 isn't compatible with Google Cloud Functions
#383
XinwenCheng
closed
8 months ago
5
Bump sanitize-html from 2.11.0 to 2.12.1
#382
dependabot[bot]
closed
8 months ago
4
v8.0.6
#381
ndaidong
closed
9 months ago
1
Crashes on Pinterest and a lot of other websites
#380
koresar
closed
6 months ago
16
v8.0.5
#379
ndaidong
closed
10 months ago
1
Expected ',' or '}' after property value in JSON at position 543 (line 23 column 7)
#378
mirsella
closed
8 months ago
4
Update function to extract image, refined format
#377
jonaskahn
closed
10 months ago
1
Encountering errors while using library inside NodeJS + TS project
#376
ivkoandrv
closed
10 months ago
4
v8.0.8
#375
ndaidong
closed
11 months ago
1
Feat: extract pagetype from og:type or ld+json
#374
andremacola
closed
11 months ago
1
Feat: extract pagetype from og:type or ld+json
#373
andremacola
closed
11 months ago
3
Specific site work with deno but not node
#372
mirsella
opened
1 year ago
7
Can i use with utf 8 ?
#371
triay0
closed
1 year ago
1
v8.0.3
#370
ndaidong
closed
1 year ago
0
Fix ParserOptions typing
#369
ranmocy
closed
1 year ago
1
Node example works but deno don't on a specific site
#368
mirsella
closed
1 year ago
2
Incorrect resolution when there are multiple Open Graph tags
#367
SeriousBug
opened
1 year ago
3
Error [ERR_REQUIRE_ESM]: require() of ES Module >=8.0.2
#366
BertrandBev
closed
1 year ago
3
Some url do not work
#365
onigetoc
closed
1 year ago
2
Can't run using JEST
#364
avifatal
closed
1 year ago
3