-
### DO NOT REMOVE OR SKIP THE ISSUE TEMPLATE
- [X] I understand that I will be **blocked** if I *intentionally* remove or skip any mandatory\* field
### Checklist
- [X] I'm reporting that yt-dlp is…
-
有大神帮忙的吗?连续好几天了无法下载油管里的视频
![Uploading 1721614309131.png…]()
-
Regression?
I have a HTML file that contains a link like:
`Words`
I'm extracting with code that looks like this:
```
link_extractor = LinkExtractor(
restrict_xpaths=xpath)
tmp_links =…
-
```
raw_html = requests.get('https://vk.com/neurosciencenews').text
results = Extractor().extract(raw_html)
```
It does not return almost anything. Why it can be? It works great with other sites…
-
### Release version
v4.0.4
### Describe the bug
The extractor pipeline doesn't seem to export data for any authorization servers. Despite https://azure.github.io/apiops/apiops/3-apimTools/apiops-2…
-
> I know it doesn't work on heart attack mode (because there were just too many sounds to edit (would take too long using the currently available tools and limited automation available).
Despite th…
-
I need to extract article bodies from raw htmls. My code is as simple as:
```
for html in htmls:
extractor = Extractor(extractor='ArticleExtractor', html=article)
extractor.getHTML()
```
Aft…
-
Hey all!
I'd like to work on adding a new supported site. However, it's unclear to someone with my skill level how to do that.
I can write a web crawler, so am comfortable with using requests a…
-
Invalid: [pixiv] https://www.pixiv.net/users/7314141
version: 3.8a (23-01-06 05:25:55 UTC)
platform / locale: Windows-10-10.0.19041-SP0 / ko_kr
order / group / uid: -3341 / False / 58bbfe2c186d40…
-
I am trying to goose to read from .html files(specified url here for sake convenience in examples)[1]. But at times it's doesn't show any text. Please help me out here with the issue.
Goose version u…