news-websites Search Results

TACC/Core-CMS #876

How to Load Blog Articles Across Sites

# Goal Show TACC news articles [list] on LCCF. [Include pagination.] — [TUP-706](https://tacc-main.atlassian.net/browse/TUP-706). # Why - Show news, that is directly relevant to 2 sites, on b…

wesleyboar updated 2 weeks ago

ArchiveTeam/NewsGrabber #72

Australian news websites

There currently appears to be no coverage of Australian news websites. I really lack the time to make a PR to add these, but I've created a list in case there is interest in adding them. Most import…

rwoodpecker updated 8 years ago

quantuminformation/web-gen-bot #2

Node scrapper for news websites

this data will not be persisted, just an experiment for future use and feasibility

quantuminformation updated 5 years ago

ytdl-org/youtube-dl #15909

``` youtube-dl --verbose --no-mtime --restrict-filenames --no-part --verbose --age-limit 50 --get-filename -f 'worst[height>=360][ext*=mp4]/worst/bestvideo[height>=360][ext=mp4]' -o '%(title)s_fmt_%(…

sant527 updated 4 years ago

StevenBlack/hosts #2704

Add opindia.com to fake news list

The website has been demonstrating as largest fake news and propagandas against opposition and muslim in India As such, it needs to be added to the list. Source: https://en.wikipedia.org/wiki/OpInd…

iftakharGit updated 1 month ago

Cloudkibo/KiboPush #8024

News Aggregation work from source websites

This is a recurring task as here we will daily do the aggregation of news articles from source websites given in document. We should try to get at least daily 6 articles. https://docs.google.com/do…

sojharo updated 3 years ago

adbar/trafilatura #584

Removing related links at end of article/sidebar on news web…

Over here in the Media Cloud project we're seeing poor performance on the content extraction task for a variety of pages that include links to other "related" stories at the end of article content. Ou…

rahulbot updated 2 months ago

OpenPecha/tibetan-news-article-scraping #1

MT0026: Tibetan news article scraping

# Objective Develop scripts to efficiently scrape Tibetan news articles from multiple sources, starting with the Voice of Tibet (VOT) website, and store them in a structured format for training a mach…

tenzinchoedon updated 4 weeks ago

net4people/bbs #107

Blocking of news websites in Russia

Hi, it seems that the authorities in Russia have blocked BBC and German station DW a couple of hours ago. We are witnessing a drop in traffic. I had a cron job running a simple curl of bbc.com usin…

abdallahalsalmi updated 2 years ago

OpenPecha/tibetan-news-article-scraping #4

PMA0009: Scraping Tibetan creative writing websites(MM24)

### Description: We have several websites containing Tibetan literature data that need to be scraped to gather as much valuable information as possible for training our LLM. The task involves not only…

uchihatashi updated 4 days ago

1000+ results
for news-websites