bibanon / tubeup

Use yt-dlp to download video and upload to the Internet Archive with metadata.
https://pypi.python.org/pypi/tubeup/
GNU General Public License v3.0
410 stars 71 forks source link

Censorship on this project #115

Closed nihelmasell closed 4 years ago

nihelmasell commented 4 years ago

Well, we often make claims about censorship on media and some type of forums (the reason for archiving) and now I see the same actions are performed here. I just asked how can I adapt my 20tb YouTube archive to be uploaded to the IA, in case it is deleted (as it often is on YouTube). First, the issue was closed, and know I cant even take part of it. I know your gonna delete this message, but want to show your contradictions. Goodbye.

antonizoon commented 4 years ago

Understand that uploaders don't pay a cent to upload to the Internet Archive, yet the Internet Archive has to hold the bag on storing 20TB of data forever along with handling all the possible DMCA or other litigation that took it down from YouTube in the first place.

Quite frankly, a practically infinite data source is not going to fit on a finite server system and limited funding.

I feel like this isn't even going to be helpful for preserving the content, given that the Internet Archive is under the exact same legal jurisdiction as YouTube, they may easily be required to remove it by litigation or just by their own housekeeping: I strongly suggest that if you want this data to be preserved, you backup privately and share p2p, or host the data yourself on your own servers in a jurisdiction you trust to not comply with these rules. I would suggest LTO5 tape as a relatively cheap and durable option.

The problem is some channels have 30,000 videos, which overpopulate my folders,

The limitations you are encountering on mirroring 30,000/20TB videos are not set by us, it is set by the Internet Archive which has to pay to hold that much data. Have you ever thought about how much data that is and how much it costs to make that available to the public to access the way they self host it with redundant storage with tape backup? Our script is already under restrictions by the Internet Archive due to excessive and thoughtless upload, we cannot provide any further countermeasures because it is not our right to do so.

nihelmasell commented 4 years ago

1) I never said I wanted to backup ALL 20tb to the archive. 2) I wanted to skip re downloading the material I already have. That’s all. If I wanted to upload 20tb I still could. It would just take more time. 3) People upload lots of stuff to the archive. Eg: webcam feeds 24/365 for hundreds of cities, Blu-ray/ 4K webrips of movies (sometimes reaching 50tb each), Flickr and Instagram backups, complete websites, etc. I think it’s up to them to ban me if they want to.