bellingcat / auto-archiver

Automatically archive links to videos, images, and social media content from Google Sheets (and more).
https://pypi.org/project/auto-archiver/
MIT License
489 stars 53 forks source link

"failed: no archiver" in google sheet although can download the screenshot. #73

Closed choonnam closed 1 year ago

choonnam commented 1 year ago

I attempted to set up the auto-archiver by following this instructional video (https://www.youtube.com/watch?v=VfAhcuV2tLQ).

Initially, the code was running and the archive status in the Google sheet showed "Archive in progress," but at the end, it displayed "failed: no archiver".

The logs indicate that I have successfully scraped some data. I also downloaded some screenshots (YouTube and video), and the YouTube video in webm format. However, I am unsure why the data cannot update in the google Sheet.

Also, is it necessary to utilize the browsertrix-crawler? I have downloaded the Docker desktop, and my machine can run the browsertrix-crawler, but the error persists."

The error messages are as below: ERROR | main:process_sheet:138 - Got unexpected error in row 2 with twitter for url='https://twitter.com/anwaribrahim/status/1642750503422685187?cxt=HHwWhsDTsaK4nMwtAAAA': [Errno 2] No such file or directory: '/Users/usr/Documents/python/archiver/browsertrix/crawls/profile.tar.gz' Traceback (most recent call last): File "/Users/usr/Documents/python/archiver/auto_archive.py", line 133, in process_sheet result = archiver.download(url, check_if_exists=c.check_if_exists) File "/Users/usr/Documents/python/archiver/archivers/twitter_archiver.py", line 42, in download wacz = self.get_wacz(url) File "/Users/usr/Documents/python/archiver/archivers/base_archiver.py", line 234, in get_wacz shutil.copyfile(self.browsertrix.profile, os.path.join(browsertrix_home, "profile.tar.gz")) File "/Library/Developer/CommandLineTools/Library/Frameworks/Python3.framework/Versions/3.9/lib/python3.9/shutil.py", line 264, in copyfile with open(src, 'rb') as fsrc, open(dst, 'wb') as fdst: FileNotFoundError: [Errno 2] No such file or directory: '/Users/usr/Documents/python/archiver/browsertrix/crawls/profile.tar.gz'

2023-04-13 14:47:20.800 | SUCCESS | main:process_sheet:167 - Finished worksheet Sheet1