Benhoro closed 3 years ago
The issue here could be that you don't have enough space on disk to store the results. What available disk space are you working with?
Also, note that there is a known issue with the company scraper right now (#78), so even if you solve the disk space issue, you will likely run into that problem as well.
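For anyone unsure how to answer the disk space question, here is a minimal sketch using only the Python standard library. The helper name `free_gib` is made up for illustration, and the path `"."` assumes the output file is written to the current working directory.

```python
import shutil


def free_gib(path="."):
    """Return the free space, in GiB, on the filesystem that contains `path`."""
    usage = shutil.disk_usage(path)  # named tuple: total, used, free (in bytes)
    return usage.free / 2**30


print(f"Free space: {free_gib():.1f} GiB")
```

Run it from the directory where `companies.json` would be written, or pass that directory as `path`.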
Closing for lack of response. Feel free to re-open if this is still a problem.
I got the error below when I tried to run the following code:

    from scrape_linkedin import scrape_in_parallel, CompanyScraper

    companies = ['facebook', 'google', 'amazon', 'microsoft']

    # Scrape all companies, output to 'companies.json' file, use 4 browser instances
    scrape_in_parallel(
        scraper_type=CompanyScraper(cookie='AQEDAQS9deUF3aCgAAABc3XQzd4AAAFzmd1R3lYA0tP3bkRPMfs9CnXLRduXshYHDto8gGFV4BMhzRvRdMiuQ1HVCTQ7isAQmOYX3uUnFh1RxGmUSDWCSLH9VAh03SvukDj6JJh98by1F9PMf6gIHvj5', timeout=100),
        items=companies,
        output_file="companies.json",
        num_instances=4
    )
    _RemoteTraceback Traceback (most recent call last)
    _RemoteTraceback:
    """
    Traceback (most recent call last):
      File "C:\Users\HORO BEN\anaconda3\lib\site-packages\joblib\externals\loky\backend\queues.py", line 150, in feed
        obj = dumps(obj, reducers=reducers)
      File "C:\Users\HORO BEN\anaconda3\lib\site-packages\joblib\externals\loky\backend\reduction.py", line 247, in dumps
        dump(obj, buf, reducers=reducers, protocol=protocol)
      File "C:\Users\HORO BEN\anaconda3\lib\site-packages\joblib\externals\loky\backend\reduction.py", line 240, in dump
        _LokyPickler(file, reducers=reducers, protocol=protocol).dump(obj)
      File "C:\Users\HORO BEN\anaconda3\lib\site-packages\joblib\externals\cloudpickle\cloudpickle.py", line 482, in dump
        return Pickler.dump(self, obj)
      File "C:\Users\HORO BEN\anaconda3\lib\pickle.py", line 437, in dump
        self.save(obj)
      File "C:\Users\HORO BEN\anaconda3\lib\pickle.py", line 549, in save
        self.save_reduce(obj=obj, *rv)
      File "C:\Users\HORO BEN\anaconda3\lib\pickle.py", line 662, in save_reduce
        save(state)
      File "C:\Users\HORO BEN\anaconda3\lib\pickle.py", line 504, in save
        f(self, obj) # Call unbound method with explicit self
      File "C:\Users\HORO BEN\anaconda3\lib\pickle.py", line 859, in save_dict
        self._batch_setitems(obj.items())
      File "C:\Users\HORO BEN\anaconda3\lib\pickle.py", line 885, in _batch_setitems
        save(v)
      File "C:\Users\HORO BEN\anaconda3\lib\pickle.py", line 549, in save
        self.save_reduce(obj=obj, *rv)
      File "C:\Users\HORO BEN\anaconda3\lib\pickle.py", line 662, in save_reduce
        save(state)
      File "C:\Users\HORO BEN\anaconda3\lib\pickle.py", line 504, in save
        f(self, obj) # Call unbound method with explicit self
      File "C:\Users\HORO BEN\anaconda3\lib\pickle.py", line 859, in save_dict
        self._batch_setitems(obj.items())
      File "C:\Users\HORO BEN\anaconda3\lib\pickle.py", line 890, in _batch_setitems
        save(v)
      File "C:\Users\HORO BEN\anaconda3\lib\pickle.py", line 549, in save
        self.save_reduce(obj=obj, *rv)
      File "C:\Users\HORO BEN\anaconda3\lib\pickle.py", line 662, in save_reduce
        save(state)
      File "C:\Users\HORO BEN\anaconda3\lib\pickle.py", line 504, in save
        f(self, obj) # Call unbound method with explicit self
      File "C:\Users\HORO BEN\anaconda3\lib\pickle.py", line 859, in save_dict
        self._batch_setitems(obj.items())
      File "C:\Users\HORO BEN\anaconda3\lib\pickle.py", line 885, in _batch_setitems
        save(v)
      File "C:\Users\HORO BEN\anaconda3\lib\pickle.py", line 504, in save
        f(self, obj) # Call unbound method with explicit self
      File "C:\Users\HORO BEN\anaconda3\lib\pickle.py", line 819, in save_list
        self._batch_appends(obj)
      File "C:\Users\HORO BEN\anaconda3\lib\pickle.py", line 846, in _batch_appends
        save(tmp[0])
      File "C:\Users\HORO BEN\anaconda3\lib\pickle.py", line 504, in save
        f(self, obj) # Call unbound method with explicit self
      File "C:\Users\HORO BEN\anaconda3\lib\pickle.py", line 774, in save_tuple
        save(element)
      File "C:\Users\HORO BEN\anaconda3\lib\pickle.py", line 504, in save
        f(self, obj) # Call unbound method with explicit self
      File "C:\Users\HORO BEN\anaconda3\lib\pickle.py", line 859, in save_dict
        self._batch_setitems(obj.items())
      File "C:\Users\HORO BEN\anaconda3\lib\pickle.py", line 885, in _batch_setitems
        save(v)
      File "C:\Users\HORO BEN\anaconda3\lib\pickle.py", line 549, in save
        self.save_reduce(obj=obj, *rv)
      File "C:\Users\HORO BEN\anaconda3\lib\pickle.py", line 662, in save_reduce
        save(state)
      File "C:\Users\HORO BEN\anaconda3\lib\pickle.py", line 504, in save
        f(self, obj) # Call unbound method with explicit self
      File "C:\Users\HORO BEN\anaconda3\lib\pickle.py", line 859, in save_dict
        self._batch_setitems(obj.items())
      File "C:\Users\HORO BEN\anaconda3\lib\pickle.py", line 885, in _batch_setitems
        save(v)
      File "C:\Users\HORO BEN\anaconda3\lib\pickle.py", line 549, in save
        self.save_reduce(obj=obj, *rv)
      File "C:\Users\HORO BEN\anaconda3\lib\pickle.py", line 662, in save_reduce
        save(state)
      File "C:\Users\HORO BEN\anaconda3\lib\pickle.py", line 504, in save
        f(self, obj) # Call unbound method with explicit self
      File "C:\Users\HORO BEN\anaconda3\lib\pickle.py", line 859, in save_dict
        self._batch_setitems(obj.items())
      File "C:\Users\HORO BEN\anaconda3\lib\pickle.py", line 885, in _batch_setitems
        save(v)
      File "C:\Users\HORO BEN\anaconda3\lib\pickle.py", line 549, in save
        self.save_reduce(obj=obj, *rv)
      File "C:\Users\HORO BEN\anaconda3\lib\pickle.py", line 662, in save_reduce
        save(state)
      File "C:\Users\HORO BEN\anaconda3\lib\pickle.py", line 504, in save
        f(self, obj) # Call unbound method with explicit self
      File "C:\Users\HORO BEN\anaconda3\lib\pickle.py", line 859, in save_dict
        self._batch_setitems(obj.items())
      File "C:\Users\HORO BEN\anaconda3\lib\pickle.py", line 885, in _batch_setitems
        save(v)
      File "C:\Users\HORO BEN\anaconda3\lib\pickle.py", line 549, in save
        self.save_reduce(obj=obj, *rv)
      File "C:\Users\HORO BEN\anaconda3\lib\pickle.py", line 662, in save_reduce
        save(state)
      File "C:\Users\HORO BEN\anaconda3\lib\pickle.py", line 504, in save
        f(self, obj) # Call unbound method with explicit self
      File "C:\Users\HORO BEN\anaconda3\lib\pickle.py", line 859, in save_dict
        self._batch_setitems(obj.items())
      File "C:\Users\HORO BEN\anaconda3\lib\pickle.py", line 885, in _batch_setitems
        save(v)
      File "C:\Users\HORO BEN\anaconda3\lib\pickle.py", line 524, in save
        rv = reduce(self.proto)
    TypeError: can't pickle _thread.lock objects
    """
The above exception was the direct cause of the following exception:
PicklingError Traceback (most recent call last)
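A note on the `TypeError: can't pickle _thread.lock objects` that ends the remote traceback: the frames show joblib's loky backend pickling an object to hand it to a worker process, then descending through nested dicts until it reaches a thread lock. That suggests, though nothing in this thread confirms it, that the already-constructed `CompanyScraper` instance passed as `scraper_type` holds a lock somewhere in its state. The sketch below reproduces the same class of failure with the standard library only. `is_picklable`, `settings`, and `live_state` are hypothetical names for illustration and are not part of scrape_linkedin.

```python
import pickle
import threading


def is_picklable(obj):
    """Return True if pickle.dumps(obj) succeeds, False if pickling fails."""
    try:
        pickle.dumps(obj)
        return True
    except (TypeError, AttributeError, pickle.PicklingError):
        return False


# Plain settings (strings, numbers) can be sent to a worker process.
settings = {"cookie": "<your cookie>", "timeout": 100}

# Hypothetical stand-in for the state of a live object: same settings,
# plus a thread lock buried inside a nested dict.
live_state = {"settings": settings, "internals": {"lock": threading.Lock()}}

print(is_picklable(settings))    # True
print(is_picklable(live_state))  # False: the lock cannot be pickled
```

A common way around this kind of error is to send only plain settings to the workers and build the heavy object inside each worker. Whether `scrape_in_parallel` supports that is not established here.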