attardi / wikiextractor

A tool for extracting plain text from Wikipedia dumps
GNU Affero General Public License v3.0
3.74k stars 965 forks source link

OSError: [WinError 87] 参数错误。 #262

Open fengyunzaidushi opened 3 years ago

fengyunzaidushi commented 3 years ago

what's this problem? thank you! environment:win10 python3.6.2

E:\yuliao_xunlian\wikiextractor2\wikiextractor>python wikiextractor/Wikiextractor.py -o extracted_xml --json -c zhwiki-20210301-pages-articles-multistream1.xml-p1p187712.bz2 INFO: Preprocessing 'zhwiki-20210301-pages-articles-multistream1.xml-p1p187712.bz2' to collect template definitions: this may take some time. INFO: Loaded 2349 templates in 28.7s INFO: Starting page extraction from zhwiki-20210301-pages-articles-multistream1.xml-p1p187712.bz2. Traceback (most recent call last): File "wikiextractor/Wikiextractor.py", line 641, in main() File "wikiextractor/Wikiextractor.py", line 637, in main args.compress, args.processes, args.html_safe) File "wikiextractor/Wikiextractor.py", line 364, in process_dump reduce.start() File "C:\Users\Administrator\AppData\Local\Programs\Python\Python36\lib\multiprocessing\process.py", line 105, in start self._popen = self._Popen(self) File "C:\Users\Administrator\AppData\Local\Programs\Python\Python36\lib\multiprocessing\context.py", line 223, in _Popen return _default_context.get_context().Process._Popen(process_obj) File "C:\Users\Administrator\AppData\Local\Programs\Python\Python36\lib\multiprocessing\context.py", line 322, in _Popen return Popen(process_obj) File "C:\Users\Administrator\AppData\Local\Programs\Python\Python36\lib\multiprocessing\popen_spawn_win32.py", line 65, in init reduction.dump(process_obj, to_child) File "C:\Users\Administrator\AppData\Local\Programs\Python\Python36\lib\multiprocessing\reduction.py", line 60, in dump ForkingPickler(file, protocol).dump(obj) TypeError: can't pickle _thread.RLock objects

E:\yuliao_xunlian\wikiextractor2\wikiextractor>Traceback (most recent call last): File "", line 1, in File "C:\Users\Administrator\AppData\Local\Programs\Python\Python36\lib\multiprocessing\spawn.py", line 99, in spawn_main new_handle = reduction.steal_handle(parent_pid, pipe_handle) File "C:\Users\Administrator\AppData\Local\Programs\Python\Python36\lib\multiprocessing\reduction.py", line 82, in steal_handle _winapi.PROCESS_DUP_HANDLE, False, source_pid) OSError: [WinError 87] 参数错误。