facebookresearch / TaBERT

This repository contains source code for the TaBERT model, a pre-trained language model for learning joint representations of natural language utterances and (semi-)structured tables for semantic parsing. TaBERT is pre-trained on a massive corpus of 26M Web tables and their associated natural language context, and can be used as a drop-in replacement for a semantic parser's original encoder to compute representations for utterances and table schemas (columns).

404 Not Found Error on "extract_wiki_tables.sh" #24

Open canqin001 opened 3 years ago

canqin001 commented 3 years ago

Hi, TaBERT is quite inspiring work. When I try to download the wiki data using "extract_wiki_tables.sh", I get the following error:

--2021-06-28 21:10:32-- https://dumps.wikimedia.org/enwiki/20200901/enwiki-20200901-pages-articles-multistream1.xml-p1p30303.bz2
Resolving dumps.wikimedia.org (dumps.wikimedia.org)... 208.80.154.7, 2620:0:861:1:208:80:154:7
Connecting to dumps.wikimedia.org (dumps.wikimedia.org)|208.80.154.7|:443... connected.
HTTP request sent, awaiting response... 404 Not Found
2021-06-28 21:10:32 ERROR 404: Not Found.

Do you have any ideas about how to fix it? Thank you very much!

Regards,

canqin001 commented 3 years ago

It seems that "https://dumps.wikimedia.org/enwiki/20200901" does not exist.
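One way to confirm this is to rebuild the URL from the log for a given dump date and send a HEAD request before starting the full download. The snippet below is only a minimal sketch: the helper names `dump_url` and `url_exists` are hypothetical and are not part of the TaBERT repository, and the URL pattern is inferred solely from the wget log above. The page-range suffix (`p1p30303`) may differ between dumps, so changing only the date is not guaranteed to produce a valid file name.

```python
import urllib.error
import urllib.request

# Base path taken from the failing URL in the wget log above.
BASE = "https://dumps.wikimedia.org/enwiki"


def dump_url(date: str, part: str = "multistream1.xml-p1p30303") -> str:
    """Build a dump file URL for a date such as '20200901'.

    The default `part` is copied from the log; it is an assumption that
    other dump dates use the same page range in the file name.
    """
    return f"{BASE}/{date}/enwiki-{date}-pages-articles-{part}.bz2"


def url_exists(url: str, timeout: float = 10.0) -> bool:
    """Return True if a HEAD request to `url` answers with HTTP 200.

    Any HTTP error (such as 404) or network failure is reported as False,
    so False does not always mean the file is missing.
    """
    request = urllib.request.Request(url, method="HEAD")
    try:
        with urllib.request.urlopen(request, timeout=timeout) as response:
            return response.status == 200
    except urllib.error.URLError:
        # HTTPError is a subclass of URLError, so 404 responses land here too.
        return False
```

Calling `url_exists(dump_url("20200901"))` should reproduce the 404 as a `False` result. To try another dump, pass a date that appears in the directory listing at https://dumps.wikimedia.org/enwiki/ and check the file names there before editing the script.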