This repository contains source code for the TaBERT model, a pre-trained language model for learning joint representations of natural language utterances and (semi-)structured tables for semantic parsing. TaBERT is pre-trained on a massive corpus of 26M Web tables and their associated natural language context, and could be used as a drop-in replacement of a semantic parsers original encoder to compute representations for utterances and table schemas (columns).
Other
580
stars
63
forks
source link
404 Not Found Error on "extract_wiki_tables.sh" #24
Hi, TaBERT is a quite inspiring work. When I try to download the wiki data using "extract_wiki_tables.sh", there is an error as:
--2021-06-28 21:10:32-- https://dumps.wikimedia.org/enwiki/20200901/enwiki-20200901-pages-articles-multistream1.xml-p1p30303.bz2 Resolving dumps.wikimedia.org (dumps.wikimedia.org)... 208.80.154.7, 2620:0:861:1:208:80:154:7 Connecting to dumps.wikimedia.org (dumps.wikimedia.org)|208.80.154.7|:443... connected. HTTP request sent, awaiting response... 404 Not Found 2021-06-28 21:10:32 ERROR 404: Not Found.
Do you have any ideas about how to fix it? Thank you very much!
Regards,