opensemanticsearch / open-semantic-etl

Python based Open Source ETL tools for file crawling, document processing (text extraction, OCR), content analysis (Entity Extraction & Named Entity Recognition) & data enrichment (annotation) pipelines & ingestor to Solr or Elastic search index & linked data graph database
https://opensemanticsearch.org/etl
GNU General Public License v3.0
254 stars 69 forks source link

Twitter scraper #101

Closed opensemanticsearch closed 4 years ago

opensemanticsearch commented 4 years ago

Open Semantic ETL module for import of Tweets by Twint https://github.com/twintproject/twint

Mandalka commented 4 years ago

Implemented optional indexing of linked web pages in tweets https://github.com/opensemanticsearch/open-semantic-search/issues/238

Mandalka commented 4 years ago

Implemented by https://github.com/opensemanticsearch/open-semantic-etl/blob/master/src/opensemanticetl/etl_twitter_scraper.py