Solin1998 / SearchTT

Twitter crawler based on python
MIT License
169 stars 90 forks source link

功能:

用于访问和下载某人的全部推特,以及大范围的推特爬取,如某个地区,某种语言或包含某个关键字的的推特内容 如:爬取推特在2019-5-20到2019-6-20 出现过 “比特币”字段的所有推特信息

Used to access and download all Twitter of a person, as well as a wide range of Twitter crawls, such as a certain region, a certain language, or a Twitter content containing a certain keyword For example: Crawling Twitter appeared on May 20, 2019 to June 20, 2019, all the Twitter information of "Bitcoin" subdivision

介绍:

有研究过推特自带的开发者的API,一般只能抓取到7天以为的推特,而且同一个开发者账号爬虫等会有时间内会有限制,通过改写一个开源项目抓取某个人注册以来发送的的所有推特,原理是通过不停的模拟浏览器滑动获取json文件,当然过程中并没有用到类似于selenium一类的中间件

This is done by rewriting an open source project to grab all the tweets someone has sent since signing up, by swiping through a simulated browser to get the json file, of course Middleware like selenium is not used in the process

安装:

在pycharm开发环境或者linux系统终端下输入以下指令:

因为某些原因该项目已经不提供使用,抱歉

Enter the following command in the pycharm development environment or linux system terminal:

This item is no longer available for some reasons, sorry