-
# Notary Application
To apply as a notary, please fill out the following form.
## Core Information
- Name:David
- Affiliated Organization: Diancun Tech
- Website / Social Media:https://dianc…
-
仅 掘金 发布成功
**头条(已登录状态)的cookie获取不到**
``` shell
app_1 | [2021-01-14T04:50:14.004] [INFO] default - Publish task started
app_1 | [Function: JuejinSpider]
app_1 | [PCR] Chromium revision:…
-
# Notary Application
To apply as a notary, please fill out the following form.
## Core Information
- Name: Luo Lei
- Affiliated Organization: University of Electronic Science and Technology…
-
17-08-22 19:15:02,914 INFO us.codecraft.webmagic.downloader.HttpClientDownloader(HttpClientDownloader.java:88) ## downloading page success https://www.zhihu.com/question/33452102
17-08-22 19:15:02,9…
-
对于http://www.zhihu.com/question/22434291
这个页面中的答案作者名字,使用这个spider去爬的时候发现无法抓取到作者名字。甚至作者名字前面那个里面的href也是空的。
-
-
## 目的:
从知乎的一个问题开始爬取其中所有的回答.我选了[贫穷会对人的身心造成多大的影响?](https://www.zhihu.com/question/30603447)问题.
## 实现:
> ```
> public static void main(String[] args)
> {
> //test是实现了PageProcessor的类
> …
-
-
您好是这样的,打扰您工作十分抱歉。
我在processor中定制了一个抽取逻辑如下,
将爬取的数据存到了userDetailInfo对象,但不知道为什么报空指针异常。明明控制台打印的确实有值
代码清单:
UserBaseInfoProcessor.java
```
package com.complone.zhihumagic.processor;
import com.co…
-
https://telegra.ph/%E6%89%93%E5%B7%A5%E4%BA%BA%E9%80%9F%E9%80%9F%E9%9B%86%E7%BB%93%E4%B8%80%E8%B5%B7%E6%8A%95%E5%87%BA2020%E5%B9%B4%E5%BA%A6%E5%8D%81%E5%A4%A7%E9%BB%91%E5%BF%83%E4%BC%81%E4%B8%9A-12-22