cxcxxin / urap_tech_sp16

issue tracking
1 stars 1 forks source link

scrape search results w php code using dictionary dict_700d_750d_Juexiao_Feb252016 #18

Open cxcxxin opened 8 years ago

cxcxxin commented 8 years ago

dictionary is in the group_heads folder. please search for it in bdrive

cxcxxin commented 8 years ago

did you use the index_temporary when you run the program to name the output or other index in the dict?. Also how many pages did you set? Only page 1 right?

suyanglu commented 8 years ago

No I did not use index_temporary, rather I use search key word. Yes only page 1

2016年3月1日星期二,cxcxxin notifications@github.com 写道:

did you use the index_temporary when you run the program to name the output or other index in the dict?. Also how many pages did you set? Only page 1 right?

— Reply to this email directly or view it on GitHub https://github.com/cxcxxin/urap_tech_sp16/issues/18#issuecomment-190805066 .

cxcxxin commented 8 years ago

once you finish please save to Dropbox\urap_programming\all_data\srp and name it srp_0229_page1

suyanglu commented 8 years ago

I think we send a code to process the raw data result, because all data are in one string

cxcxxin commented 8 years ago

@suyanglu
please scrape first five pages of search results (should equal tohow many zemin got last time, 5?) for all ind_impt != 1 those unimportant dict items https://docs.google.com/spreadsheets/d/1VGPZGA8wPQo_y3Pbvrmd7IbtAFPQW9-P2a2LLy2lZyU/edit#gid=0

Please save to all_data srp folder w same naming rule as before (also append _unimptonly this time at the end) andlet @yiminfu know

suyanglu commented 8 years ago

Done