Open cxcxxin opened 8 years ago
did you use the index_temporary when you run the program to name the output or other index in the dict?. Also how many pages did you set? Only page 1 right?
No I did not use index_temporary, rather I use search key word. Yes only page 1
2016年3月1日星期二,cxcxxin notifications@github.com 写道:
did you use the index_temporary when you run the program to name the output or other index in the dict?. Also how many pages did you set? Only page 1 right?
— Reply to this email directly or view it on GitHub https://github.com/cxcxxin/urap_tech_sp16/issues/18#issuecomment-190805066 .
once you finish please save to Dropbox\urap_programming\all_data\srp and name it srp_0229_page1
I think we send a code to process the raw data result, because all data are in one string
@suyanglu
please scrape first five pages of search results (should equal tohow many zemin got last time, 5?) for all ind_impt != 1 those unimportant dict items
https://docs.google.com/spreadsheets/d/1VGPZGA8wPQo_y3Pbvrmd7IbtAFPQW9-P2a2LLy2lZyU/edit#gid=0
Please save to all_data srp folder w same naming rule as before (also append _unimptonly this time at the end) andlet @yiminfu know
Done
dictionary is in the group_heads folder. please search for it in bdrive