chankoo / BOAZ-projects

BOAZ Adv

to be added #4

Open chankoo opened 5 years ago

chankoo commented 5 years ago

https://subicura.com/2017/01/19/docker-guide-for-beginners-2.html

chankoo commented 5 years ago

Installing Docker on Ubuntu 18.04

https://blog.cosmosfarm.com/archives/248/%EC%9A%B0%EB%B6%84%ED%88%AC-18-04-%EB%8F%84%EC%BB%A4-docker-%EC%84%A4%EC%B9%98-%EB%B0%A9%EB%B2%95/

chankoo commented 5 years ago

[Screenshots: 2018-10-23 23-18-40, 2018-10-23 23-20-00, 2018-10-23 23-25-49, 2018-10-23 23-27-43]

chankoo commented 5 years ago

[Screenshot: 2018-10-24 00-14-29]

chankoo commented 5 years ago

[Screenshots: 2018-10-24 00-23-48, 2018-10-24 00-25-06]

chankoo commented 5 years ago

[Screenshot: 2018-10-31 23-36-46]

chankoo commented 5 years ago

[Screenshot: 2018-10-31 23-59-08]

chankoo commented 5 years ago

[Screenshot: 2018-11-01 00-23-57]

chankoo commented 5 years ago

http://blog.naver.com/PostView.nhn?blogId=jjy0501&logNo=221306926499

chankoo commented 5 years ago

Sohyun, look at this!! http://blog.naver.com/PostView.nhn?blogId=jjy0501&logNo=221306926499

chankoo commented 5 years ago

Sohyun, look at this (2)

import glob
import json
import os
import traceback

from tqdm import tqdm

# listing_ids, review_jsn, except_cnt, get_review_count and get_reviews
# are defined earlier in the script and are not shown in this comment.

if __name__ == '__main__':
    # Find try_idx, the index to resume from
    try_idx = 0
    print(os.getcwd())  # make sure this is './aibnb-data'
    flist = glob.glob('*_seoul.json')  # only files ending in _seoul.json
    if flist:
        try_idx = len(flist) * 100  # each saved file covers 100 listings
    now_try_idx = try_idx  # tracks how far the loop has progressed

    # Start the loop
    listing_ids_loop = listing_ids[try_idx:]
    for listing_id in tqdm(listing_ids_loop):
        retry = 2
        while retry > 0:
            try:
                review_count = get_review_count(listing_id)
                get_reviews(listing_id, review_count)
                print('For this iter')
                print('# of review:', review_count)
                print('# of review stored:', len(review_jsn.get(listing_id, [])))
                break
            except Exception as e:
                except_cnt += 1
                print(e.args)
                traceback.print_exc()  # e.with_traceback only printed the method object
                retry -= 1
                review_jsn.pop(listing_id, None)  # drop partial results
                if retry == 0:
                    print("listing_id {} not stored".format(listing_id))
        now_try_idx += 1  # count each listing once, not each attempt

        if now_try_idx % 100 == 0:  # save in chunks of 100 listings
            fname = 'review_{}_to_{}_seoul.json'.format(try_idx, now_try_idx - 1)
            # An explicit UTF-8 encoding typically avoids the UnicodeEncodeError
            # raised when the locale's default encoding cannot represent the text.
            with open(os.path.join(os.getcwd(), fname), 'w', encoding='utf-8') as fp:
                json.dump(review_jsn, fp, ensure_ascii=False)  # flush
            review_jsn = {}  # reset
            try_idx = now_try_idx

    # Final flush of whatever is left in the dict
    if review_jsn:
        fname = 'review_{}_to_{}_seoul.json'.format(try_idx, now_try_idx - 1)
        with open(os.path.join(os.getcwd(), fname), 'w', encoding='utf-8') as fp:
            json.dump(review_jsn, fp, ensure_ascii=False)  # flush
        review_jsn = {}  # reset
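Computing the resume point as `len(flist)*100` only holds when every saved file covers exactly 100 listings, so a shorter final file throws it off. As a rough sketch of an alternative, the resume index could be read back from the file names themselves. This assumes the names follow the `review_<start>_to_<end>_seoul.json` pattern used above and that `<end>` is the index of the last listing stored in that file (inclusive). `next_start_index` is a hypothetical helper, not part of the original script.

```python
import re

# Matches names such as 'review_0_to_99_seoul.json'; group 2 is the end index.
CHUNK_NAME = re.compile(r'^review_(\d+)_to_(\d+)_seoul\.json$')


def next_start_index(filenames):
    """Return the index of the first listing not covered by any saved chunk.

    Assumes the end index in each file name is inclusive (hypothetical
    convention; adjust if the script names its files differently).
    """
    last_saved = -1
    for name in filenames:
        match = CHUNK_NAME.match(name)
        if match:
            last_saved = max(last_saved, int(match.group(2)))
    return last_saved + 1


if __name__ == '__main__':
    saved = ['review_0_to_99_seoul.json', 'review_100_to_149_seoul.json']
    print(next_start_index(saved))  # 150
```

In the script this could replace the `len(flist)*100` line, for example `try_idx = next_start_index(glob.glob('*_seoul.json'))`, provided the script is run from the data directory so the glob returns bare file names.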
