ISG-ICS / cloudberry

Big Data Visualization
http://cloudberry.ics.uci.edu

Make a Korea version of Twittermap #543

Closed GaeulG closed 5 years ago

GaeulG commented 6 years ago

Overview

Progress

Match format

1. Convert polygon coordinates from UTM-K to WGS84

Before we start development, we need the Korean administrative district data file. You can download it here: gisdeveloper

Cloudberry's polygons use WGS84 coordinates, that is, longitude and latitude, but the Korean data uses UTM-K. The following code converts UTM-K to WGS84:

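from pyproj import Proj, Transformer

# Target: WGS84 geographic coordinates (longitude, latitude).
WGS84 = {'proj':'latlong', 'datum':'WGS84', 'ellps':'WGS84'}

# Source: UTM-K, a transverse Mercator projection on the GRS80 ellipsoid.
GRS80 = {'proj':'tmerc', 'lat_0':'38', 'lon_0':'127.5', 'k':0.9996, 'x_0':1000000,
         'y_0':2000000, 'ellps':'GRS80', 'units':'m'}

# Build the transformer once. pyproj's module-level transform() is deprecated
# in favor of Transformer; always_xy=True keeps (x, y) -> (lon, lat) ordering.
_TRANSFORMER = Transformer.from_proj(Proj(**GRS80), Proj(**WGS84), always_xy=True)

def grs80_to_wgs84(x, y):
    # Returns (longitude, latitude) in degrees.
    return _TRANSFORMER.transform(x, y)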
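
To convert a whole district boundary rather than a single point, each ring of a polygon can be mapped point by point. The following is a minimal sketch, assuming GeoJSON-style polygon coordinates (a list of rings, each a list of [x, y] pairs). `convert_polygon` is a hypothetical helper, not part of Cloudberry; it takes the point converter as an argument so that `grs80_to_wgs84` can be passed in.

```python
def convert_polygon(rings, convert_point):
    """Return a copy of a polygon's rings with every point converted.

    rings: list of rings, each a list of [x, y] pairs
           (GeoJSON-style layout; this is an assumption).
    convert_point: function (x, y) -> (lon, lat), e.g. grs80_to_wgs84.
    """
    # Build new lists so the input polygon is left unmodified.
    return [[list(convert_point(x, y)) for x, y in ring] for ring in rings]
```

For a feature whose geometry is a single polygon, this could be used as `convert_polygon(feature["geometry"]["coordinates"], grs80_to_wgs84)`. A MultiPolygon would need the same call applied to each of its polygons.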

2. Match properties of JSON file

Match the properties of the Korean JSON file to the ones the original Cloudberry expects, so that the file works without changes to Cloudberry.
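
One way to do this matching is to rename the property keys of each feature. The sketch below assumes the file is GeoJSON with a `properties` object per feature. The key names in `KEY_MAP` are hypothetical placeholders, since this issue does not list the actual source or target property names, and `remap_properties` is an illustrative helper rather than existing Cloudberry code.

```python
def remap_properties(feature, key_map):
    """Return a copy of a GeoJSON feature with its property keys renamed.

    key_map maps source keys to the keys the target schema expects.
    Source keys that are not listed in key_map are dropped.
    """
    props = feature.get("properties", {})
    renamed = {new: props[old] for old, new in key_map.items() if old in props}
    # Copy the feature and swap in the renamed properties.
    return {**feature, "properties": renamed}

# Hypothetical key names, for illustration only.
KEY_MAP = {"district_name": "name", "district_code": "id"}
```

Dropping unlisted keys keeps the output close to the target schema; to preserve them instead, the dictionary comprehension would need to copy unmapped keys through as well.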

3. Simplify data file

4. Solve linestring closed error
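
This issue does not record how the error was resolved. One common cause of a "not closed" error is a polygon ring whose first and last points differ, which GeoJSON (RFC 7946) does not allow. A minimal sketch under that assumption, with `close_ring` as a hypothetical helper:

```python
def close_ring(ring):
    """Return the ring with its first point appended if it is not already closed.

    ring: list of [x, y] pairs. An empty or already closed ring is returned as-is.
    """
    if ring and ring[0] != ring[-1]:
        # Append a copy of the first point so the ring starts and ends at the same place.
        return ring + [list(ring[0])]
    return ring
```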

5. Korean crash problem

6. Bring Korea Twitter Data

What we have to do

chenlica commented 5 years ago

@waans11 @baiqiushi status?

waans11 commented 5 years ago

Due to a restriction of AsterixDB regarding the tokenization of languages other than English, we need to fix that issue first.

baiqiushi commented 5 years ago

Low priority.