ZuInnoTe / hadoopoffice

HadoopOffice - Analyze Office documents using the Hadoop ecosystem (Spark/Flink/Hive)
Apache License 2.0
63 stars 31 forks source link

MSExcel: Color filtering of data. #56

Open jornfranke opened 5 years ago

jornfranke commented 5 years ago

Business that collect a lot of data in different structures (forms etc.) in Excel use often colors to indicate which cells represents columns and which cells data. This enhancement proposes to extract the data frame column names and the data itself based on the cell background color in Excel. It could be configured as follows (Numbers are ARGB codes):

Related to: https://github.com/ZuInnoTe/hadoopoffice/issues/55

jornfranke commented 4 years ago

target version hadoop office 1.4.0