Open GoogleCodeExporter opened 9 years ago
Column addition could be addressed with a filter that adds columns at
specific positions in the table. It is also possible to extend the batch
loader, but I believe this type of information is valuable any time users are
aggregating data, and filters work anywhere.
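
To make the filter idea concrete, here is a minimal sketch in plain Python. It
is not the replicator's filter API: the row-change layout (a dict with
"columns" and "values" lists) and the add_column helper are assumptions made
up for illustration only.

```python
from typing import Any, Dict, List

# Hypothetical row-change shape: parallel lists of column names and values.
RowChange = Dict[str, List[Any]]


def add_column(row_change: RowChange, name: str, value: Any, position: int) -> RowChange:
    """Return a copy of row_change with one extra column inserted at position."""
    columns = list(row_change["columns"])
    values = list(row_change["values"])
    # Insert into both lists at the same index so names and values stay aligned.
    columns.insert(position, name)
    values.insert(position, value)
    return {"columns": columns, "values": values}


change = {"columns": ["id", "msg"], "values": [1, "hello"]}
# Tag the row with the name of the source it came from (hypothetical value).
tagged = add_column(change, "source_name", "alpha", 0)
```

Returning a copy rather than mutating the input keeps the sketch safe to chain
with other filters; a real filter might modify the event in place instead.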
Adding source names to files could be addressed using the existing partition-by
support (see Issue 840) and partitioning by source. In that case the source
names would actually be directories rather than part of the file names.
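
As a rough illustration of partitioning by source, the sketch below builds a
staging path in which the source name is a directory level. The layout, the
staging_path helper, and the ".csv" naming are assumptions for illustration;
they are not taken from the existing partition-by implementation.

```python
from pathlib import PurePosixPath


def staging_path(base_dir: str, source: str, schema: str, table: str, seqno: int) -> PurePosixPath:
    """Build a hypothetical staging file path partitioned by source name."""
    # The source name becomes a directory, so the file name itself is unchanged.
    return PurePosixPath(base_dir) / source / schema / f"{table}-{seqno}.csv"


path = staging_path("/staging", "alpha", "sales", "orders", 42)
```

With this layout, files from different sources never collide, and a consumer
can recover the source from the directory name alone.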
One final consideration: if we add this information dynamically, it should
somehow flow back to schema generation. There is an argument for putting
schema information into the replication flow and letting filters operate not
only on row changes but also on metadata as both flow by. The appliers could
then be extended to generate Hive schema (for example) at apply time or
somewhere along the way.
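
A minimal sketch of that idea, assuming table metadata travels as a list of
(name, Hive type) pairs: the same kind of filter that adds a column to rows
also adds it to the metadata, and an applier-side helper renders Hive DDL from
whatever metadata arrives. The function names and metadata shape are
hypothetical and only meant to show the data flow.

```python
from typing import List, Tuple

Column = Tuple[str, str]  # (column name, Hive type); hypothetical metadata shape


def add_metadata_column(columns: List[Column], name: str, hive_type: str, position: int) -> List[Column]:
    """Filter step: return a copy of the metadata with a column inserted at position."""
    updated = list(columns)
    updated.insert(position, (name, hive_type))
    return updated


def hive_ddl(table: str, columns: List[Column]) -> str:
    """Applier step: render a CREATE TABLE statement from the metadata as received."""
    column_list = ", ".join(f"{name} {hive_type}" for name, hive_type in columns)
    return f"CREATE TABLE IF NOT EXISTS {table} ({column_list})"


metadata = [("id", "INT"), ("msg", "STRING")]
metadata = add_metadata_column(metadata, "source_name", "STRING", 0)
ddl = hive_ddl("orders", metadata)
```

Because the DDL is generated from the metadata after filtering, any
dynamically added column shows up in the schema without a separate step.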
Original comment by robert.h...@continuent.com
on 9 Apr 2014 at 9:11
Original issue reported on code.google.com by
mc.br...@continuent.com
on 9 Apr 2014 at 4:40