nteg / Big-Data

Big Data
0 stars 1 forks source link

Apache Flume POC for Yammer Data Acquisition #2

Open amitkumar02 opened 9 years ago

amitkumar02 commented 9 years ago

Design and develop a POC on Apache Flume using Java language. POC should cover -

  1. Exploration of out of the box available agents (source and sinks)
  2. Setting up an agent (with one source, one channel and 2 sinks) for loading yammer data into HDFS
  3. Configuring/developing a source for calling Yammer APIs
  4. Setting up of a sink for pushing the data into HDFS
  5. Setting up of a sink for pushing the data on to Apache Kafka

Once the POC is done, code should be checked into this repository and a POC document will need to be published detailing the above steps.