Hearen / AllInJava

Everything about Java I know and should know.
GNU General Public License v3.0
0 stars 0 forks source link

What is TD-Agent #143

Open Hearen opened 5 years ago

Hearen commented 5 years ago

Fluentd is a cross platform open-source data collection software project originally developed at Treasure Data. It is written primarily in the Ruby programming language.

Fluentd v1.0 is available on Linux, Mac OSX and Windows. Also, Treasure Data packages it as Treasure Agent (td-agent) for RedHat/CentOS and Ubuntu/Debian and Windows

Fluentd is a Big Data tool for semi- or un-structured data sets. Like Apache Kafka, it analyzes event logs, application logs, and clickstreams. According to Suonsyrjä and Mikkonen, the "core idea of Fluentd is to be the unifying layer between different types of log inputs and outputs.", Fluentd is available on Linux, Mac OSX, and Windows.

Fluentd was one of the data collection tools recommended by Amazon Web Services in 2013, when it was said to be similar to Apache Flume or Scribe. Google Cloud Platform's BigQuery recommends Fluentd as default real-time data-ingestion tool, and uses Google's customized version of Fluentd, called google-fluentd, as a default logging agent.

References

Hearen commented 5 years ago

What

Fluentd is an open source data collector, which lets you unify the data collection and consumption for a better use and understanding of data.

image

Unified Logging Layer

Fluentd decouples data sources from backend systems by providing a unified logging layer in between.

This layer allows developers and data analysts to utilize many types of logs as they are generated. Just as importantly, it mitigates the risk of "bad data" slowing down and misinforming your organization.

A unified logging layer lets you and your organization make better use of data and iterate more quickly on your software.

Features

In one word, td-agent is a stable distribution package of Fluentd.

References

Hearen commented 5 years ago

Catch Up

monitor each stage first and then improve its reliability as the ideal goal

Which part for now shall we start our work? (I know there are different types of configurations: td-agent, prometheus and environments without them);

for now we start from fluentd only (the log of td-agent itself to analyze the status of fluentd);

Options

Questions

Monitoring

Usages

vim /etc/td-agent/config/forwarder/td-agent-forwarded.conf 
sudo less /var/log/td-agent/td-agent.log.1

# adding DAEMON_ARGS=-vv to /etc/default/td-agent to enable verbose logging

References

Hearen commented 5 years ago

Integrated

Java

http://central.maven.org/maven2/com/treasuredata/

<dependencies>
  ...
  <dependency>
    <groupId>com.treasuredata</groupId>
    <artifactId>td-logger</artifactId>
    <version>${logger.version}</version>
  </dependency>
  ...
</dependencies>

References