apache / seatunnel

SeaTunnel is a next-generation super high-performance, distributed, massive data integration tool.
https://seatunnel.apache.org/
Apache License 2.0

Document optimization for LOCALFILE connector #7887

Open YOMO-Lee opened 2 days ago

YOMO-Lee commented 2 days ago

What happened

When I use the LocalFile connector, I need to filter some files, but the current documentation does not explain in detail how to configure the connector's file_filter_pattern property, so my attempts to filter files keep failing.

SeaTunnel Version

V2.3.7

SeaTunnel Config

env {
  parallelism = 1
  job.mode = "BATCH"
}

source {
  LocalFile {
    path = "/seatunnel/read/"
    file_format_type = "csv"
    skip_header_row_number = 1
    // example file name: abcD2024.csv
    file_filter_pattern = "abc*"
  }
}
sink {
  Console {
  }
}
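
One possible explanation for the config above, offered as an assumption rather than confirmed connector behavior: if file_filter_pattern is interpreted as a regular expression that must match the whole file name (not as a shell glob), then "abc*" means "ab followed by zero or more c" and cannot match abcD2024.csv. The sketch below only illustrates that difference with Python's re.fullmatch. The helper filter_files and the sample file names are hypothetical and are not part of SeaTunnel.

```python
import re


def filter_files(pattern: str, names: list[str]) -> list[str]:
    """Keep only the names that the regex matches in full.

    Full-name regex matching is an ASSUMPTION about how the connector
    applies file_filter_pattern; it is not taken from the SeaTunnel docs.
    """
    compiled = re.compile(pattern)
    return [name for name in names if compiled.fullmatch(name)]


# Hypothetical directory listing, modeled on the file name in the report.
names = ["abcD2024.csv", "abcX2024.csv", "xyz2024.csv"]

# Glob-style pattern from the report: as a regex it matches only
# "ab", "abc", "abcc", ... so no file name in the list is selected.
glob_style = filter_files("abc*", names)

# Regex-style pattern: "abc", then any characters, then a literal ".csv".
regex_style = filter_files(r"abc.*\.csv", names)

print(glob_style)
print(regex_style)
```

Under that assumption, a value such as file_filter_pattern = "abc.*" would select the abc-prefixed files, while "abc*" would select none. This is the kind of example the connector documentation could include.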

Running Command

./bin/seatunnel.sh --config job/test.config -e local

Error Exception

Unable to filter out files

Zeta or Flink or Spark Version

Zeta

Java or Scala Version

No response

Screenshots

No response

YOMO-Lee commented 2 days ago

assign to me