apache / seatunnel

SeaTunnel is a next-generation super high-performance, distributed, massive data integration tool.
https://seatunnel.apache.org/
Apache License 2.0
8.06k stars 1.83k forks source link

[Feature][Connectors] LocalFile Support reading gz #8019

Open zhdech opened 1 week ago

zhdech commented 1 week ago

Search before asking

Description

image In practical applications, some files end in. gz, and currently LocalFile Connectors version 2.3.8 does not support reading gz files 在实际应用中,有些文件是.gz结尾的,目前LocalFile Connectors 2.3.8版本不支持gz文件的读取

Usage Scenario

source {
  LocalFile {
    path = "/aa/bb/cc/2024_001.csv.gz"
    file_format_type = "csv"
    field_delimiter = ","
    result_table_name = "fake"
    archive_compress_codec="GZ"
    schema {
                fields {
            NAME= string
            AGE= string
            CREATETIME= string
            DESC= string
        }
    }
  }
}

Related issues

No response

Are you willing to submit a PR?

Code of Conduct