[X] I had searched in the issues and found no similar issues.
What happened
There is an issue with the encoding format when reading data from Elasticsearch . The root cause is that when reading data from Elasticsearch, the response header Content-Type does not include a charset encoding. SeaTunnel defaults to ISO-8859-1 encoding in the absence of a charset specification. However, StarRocks only supports UTF-8 encoding, leading to the observed garbled text. Need to adjust the encoding handling to ensure compatibility and data integrity. Open to discussion on potential solutions and improvements. I am considering submitting a pull request to address this issue.
This issue has been automatically marked as stale because it has not had recent activity for 30 days. It will be closed in next 7 days if no further activity occurs.
Search before asking
What happened
There is an issue with the encoding format when reading data from Elasticsearch . The root cause is that when reading data from Elasticsearch, the response header Content-Type does not include a charset encoding. SeaTunnel defaults to ISO-8859-1 encoding in the absence of a charset specification. However, StarRocks only supports UTF-8 encoding, leading to the observed garbled text. Need to adjust the encoding handling to ensure compatibility and data integrity. Open to discussion on potential solutions and improvements. I am considering submitting a pull request to address this issue.
SeaTunnel Version
2.3.3
SeaTunnel Config
Running Command
Error Exception
Zeta or Flink or Spark Version
No response
Java or Scala Version
No response
Screenshots
No response
Are you willing to submit PR?
Code of Conduct