tabular-io / iceberg-kafka-connect


Few questions #153

Closed: ajantha-bhat closed this issue 11 months ago

ajantha-bhat commented 11 months ago

Some of these things are not clear from the documentation or test cases. Can you please clarify?

  1. Do we support all of these converters for the source topic(s)? I know that the control topic only uses Avro.

    org.apache.kafka.connect.storage.StringConverter
    org.apache.kafka.connect.json.JsonConverter
    org.apache.kafka.connect.converters.ByteArrayConverter
    io.confluent.connect.avro.AvroConverter
    io.confluent.connect.protobuf.ProtobufConverter
    io.confluent.connect.json.JsonSchemaConverter
  2. Is Schema Registry supported?

  3. autoCreateTable does not create namespaces by default? Shall I raise a PR to fix that?

  4. Buffering and polling are currently only time-based (iceberg.control.commit.interval-ms)? Is there a plan for a message-count-based threshold?

  5. How can the ingestion-time column be added automatically? Should we use an SMT?

  6. Do we support all of these SMTs? https://docs.confluent.io/platform/current/connect/transforms/overview.html

  7. Will schema evolution work with auto table creation, given that the table's schema would need to be altered?

  8. Framework capabilities like error tolerance, dead letter queues, and deployment modes are supported with this connector too, right?

  9. How does commit retry work? I saw the commit-interval and commit-timeout configs. Do we have configurable retries?

  10. Can JMX be used to monitor the connector? https://docs.confluent.io/platform/current/connect/monitoring.html#use-jmx-to-monitor-kconnect

bryanck commented 11 months ago
  1. There are no specific limitations on which converters can be used by the sink; any of those should work. See the converter/Schema Registry config sketch below this list.
  2. Yes
  3. Feel free to open a PR if it is small; we're trying to limit disruptive changes during the Iceberg submission process.
  4. There are no plans currently
  5. Yes, an SMT makes sense here; see the SMT sketch below this list.
  6. There are no specific limitations on which SMTs can be used, so any of those should work.
  7. If I understand the question correctly, schema evolution should work with auto table creation.
  8. There are no limitations around these, so they all work the same way as with other sinks; see the error-handling sketch below this list.
  9. Commit retries can be configured via Iceberg table properties, e.g. commit.retry.num-retries; see the example below this list.
  10. JMX should work as with any other connector; this sink doesn't expose any custom metrics yet.
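
Regarding 1 and 2, a minimal sketch of the converter settings on the sink connector, assuming Avro with a Schema Registry; the registry URL is a placeholder for your environment:

    # Hypothetical connector config snippet (properties format); adjust the URL to your setup.
    key.converter=org.apache.kafka.connect.storage.StringConverter
    value.converter=io.confluent.connect.avro.AvroConverter
    value.converter.schema.registry.url=http://schema-registry:8081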
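
For 5, one option is the built-in InsertField transform, which copies the Kafka record timestamp into a named field; the transform alias and field name below are just examples:

    # Hypothetical SMT config snippet; the alias and field name are placeholders.
    transforms=addIngestTs
    transforms.addIngestTs.type=org.apache.kafka.connect.transforms.InsertField$Value
    transforms.addIngestTs.timestamp.field=ingest_ts

Note this uses the record's Kafka timestamp, which corresponds to broker ingestion time only when the topic is configured with LogAppendTime.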
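
For 8, these are standard Kafka Connect framework settings rather than anything specific to this sink; a sketch of enabling error tolerance with a dead letter queue (the topic name is a placeholder):

    # Hypothetical error-handling config snippet; the DLQ topic name is a placeholder.
    errors.tolerance=all
    errors.deadletterqueue.topic.name=iceberg-sink-dlq
    errors.deadletterqueue.context.headers.enable=true
    errors.log.enable=true

Keep in mind the framework only routes records that fail during conversion or transformation to the dead letter queue; errors raised inside the sink's write path are not captured there.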
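
For 9, commit retry behavior comes from standard Iceberg table properties set on the target table (for example via ALTER TABLE ... SET TBLPROPERTIES in your engine of choice); the values below are illustrative, not defaults:

    # Iceberg table properties controlling commit retries; values are illustrative.
    commit.retry.num-retries=10
    commit.retry.min-wait-ms=100
    commit.retry.max-wait-ms=60000
    commit.retry.total-timeout-ms=1800000
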
ajantha-bhat commented 11 months ago

Thanks.