Kafka ingestion: Allow users to reset the offset to consume

apache / pinot

Apache Pinot - A realtime distributed OLAP datastore

https://pinot.apache.org/

Apache License 2.0

5.5k stars 1.29k forks source link

Kafka ingestion: Allow users to reset the offset to consume #6637

Open xiangfu0 opened 3 years ago

xiangfu0 commented 3 years ago

This can help users to skip the bad data due to upstream issues.

mcvsubbu commented 3 years ago

This has to be done on a per-partition basis. A couple of ways to do this:

Provide a method to start the offset of the current consuming segment at a certain point. So, if a segment started consumption at offset 100, but had a problem at 120, the user can set the start offset to be (say) 125. In this case, there is loss of data from 100 to 120. Hopefully there are no bad offsets beyond 125.
Provide a method to skip certain offsets in a stream-partition. The user specifies an array of offsets that are to be skipped by the consumer. This is more complicated in terms of user-interface, but preserves maximum data possible.

I expect the second option to be a bit more complex in implementation as well. We will need to consume each row and check whether it is taboo or not.

I prefer the first approach -- big hammer but simple(r).

xiangfu0 commented 3 years ago

Agreed with the first approach. Approach two assumes that the bad data are coming in normal traffic load, we observed sometimes that upstream data producers may dump a lot of data due to error, so the skipping option can handle both cases.

In my view, this API should take a map of Kafka topic partitions to offsets mapping(no need for all the partitions), then seal all related partition segments then create new segments using the given start offsets.

mcvsubbu commented 3 years ago

It has to be stream agnostic, so we need to take a partition number (for now) and StreamMsgOffset (serialized)
We don't need to seal existing consuming segments. In fact, we cannot do that since they would have turned themselves OFFLINE having encountered the consumption error. We can start new consuming segments for the partitions provided, at the offsets provided.

xiangfu0 commented 3 years ago

It has to be stream agnostic, so we need to take a partition number (for now) and StreamMsgOffset (serialized) True, I think it's fine for the API.

We don't need to seal existing consuming segments. In fact, we cannot do that since they would have turned themselves OFFLINE having encountered the consumption error. We can start new consuming segments for the partitions provided, at the offsets provided.

Do you mean we should just update the offset for the current consuming segment then restart the segment consumption(drop whatever consumed then restart from the given offset)?

mcvsubbu commented 3 years ago

I suggest creating a new segment, with new metadata etc. having the new offset. Set it consuming state, and let the ball roll.

xiangfu0 commented 3 years ago

I suggest creating a new segment, with new metadata etc. having the new offset. Set it consuming state, and let the ball roll.

I see. Then how about the current consuming segments?

There are two scenarios:

If the current segment is in ERROR status, then it will be in ERROR status forever
If the current segment is in CONSUMING status, then two consuming segments will cause race conditions right?

mcvsubbu commented 3 years ago

A CONSUMING segment can go into ERROR state only during the OFFLINE to CONSUMING state transition. Once it has transitioned, it can never go into ERROR state. It will turn itself into OFFLINE state if there are consumption issues.