keedio / flume-ng-sql-source

Flume Source to import data from SQL Databases
Apache License 2.0
264 stars 164 forks source link

Issues faced while using 1.4.3 version #49

Open rishit606 opened 6 years ago

rishit606 commented 6 years ago

Hi ,

I was earlier using 1.3.7 version but tried to move to 1.4.3(latest) to avoid the quotes in output data file. However while testing the new version i was facing below issues: 1) output of custom query always repeated after 10k rows. Is max.rows property mandatory in 1.4.3 ?

Kindly suggest on above. thanks, Rishit

lazaromedina commented 6 years ago

Hi rishit606, i try to reproduce with latest release 1.5.0, but not getting repeated rows with 20k. max.rows is not mandatory, it sets a max amount of rows that if resulset query cannot reach, main thread will be put to sleep when processing events.

rishit606 commented 6 years ago

Thanks lazaromedina, 10k rows was without setting max.rows property. If i set max.rows property then it would work fine. In my project, rows are going beyond 3 crores, so i was thinking of not using the max.rows as not always sure about the count.

thanks, Rishit Shah.

lazaromedina commented 6 years ago

Hi rishit606, I'm not sure if I'm understanding your problem with the parameter "max.rows".

Regards, Luis

rishit606 commented 6 years ago

Hi Luis, Sorry for replying late. For your above points , here is my reply : Point 1 : Yes data repeats again after 10k rows Point 2: set max.rows = 50000 . Data repeats after 50000 rows . Point 3 : I couldn't get what you are trying to say.

My query is in version 1.3.7 i was not getting any duplicate rows (no max.rows property set). So is the default value of 10k set in new version ?

Thanks, Rishit shah

lazaromedina commented 6 years ago

Hi Rishit,

best, Luis