Open jimhester opened 5 years ago
This would likely give us better delimiter guessing results than the current method, and also let us guess things like the quote and escapes used.
The main cost would be implementation time and how much data is needs to be read for it to work well.
In order to overcome the amount of data needed to determine CSV dialects a method like this can be implemented. The methodology shows great accuracy reading only ten records from a CSV file.
This would likely give us better delimiter guessing results than the current method, and also let us guess things like the quote and escapes used.
The main cost would be implementation time and how much data is needs to be read for it to work well.