datetime in Apache log follow format of "[DD/MM/yyyy:HH:MM:SS -tz]". Somehow some rows are getting a '-' appended. This causes a failure in DATE_PARSE SQL function as it gets just the '-' instead of the date when using SPLIT.
Likely a data integrity problem. We might need script to check for bad input. Might also be related to blank lines appearing in data tables as well.
🕵️ Expected behavior
Table should be created. Workaround has been to filter for "-" has been implemented, however, this results in the loss of 80k rows.
📜 To Reproduce
On EN logs, run DATE_PARSE on SPLIT of datetime column: DATE(DATE_PARSE(SPLIT(datetime, ' ')[1], '[%d/%M/%Y:%H:%i:%s')) as date,
Checked for duplicates
Yes - I've already checked
🐛 Describe the bug
datetime in Apache log follow format of "[DD/MM/yyyy:HH:MM:SS -tz]". Somehow some rows are getting a '-' appended. This causes a failure in DATE_PARSE SQL function as it gets just the '-' instead of the date when using SPLIT.
Likely a data integrity problem. We might need script to check for bad input. Might also be related to blank lines appearing in data tables as well.
🕵️ Expected behavior
Table should be created. Workaround has been to filter for "-" has been implemented, however, this results in the loss of 80k rows.
📜 To Reproduce
🖥 Environment Info
No response
📚 Version of Software Used
No response
🩺 Test Data / Additional context
No response
🦄 Related requirements
🦄 #xyz
⚙️ Engineering Details
No response