databrickslabs / overwatch

Capture deep metrics on one or all assets within a Databricks workspace
Other
226 stars 64 forks source link

SparkEventLogs table not populating #1055

Closed Mpicca closed 1 year ago

Mpicca commented 1 year ago

Overwatch Version 0.7.2.1 Using Azure.

Describe the bug Bronze_SparkEventLogs not populating

Screenshots image

Full Validation Report: image

mohanbaabu1996 commented 1 year ago

Hi @Mpicca,

Please have an look into the documentation

Mpicca commented 1 year ago

@mohanbaabu1996 1 workspace configured All cluster have have logging path enabled. Unable to display 3 bullet point i receive the following: IllegalArgumentException: Cannot head a directory: dbfs:/mnt/storageaccountname/monitoring/overwatch

mohanbaabu1996 commented 1 year ago

Thanks! Are you sure this path is the cluster logging path ?

dbfs:/mnt/storageaccountname/monitoring/overwatch

If yes, can you drill down the sub folders and check do you have .gz files

mohanbaabu1996 commented 1 year ago

Hi @Mpicca

Did you get a chance to check the above test case ? Thank you!

Mpicca commented 1 year ago

@mohanbaabu1996 yes, the spark tables look to be populating now after multiple runs of the job. A new issue has arisen where the job looks to be completing but the gold tables are not processing / generating through the job, only bronze and silver: Is there any way we can get on a quick call?

mohanbaabu1996 commented 1 year ago

Hi @Mpicca,

I'm from IST and let me know yours. Before getting into a call, can you please share the pipReport to get the answer ready.

select * from ETL_DB.pipeline_report order by Pipeline_SnapTs desc

Please share me to my ID: mohan.baabu@databricks.com

Mpicca commented 1 year ago

@mohanbaabu1996 i'm unable to share due to company policy. There are no failures in any modules. Will be happy to share on call. I'm free anytime, please let me know soonest time you are available.

mohanbaabu1996 commented 1 year ago

Google Meet joining info Video call link: https://meet.google.com/ufy-psmm-kxk Or dial: ‪(US) +1 252-843-1062‬ PIN: ‪584 907 704‬# More phone numbers: https://tel.meet/ufy-psmm-kxk?pin=3437854930262

Can you please join here now ?

mohanbaabu1996 commented 1 year ago

Let me know if this works for you

joaoaugusto05 commented 2 months ago

What was the solution in this case? I'm facing the same problem :(

mohanbaabu1996 commented 2 months ago

Hi @joaoaugusto05 What is the status message for 1006 moduleID in the pipeline_report?

select * from ETL_DB.pipeline_report order by Pipeline_SnapTs desc

joaoaugusto05 commented 2 months ago

Hi @mohanbaabu1996, it was actually quite strange. The error message where, actually, that there was no new data.

I tried to run again but I got the same message. At the end, I solved rolling it back. It was a little expensive but worked :))