MicrosoftDocs / azure-docs

Open source documentation of Microsoft Azure
https://docs.microsoft.com/azure
Creative Commons Attribution 4.0 International
10.31k stars 21.49k forks source link

Drill with Blob Plugin Doesn't Work #112336

Closed pg-msft closed 1 year ago

pg-msft commented 1 year ago

Seems from previous issues that Drill with the Azure Blob plugin returns an error on any query against Avro files generated by Event Hub.

https://github.com/Azure-Samples/streaming-at-scale/issues/113 https://github.com/Azure-Samples/streaming-at-scale/issues/38

etc.

Those issues were resolved by changing the tutorial to use Data Bricks, but the section is still titled 'Use Apache Drill'. The tutorial seems to have nothing to do with Drill and uses a more complicated solution involving Data Bricks. References to Drill should probably be removed entirely.


Document Details

Do not edit this section. It is required for learn.microsoft.com ➟ GitHub issue linking.

RamanathanChinnappan-MSFT commented 1 year ago

@pg-nonfiction Thanks for your feedback! We will investigate and update as appropriate.

YashikaTyagii commented 1 year ago

@pg-nonfiction Thanks for your feedback! We have assigned the issue to author who will provide further updates.

spelluru commented 1 year ago

Updated the article.

please-close

pg-msft commented 1 year ago

Hello. Under the section titled 'Azure Storage Explorer', the doc still says 'You can also use Apache Drill for a lightweight SQL-driven experience..'. Unfortunately Drill doesn't work with .avro files at all (the Avro plugin throws an explicit exception on the complex types found in the Event Hub .avro files). Any reference to Drill in the article there we should include something about converting to JSON first using the Avro tools .jar, which does seem to work.