MicrosoftDocs / azure-docs

Open source documentation of Microsoft Azure
https://docs.microsoft.com/azure
Creative Commons Attribution 4.0 International
10.25k stars 21.42k forks source link

Detailed explanation between Self-hosted IR vs Azure IR #25103

Closed kvenkazure closed 5 years ago

kvenkazure commented 5 years ago

Azure integration runtime Dispatching the following transform activities in public network: HDInsight Hive activity, HDInsight Pig activity, HDInsight MapReduce activity, HDInsight Spark activity, HDInsight Streaming activity, Machine Learning Batch Execution activity, Machine Learning Update Resource activities, Stored Procedure activity, Data Lake Analytics U-SQL activity, .Net custom activity, Web activity, Lookup activity, and Get Metadata activity.

A self-hosted IR is capable of:

Dispatching the following transform activities against compute resources in On-Premise or Azure Virtual Network: HDInsight Hive activity (BYOC), HDInsight Pig activity (BYOC), HDInsight MapReduce activity (BYOC), HDInsight Spark activity (BYOC), HDInsight Streaming activity (BYOC), Machine Learning Batch Execution activity, Machine Learning Update Resource activities, Stored Procedure activity, Data Lake Analytics U-SQL activity, .Net custom activity, Lookup activity, and Get Metadata activities

Apart from Self-hosted is on-premise and Azure IR is on cloud. The only other difference I see is BYOC for the Self-hosted.

Where is confusing for self-hosted IR is these components (Machine Learning Batch Execution activity, Machine Learning Update Resource activities, Stored Procedure activity, Data Lake Analytics U-SQL activity, ) - IS the documentation is trying to explain that Self-hosted IR can reach these services on cloud or these services will reside on-premise.

It appears the content was copied between the two IR and for self-hosted just BYOC was added. Please add clarity to these run-time. Because of the fact there is direct costing linked to these and customers need to very clear on what they need to use and which service they need use.


Document Details

Do not edit this section. It is required for docs.microsoft.com ➟ GitHub issue linking.

AjayKumar-MSFT commented 5 years ago

@kvenkazure, Thanks for the feedback! I have assigned the issue to the content author to review further and update the document as appropriate.

nabhishek commented 5 years ago

@kvenkazure Sorry if you didn't find the doc clear. We will update it. To answer your question, Self-hosted IR is typically used for orchestrating services/ copying data to/from sources that are in a private network, though it can orchestrate services in public environment too. Suggestion is to use Azure IR for public endpoints since it is serverless!

nabhishek commented 5 years ago

please-close

AjayKumar-MSFT commented 5 years ago

@kvenkazure, We will close this out, but if you feel you need more information please just let us know. @nabhishek, Thank you!