-
Notifications
You must be signed in to change notification settings - Fork 2.9k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Update README.md - added note for MOE LLM deployments compatibility with ai inference sdk #39038
base: main
Are you sure you want to change the base?
Conversation
added note for MOE LLM deployments.
Thank you for your contribution @jakeatmsft! We will review the pull request and get back to you soon. |
API change check API changes are not detected in this pull request. |
@@ -42,6 +42,7 @@ See [Prerequisites](https://github.com/Azure/azure-sdk-for-python/blob/main/sdk/ | |||
To construct any of the clients, you will need to pass in the endpoint URL. If you are using key authentication, you also need to pass in the key associated with your deployed AI model. | |||
|
|||
* For Serverless API and Managed Compute endpoints, the endpoint URL has the form `https://your-unique-resouce-name.your-azure-region.models.ai.azure.com`, where `your-unique-resource-name` is your globally unique Azure resource name and `your-azure-region` is the Azure region where the model is deployed (e.g. `eastus2`). | |||
* For Managed Online Endpoints, do not include the inference path (e.g. `/score`) in endpoint URL. |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Last time I checked, the official term in the documentation is "Managed Compue", not "Managed Online". Can you please update your new line to say "Managed Compute"? Thanks!
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Last time I checked, the official term in the documentation is "Managed Compue", not "Managed Online". Can you please update your new line to say "Managed Compute"? Thanks!
fixed!
added note for MOE LLM deployments.
Description
Please add an informative description that covers that changes made by the pull request and link all relevant issues.
If an SDK is being regenerated based on a new swagger spec, a link to the pull request containing these swagger spec changes has been included above.
All SDK Contribution checklist:
General Guidelines and Best Practices
Testing Guidelines