# Create an Azure OpenAI inference endpoint **PUT /_inference/{task_type}/{azureopenai_inference_id}** Create an inference endpoint to perform an inference task with the `azureopenai` service. The list of chat completion models that you can choose from in your Azure OpenAI deployment include: * [GPT-4 and GPT-4 Turbo models](https://learn.microsoft.com/en-us/azure/ai-services/openai/concepts/models?tabs=global-standard%2Cstandard-chat-completions#gpt-4-and-gpt-4-turbo-models) * [GPT-3.5](https://learn.microsoft.com/en-us/azure/ai-services/openai/concepts/models?tabs=global-standard%2Cstandard-chat-completions#gpt-35) The list of embeddings models that you can choose from in your deployment can be found in the [Azure models documentation](https://learn.microsoft.com/en-us/azure/ai-services/openai/concepts/models?tabs=global-standard%2Cstandard-chat-completions#embeddings). ## Required authorization * Cluster privileges: `manage_inference` ## Servers - http://api.example.com: http://api.example.com () ## Authentication methods - Api key auth - Basic auth - Bearer auth ## Parameters ### Path parameters - **task_type** (string) The type of the inference task that the model will perform. NOTE: The `chat_completion` task type only supports streaming and only through the _stream API. - **azureopenai_inference_id** (string) The unique identifier of the inference endpoint. ### Query parameters - **timeout** (string) Specifies the amount of time to wait for the inference endpoint to be created. ### Body: application/json (object) - **chunking_settings** (object) The chunking configuration object. - **service** (string) The type of service supported for the specified task type. In this case, `azureopenai`. - **service_settings** (object) Settings used to install the inference model. These settings are specific to the `azureopenai` service. - **task_settings** (object) Settings to configure the inference task. These settings are specific to the task type you specified. ## Responses ### 200 #### Body: application/json (object) - **chunking_settings** (object) Chunking configuration object - **service** (string) The service type - **service_settings** (object) Settings specific to the service - **task_settings** (object) Task settings specific to the service and task type - **inference_id** (string) The inference Id - **task_type** (string) The task type [Powered by Bump.sh](https://bump.sh)