Artificial Intelligence and Machine Learning Discussions
Engage in AI and ML discussions. Collaborate on innovative solutions, explore SAP's AI innovations, and discuss use cases, challenges, and future possibilities.
cancel
Showing results for 
Search instead for 
Did you mean: 

What is the actual URL endpoint for using a deployment for chat completions in SAP AI Core?

TiagoAlmeida
Participant
0 Kudos

Hello all, 

TLDR: If you have a deployment of an azure-openai model, which path do you call to get chat completions?

 

We have a deployment visible in AI launchpad and in status running. This is ID dd5....b5

TiagoAlmeida_0-1715872053401.png

This is tied to a configuration that is using azure openai model (we intend to use gpt-3.5turbo).

 

In postman, we can make requests to endpoints such as /lm/deployments and indeed get information about the deployment. Showing that the authentication to the API is working. Example:

TiagoAlmeida_1-1715872328468.png

 

Now, we'd like to call the endpoint to get "chat completions". i.e. passing a prompt and get a completion back.

Examples online suggest that completions are available under /v2/inference/deployments/<deploymentid>/v2/completions but this fails with a 404: 

TiagoAlmeida_2-1715872822978.png

Indeed we found some conflicting info online and decided to try variations of the differents paths. They always return either a 404 or a 403.

Paths tested:

/v2/inference/deployments/dd5...b5/

/v2/inference/deployments/dd5...b5/v2/completions

/v2/inference/deployments/dd5...b5/v2/completion

/v2/inference/deployments/dd5...b5/v2/chat

/v2/inference/deployments/dd5...b5/v2/chat-completion

/v2/inference/deployments/dd5...b5/v2/query

/v2/inference/deployments/dd5...b5/chat-completion

/v2/inference/deployments/dd5...b5/chat

/v2/inference/deployments/dd5...b5/completions

/v2/inference/deployments/dd5...b5/query

 

These paths also fail but with 403 forbidden RBAC: access denied

/v2/lm/deployments/dd5...b5/v2/query

/v2/lm/deployments/dd5...b5/v2/completions

/v2/lm/deployments/dd5...b5/v2/chat-completions

.../chat/completions?api-version=v2

 

References and examples from the web:

This example suggest it should be /v2/inference/deployments/<YOUR_AICORE_DEPLOYMENT_ID>/v2/chat-completion 

This example suggests it's either /v2/completion or /v2/chat-completion at the end.

The CAP LLM plugin sample suggests it's /chat/completions with an apiversion parameter with an undocumented value

This person is similarly confused about which path to use.

Question

Do you know what is the correct path please? Any help appreciated.

 

@Nora 

 

1 REPLY 1

TiagoAlmeida
Participant
0 Kudos

I asked too early 🙂

The correct path is /v2/inference/deployments/<deployment id>/chat/completions?api-version=2023-05-15

Thanks