Delay in sending a HTTP request to Open AI model

Ask Question

Asked 3 months ago

Modified 3 months ago

Viewed 76 times

Part of OpenAI Collective

I have coded a C++ binary using WinHTTP running on Windows 11 to send a prompt to Open AI and get a response from it. WinHTTP has the following stages:

Create a session with WinHTTPOpen().
Create a connection with WinHttpConnect(). Server is api.openai.com.
Create a request with WinHttpOpenRequest(). I use POST for /v1/chat/completions.
Send the request to the server with WinHttpSendRequest().
Receive a response from the server with WinHttpReceiveResponse().
Read the data with WinHttpQueryDataAvailable() and WinHttpReadData().

Step 4 is taking 6.8 seconds on an average for simple prompts that I send. Simple means something like "How far is Earth from the Sun?", "How many planets are there in the Solar System?", etc. Rest of the steps take 1 to 30ms. But the 6800 ms for Step 4 stood out. I tried to pre-warm the connection by going through the whole sequence (steps 1 to 6) ahead of time with a dummy prompt ("Say hello to me"), retaining the session and connection and only crafting the request (Step 3 onwards) later when I test with variable prompts. Sometimes Step 4 takes 3500 ms, but mostly it takes 6800 ms. During the investigation I found it is using HTTP/1.1, not HTTP/2.

Is there a client side configuration to reduce the latency?

edited Aug 14 at 14:31

asked Aug 14 at 7:21

Dark Matter

901 silver badge7 bronze badges

Using libcurl from Linux C++ code, I see 2.3 to 2.4 seconds latency for similar queries. Two major differences:1. From Windows I send string in UTF-8. 2. The endpoint from Linux is "/responses". From Windows it is "/v1/chat/completions". Checking impact of those changes.

Dark Matter
– Dark Matter

2025-08-14 23:33:40 +00:00
Commented Aug 14 at 23:33
I used libcurl from Windows (not WSL) to find the latency is 1.8 to 1.9 seconds. Unsure what the problem is with WinHTTP.

Dark Matter
– Dark Matter

2025-08-18 14:48:18 +00:00
Commented Aug 18 at 14:48

Add a comment |

0 Your Answer

Sign up or log in

Post as a guest

Name

Required, but never shown

Post as a guest

Name

Required, but never shown

By clicking “Post Your Answer”, you agree to our terms of service and acknowledge you have read our privacy policy.

Start asking to get answers

Find the answer to your question by asking.

Ask question

Explore related questions

See similar questions with these tags.

Collectives™ on Stack Overflow

Delay in sending a HTTP request to Open AI model

0

Your Answer

Hot Network Questions

Collectives™ on Stack Overflow

0

Know someone who can answer? Share a link to this question via email, Twitter, or Facebook.

Your Answer

Sign up or log in

Post as a guest