Account and Authentication
How do I obtain an API Key?
How do I obtain an API Key?
Does the License have an expiration date? Can it be reset?
Does the License have an expiration date? Can it be reset?
Quick Start Questions
How can I quickly call the one-time synthesis interface?
How can I quickly call the one-time synthesis interface?
video_url and audio_url to generate the video. The example response format is:What are the minimum steps required for the real-time digital person to connect for the first time?
What are the minimum steps required for the real-time digital person to connect for the first time?
- Establish a WebSocket connection to
wss://transfer.navtalk.ai/wss/v2/realtime-chat(include thelicenseandnameparameters in the URL). - Wait for
conversation.connected.successevent containing session ID and ICE servers. - Optionally send conversation history via
conversation.item.createmessages. - Capture microphone audio and send it via
realtime.input_audio_buffer.append. - Receive AI response text/audio stream/video stream (WebRTC through the same connection).
Real-time WebSocket Connection Issues
What should I do if the WebSocket connection fails?
What should I do if the WebSocket connection fails?
- Is the license valid?
- Is the WebSocket address correct:
wss://api.navtalk.ai/realtime-api? - Does Chrome allow microphone access?
Do I need to configure WebRTC to get the digital person's video on the webpage?
Do I need to configure WebRTC to get the digital person's video on the webpage?
video. Please ensure that after connecting to the WebSocket, you simultaneously establish a WebRTC video channel and bind it to the video tag to play.Character and Behavior Settings
How do I specify the character settings and greeting of the digital person?
How do I specify the character settings and greeting of the digital person?
prompt field of the realtime.input_config message, for example:Can I specify the tone of the digital person? What are the options?
Can I specify the tone of the digital person? What are the options?
voice: "nova", which supports the following 9 tones: alloy, shimmer, coral, echo, ballad, ash, sage, verse.See Voice Styles for complete descriptions and audio previews.Context and Memory Issues
How can I make the digital person remember the user's history of conversations?
How can I make the digital person remember the user's history of conversations?
- Embed conversation context in the
promptfield ofrealtime.input_configto simulate full context. - Use
conversation.item.createto send historical messages (only supports user messages) after receiving therealtime.session.createdevent.
Why can't the AI remember the previous conversation?
Why can't the AI remember the previous conversation?
- Does your
realtime.input_configmessage include contextual content in thepromptfield? - Did you send conversation history using
conversation.item.createafter receivingrealtime.session.created?
Function Call Issues
Why is there no response after configuring the function call?
Why is there no response after configuring the function call?
- Please confirm that the
toolsparameter has been correctly registered. - Check if you are listening for the
response.function_call_arguments.doneevent. - Is the backend correctly returning
function_call_output?
After the function call result is pushed, why is there no response from the AI?
After the function call result is pushed, why is there no response from the AI?
Media Interface Call Issues
How long will it take to receive results after synthesizing video and audio?
How long will it take to receive results after synthesizing video and audio?
query_status interface until you receive:Can I upload files directly?
Can I upload files directly?