2026
March 14
2026
- Real-time Digital Human API
- Video Synthesis API
- Others
New Features
- AvatarId Parameter Support: Added avatarId parameter for WebSocket connections, enabling precise avatar lookup by ID instead of name.
- Duplicate Name Warning: When multiple avatars share the same name, the system automatically selects the most recently updated one and sends a warning message (type: conversation.warning) to the client.
- Query Priority Mechanism: avatarId parameter has higher priority than name parameter when both are provided.
- Avatar Query Optimization: Added ORDER BY updated_at DESC to avatar name queries to ensure consistent results.
- Code Quality: Improved avatar validation logic to reduce code duplication and improve maintainability.
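The lookup rules in this release can be illustrated with a small in-memory sketch. It is not the service's implementation: the Avatar class, the resolve_avatar function, and the warning's message field are hypothetical, and only the conversation.warning type value comes from the notes above. The sketch also assumes that an unknown avatarId does not fall back to a name lookup, which the changelog does not state.

```python
from dataclasses import dataclass
from datetime import datetime


@dataclass
class Avatar:
    avatar_id: str
    name: str
    updated_at: datetime


def resolve_avatar(avatars, avatar_id=None, name=None):
    """Return (avatar, warning) using the priority described in the changelog."""
    # avatarId wins whenever it is provided, even if name is also given.
    if avatar_id is not None:
        for avatar in avatars:
            if avatar.avatar_id == avatar_id:
                return avatar, None
        # Assumption: no fallback to name when the ID is unknown.
        return None, None

    # Name lookup: newest first, mirroring ORDER BY updated_at DESC.
    matches = sorted(
        (a for a in avatars if a.name == name),
        key=lambda a: a.updated_at,
        reverse=True,
    )
    if not matches:
        return None, None

    warning = None
    if len(matches) > 1:
        # Hypothetical payload: only the "type" value appears in the changelog.
        warning = {
            "type": "conversation.warning",
            "message": f"{len(matches)} avatars share the name {name!r}",
        }
    return matches[0], warning


avatars = [
    Avatar("a1", "Guide", datetime(2026, 1, 5)),
    Avatar("a2", "Guide", datetime(2026, 3, 1)),
]
by_name, name_warning = resolve_avatar(avatars, name="Guide")
by_id, id_warning = resolve_avatar(avatars, avatar_id="a1", name="Guide")
```

Here the name lookup selects a2, the more recently updated avatar, and produces a warning, while the lookup that also passes avatarId returns a1 with no warning.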
2025
December 20
2025
- Real-time Digital Human API
- Video Synthesis API
- Voice Styles
- Others
New Features
- Unified WebSocket Connection: Merged two separate WebSocket connections into a single unified connection (wss://transfer.navtalk.ai/wss/v2/realtime-chat).
- Intelligent Parameter Configuration: Simplified connection parameters with a default value mechanism. Only license and name are required.
- Custom Event Encapsulation: All event types now use custom event encapsulation with NavTalkMessageType constants.
- Response Speed Optimization: Achieved industry-leading response latency of approximately 200ms through model and system optimization.
- Simplified Session Configuration: Session configuration is now passed via URL query parameters instead of messages.
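Because session configuration now travels in the URL, a client mostly needs to assemble a query string. The sketch below shows one way to do that with the Python standard library. It assumes that license and name are the literal query keys and that any optional setting can be passed the same way. The build_realtime_url helper and the demo-license value are hypothetical.

```python
from urllib.parse import urlencode

# Endpoint taken from the changelog entry above.
REALTIME_URL = "wss://transfer.navtalk.ai/wss/v2/realtime-chat"


def build_realtime_url(license_key, name, **optional):
    """Build the connection URL; only license and name are mandatory."""
    if not license_key or not name:
        raise ValueError("license and name are required")
    params = {"license": license_key, "name": name}
    # Leave unset options out so the server-side defaults apply.
    params.update({k: v for k, v in optional.items() if v is not None})
    return f"{REALTIME_URL}?{urlencode(params)}"


url = build_realtime_url("demo-license", "Guide", avatarId="a2", voice=None)
```

Omitting options that are None keeps the URL short and lets the default value mechanism fill them in. The resulting string can be handed to any WebSocket client.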
November 6
2025
- Real-time Digital Human API
- Video Synthesis API
- Voice Styles
- Others
New Features
- New Voice Styles: Added cedar and marin voice styles for real-time digital human conversations.
- Dialect Support: Added support for dialects in real-time digital human conversations.
- 21 New System Characters: Introduced 21 new system characters for enhanced digital human interactions.
- Video Stability Fix: Fixed video instability issues during real-time digital human conversations.
- Model Upgrade: Enhanced lip-sync accuracy and audio-visual synchronization precision for digital humans.
May 29
2025
- Real-time Digital Human API
- Video Synthesis API
- Voice Styles
- Others
New Features
- Character Definition: Supports building complete character instructions, including voice styles, function rules, and contextual memory.
- Voice Diversity: Introduced 9 new professional voice configurations covering styles such as customer service, broadcasting, teaching, and entertainment.
- Contextual Memory Mechanism: Added dual-channel strategies with messageConfig and conversation.item.create.
- Function Calling: Introduced an end-to-end example of function callbacks that invokes external services such as CRM, weather, and search.
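As a rough illustration of the conversation.item.create channel, the sketch below turns earlier turns into events that a client could send after connecting. The payload shape is an assumption modeled on the OpenAI Realtime API event of the same name. The changelog does not document the actual fields, so the item, role, and content keys, as well as the history_to_events helper, are hypothetical.

```python
import json


def history_to_events(history):
    """Convert (role, text) pairs into conversation.item.create events."""
    events = []
    for role, text in history:
        if role not in ("user", "assistant"):
            raise ValueError(f"unsupported role: {role}")
        # Assumed content types: "input_text" for user turns, "text" otherwise.
        content_type = "input_text" if role == "user" else "text"
        events.append(
            {
                "type": "conversation.item.create",
                "item": {
                    "type": "message",
                    "role": role,
                    "content": [{"type": content_type, "text": text}],
                },
            }
        )
    return events


events = history_to_events(
    [("user", "My order number is 1234."), ("assistant", "Thanks, noted.")]
)
# Each event would be serialized and sent over the open WebSocket.
frames = [json.dumps(event) for event in events]
```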
May 15
2025
- Real-time Digital Human API
- Video Synthesis API
- Voice Styles
- Others
New Features
No features were added in this release.
Enhancements
- TTS Synthesis Enhancement: Added emotional speech synthesis, compatible with the Nova, Fable, and Echo styles. The TTS interface now validates the voice parameter.
April 10
2025
- Real-time Digital Human API
- Video Synthesis API
- Voice Styles
- Others
New Features
- Function Calling Officially Open for Testing: Custom functions can be defined with the tools field, letting digital humans trigger external actions. Applicable scenarios include weather inquiries, knowledge retrieval, and business system integration.
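A tools entry and its client-side handler might look like the sketch below. The JSON-Schema-style definition is an assumption, since the changelog names only the tools field and not its schema. The get_weather function, the HANDLERS table, and dispatch_function_call are hypothetical names used for illustration.

```python
import json

# Hypothetical tool definition that would be listed under the tools field.
WEATHER_TOOL = {
    "type": "function",
    "name": "get_weather",
    "description": "Look up the current weather for a city.",
    "parameters": {
        "type": "object",
        "properties": {"city": {"type": "string"}},
        "required": ["city"],
    },
}


def get_weather(city):
    # Placeholder result; a real handler would call a weather service here.
    return {"city": city, "forecast": "unknown"}


# Maps function names requested by the model to local callables.
HANDLERS = {"get_weather": get_weather}


def dispatch_function_call(name, arguments_json):
    """Run the named handler and return its result as a JSON string."""
    handler = HANDLERS.get(name)
    if handler is None:
        return json.dumps({"error": f"unknown function: {name}"})
    arguments = json.loads(arguments_json)
    return json.dumps(handler(**arguments))


result = dispatch_function_call("get_weather", '{"city": "Oslo"}')
```

Returning an error object for unknown names, rather than raising, keeps the conversation going when the model requests a function the client does not implement.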
March 10
2025
- Real-time Digital Human API
- Video Synthesis API
- Voice Styles
- Others
New Features
No features were added in this release.
Enhancements
- WebRTC Video Channel Stability Optimization: Enhanced ICE connection logic to support automatic reconnection after disconnection. Latency is now kept under 500ms.