OpenAI Introduces New Voice AI Features for API, Enhancing User Interaction
OpenAI has launched new voice AI features for its API platform, aimed at enhancing user interaction through applications that can transcribe speech and translate languages. The new GPT-Realtime-2 model offers realistic voice simulation, enabling natural conversations with users. This model, which possesses GPT-5 level reasoning capabilities, is designed to process more complex requests than its predecessor. Additionally, OpenAI introduced the GPT-Realtime-Translate feature, providing real-time translation services in over 70 input and 13 output languages. The GPT-Realtime-Whisper tool offers live speech-to-text transcription, recording interactions instantly. These new models are expected to revolutionize sectors such as customer service, education, media, and content creation. OpenAI has implemented special protection systems to prevent abuse, fraud, and spam, automatically terminating interactions if harmful content rules are violated.