Skip to content

JOBUZO

  • News
  • Indonesia
  • Toggle search form
OpenAI launches new voice intelligence features in its API

OpenAI launches new voice intelligence features in its API

Posted on 7 May 2026 By jobuzo

OpenAI said Thursday that its API will now include a number of new voice intelligence features designed to help developers create apps that can talk, transcribe, and translate conversations with users.

The company’s new GPT‑Realtime‑2 is another voice model, built to create a realistic vocal simulation that can converse with users. However, unlike its predecessor (GPT-Realtime-1.5) this one is built with GPT‑5‑class reasoning that OpenAI says was created to deal with more complicated requests from users.

The company is also launching GPT‑Realtime‑Translate, which, just as it sounds, is designed to provide real-time translation services that “keep pace” with the user, conversationally. The feature includes more than 70 input languages (that is, the languages that it can comprehend) and 13 output languages (the languages it relays to the speaker).

Finally, the company has also launched a new transcription capability, GPT-Realtime-Whisper, which gives users live speech-to-text capabilities that are captured as interactions occur.

“Together, the models we are launching move real-time audio from simple call-and-response toward voice interfaces that can actually do work: listen, reason, translate, transcribe, and take action as a conversation unfolds,” the company said.

Who will these updates be good for? Companies that want to expand customer service capabilities are an obvious target. However, OpenAI also notes that its new features will assist with a wide array of areas, including education, media, events, and creator platforms, among others.

News :<div>12 weeks' jail for school IT support technician who took upskirt videos of teachers</div>

As useful as these tools seem from an enterprise perspective, it also seems plausible that they could be misused. The company said it has built guardrails to stop its new features from being abused to create spam, fraud, or other forms of online abuse. Certain triggers have been embedded in the system so that “conversations can be halted if they are detected as violating our harmful content guidelines,” OpenAI said.

Techcrunch event

San Francisco, CA
|
October 13-15, 2026

All of the new voice models are included in OpenAI’s Realtime API. Translate and Whisper are billed by the minute, while GPT-Realtime-2 is billed by token consumption.

When you purchase through links in our articles, we may earn a small commission. This doesn’t affect our editorial independence.

OpenAI launches new voice intelligence features in its API


News

Post navigation

Previous Post: Interview: Russia-China film co-productions boost cultural ties, says senior media executive
Next Post: Everything to Know About Hantavirus Cruise Ship Outbreak That’s Killed 3 People

Related Posts

Pedri returns to field for Barcelona after one-month injury absence News
Zelensky dice a CNN que las conversaciones sobre Ucrania no pueden esperar hasta que termine la guerra en Irán News
Louisiana attorney general sues Roblox Louisiana attorney general sues Roblox News

Latest

  • Zambia commends Chinese medical team for enhancing healthcare delivery
  • Strait of Hormuz tanker damaged in projectile strike as US-Iran tensions escalate
  • Explosion, heavy gunfire in Pakistan’s Karachi near Rangers offices
  • Iranian drones attack Bahrain and a ship is struck in the strait after US airstrikes
  • World Cup final day of group play will set the field for the round of 32
  • What the Leaked Galaxy Z Fold 8 Colors Reveal About the Wider Design
  • Body of teen girl found in luggage near Thailand train tracks, Australian man arrested
  • Australia plans to strengthen laws banning children from social media
  • Amazon Prime Day 2026 Final Hours: Everything You Need to Know to Score the Best Deals (Live Updates)
  • Trump Admin releases Anthropic Mythos to be used by more than 100 US companies, agencies

Copyright © 2025 JOBUZO. Disclaimers | Privacy Policies

Powered by PressBook Masonry Blogs