Telegram (AI) YouTube Facebook X
Ру
OpenAI Introduces Voice Mode Customization for Developers

OpenAI Introduces Voice Mode Customization for Developers

OpenAI has announced several new tools, including a public beta version of a Realtime API for creating applications with low-latency voice response capabilities, according to TechCrunch.

With this new feature, developers can enable real-time voice communication in their applications, supporting six voices.

Romain Huet, Head of Developer Relations, demonstrated a travel planning app that allows users to interact verbally with an AI assistant.

Among other announcements from OpenAI is the customization of AI “vision,” aimed at enhancing the visual understanding of neural networks.

Additionally, a feature for utilizing larger models like o1-preview and GPT-4o to train smaller ones was introduced.

OpenAI’s Chief Product Officer Kevin Weil noted that the recent departures of CTO Mira Murati and Chief Scientist Bob McGrew will not affect the company’s operations.

Meanwhile, one of OpenAI’s co-founders, Durk Kingma, has joined the rival AI startup Anthropic.

He left Sam Altman’s company in 2018 to become a business angel and consultant for AI startups.

Anthropic was founded in 2021 by former OpenAI Vice President Dario Amodei and his sister Daniela Amodei.

In August, the firm lured John Schulman, a co-founder of the ChatGPT developer company. In May, it hired former OpenAI safety lead Jan Leike.

Altman’s startup is reportedly in talks to raise $6.5 billion at a valuation of $150 billion.

In September, it announced the launch of an enhanced voice mode for ChatGPT.

Подписывайтесь на ForkLog в социальных сетях

Telegram (основной канал) Facebook X
Нашли ошибку в тексте? Выделите ее и нажмите CTRL+ENTER

Рассылки ForkLog: держите руку на пульсе биткоин-индустрии!

We use cookies to improve the quality of our service.

By using this website, you agree to the Privacy policy.

OK