Friday, 22 November 2024

OpenAI introduces a cutting-edge audio tool capable of replicating human voices

OpenAI unveils Voice Engine, an audio tool that accurately replicates human voices, showcasing the potential and ethical considerations of AI advancements.

OpenAI, a leading name in the AI sector, has recently showcased its latest advancement: an audio tool that converts text into speech that sounds remarkably human. The development is at the forefront of AI technology, yet it also raises concerns about the creation of deepfakes.

A cautious rollout amid ethical considerations

So far, the new tool, named Voice Engine, has been made available to a select group of around 10 developers. Despite plans for a broader release to potentially 100 developers, OpenAI decided to limit access after consulting with various stakeholders, including policymakers, industry specialists, and educational professionals. This cautious approach, detailed in a company blog post on March 29, reflects the potential ethical and safety implications, particularly in the context of an election year.

Voice Engine differs from earlier audio tools in that it can accurately replicate a specific individual’s voice from only a 15-second audio sample. During a demonstration, Bloomberg heard a clip in which OpenAI’s CEO, Sam Altman, discussed the technology in an AI-generated voice that was virtually indistinguishable from his own.

Because the replication is so precise, OpenAI is proceeding with caution and emphasizing the importance of safe use. The technology’s potential benefits were also highlighted, such as helping patients at the Norman Prince Neurosciences Institute regain their voices. In one instance, a young patient who had lost her voice to a brain tumour was able to speak clearly again for a school project, thanks to Voice Engine.

Expanding the potential of voice replication

Moreover, the tool can translate generated audio into various languages, which is useful for companies making content more accessible across different linguistic groups. OpenAI has outlined strict usage policies for its partners, including obtaining consent from the voice’s original owner and informing listeners that the speech they hear is AI-generated. Additionally, an inaudible audio watermark is being used to track the origin of audio clips.

As OpenAI considers wider release, it seeks feedback to gauge the global response to such technology, emphasizing the importance of public understanding and preparation for AI advancements. The firm is advocating for measures to increase societal resilience against the potential misuse of AI technologies, such as phasing out voice authentication in banks and educating the public on detecting AI-generated content.
