Friday, 14 March 2025

AI startup Sesame unveils base model for its voice assistant

AI startup Sesame has released CSM-1B, the base model behind its voice assistant Maya, raising concerns over voice cloning risks and safeguards.

Sesame, the AI startup behind the widely discussed virtual assistant Maya, has released the base model that powers its advanced voice technology. The company’s new AI model, CSM-1B, is now available under an Apache 2.0 licence, meaning it can be used commercially with minimal restrictions.

The model has 1 billion parameters, the learned numerical weights that determine how it processes inputs and generates outputs. According to Sesame, CSM-1B produces “RVQ audio codes” from text and audio inputs. The term refers to Residual Vector Quantisation (RVQ), a technique that converts audio into discrete digital tokens called codes. RVQ is commonly used in AI-powered audio tools, including Google’s SoundStream and Meta’s EnCodec.
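To make the idea of RVQ codes more concrete, here is a minimal sketch of residual vector quantisation in plain Python. It is not Sesame’s implementation: the two tiny codebooks (`COARSE`, `FINE`), the function names, and the two-dimensional “audio frame” are all hypothetical, chosen only to show the general technique. Each stage picks the nearest entry in its codebook, and the next stage then quantises whatever error is left over.

```python
from typing import List, Sequence

Codebook = Sequence[Sequence[float]]


def nearest(codebook: Codebook, v: Sequence[float]) -> int:
    """Return the index of the codeword closest to v (squared Euclidean distance)."""
    return min(
        range(len(codebook)),
        key=lambda i: sum((c - x) ** 2 for c, x in zip(codebook[i], v)),
    )


def rvq_encode(codebooks: Sequence[Codebook], v: Sequence[float]) -> List[int]:
    """Quantise v stage by stage; each stage encodes the previous stage's residual."""
    residual = list(v)
    codes = []
    for codebook in codebooks:
        idx = nearest(codebook, residual)
        codes.append(idx)
        # Subtract the chosen codeword so the next stage sees only the leftover error.
        residual = [r - c for r, c in zip(residual, codebook[idx])]
    return codes


def rvq_decode(codebooks: Sequence[Codebook], codes: Sequence[int]) -> List[float]:
    """Rebuild an approximation of the input by summing the chosen codewords."""
    out = [0.0] * len(codebooks[0][0])
    for codebook, idx in zip(codebooks, codes):
        out = [o + c for o, c in zip(out, codebook[idx])]
    return out


# Hypothetical hand-written codebooks; real systems learn these from audio data.
COARSE = [[0.0, 0.0], [1.0, 1.0], [-1.0, 1.0], [1.0, -1.0]]
FINE = [[0.0, 0.0], [0.25, 0.0], [0.0, 0.25], [-0.25, 0.0]]

frame = [1.2, 0.9]  # stand-in for one encoded audio frame
codes = rvq_encode([COARSE, FINE], frame)
approx = rvq_decode([COARSE, FINE], codes)
print(codes, approx)
```

In this toy setup the frame is stored as two small integers, one per stage, instead of two floating-point numbers. Adding more stages would shrink the reconstruction error further, which is the trade-off RVQ-based audio codecs rely on.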

How Sesame’s AI model works

CSM-1B combines Meta’s Llama language model with an audio decoder, creating a system capable of generating realistic speech. While the base model can produce various voices, Sesame notes that it is not specifically fine-tuned for any particular voice. However, the company has developed a refined version that powers its virtual assistant, Maya.

Sesame acknowledges that the model has some capacity to understand and generate non-English languages, but this is limited due to the nature of its training data. The company has not disclosed details about the datasets used to train CSM-1B.

Despite this AI’s impressive capabilities, Sesame has implemented very few safeguards. Developers and users are urged to follow an honour system, refraining from using the model to replicate voices without consent, spread misinformation, or engage in harmful activities. However, there are no built-in restrictions to prevent misuse.

Concerns over voice cloning risks

A hands-on test of the CSM-1B demo on the AI platform Hugging Face showed how quickly the model can replicate a person’s voice. The cloning process took less than a minute, and from there, generating speech on various topics, including politically sensitive issues like elections and Russian propaganda, was effortless.

Consumer Reports recently raised concerns about the growing number of AI-powered voice cloning tools, warning that many lack meaningful safeguards to prevent fraud or abuse. The rapid development of these technologies has sparked discussions about the potential risks of deepfake audio and misinformation.

Sesame was co-founded by Brendan Iribe, best known as the co-creator of Oculus. The company gained widespread attention in February when Maya and its other AI assistant, Miles, were unveiled. Unlike traditional virtual assistants, these AI voices take breaths, pause naturally, and can even be interrupted mid-sentence, features similar to OpenAI’s Voice Mode, which aims to make AI interactions more human-like.

Sesame has secured funding from major investors, including Andreessen Horowitz, Spark Capital, and Matrix Partners. In addition to developing AI voice assistants, the company is also working on AI-powered smart glasses. These wearable devices, designed for all-day use, will integrate Sesame’s custom AI models to enhance user interactions.

As AI voice technology evolves, concerns over ethical use and security risks remain. With CSM-1B now open to the public, it remains to be seen how developers will use it, and whether safeguards will eventually be put in place to prevent misuse.
