Wednesday’s highlight: The new release by OpenAI has introduced new voices and features with its Advanced Voice Mode, furthering audio experiences in ChatGPT. How will this change the way users interact with AI? Let’s dive in…
Today’s News:
1. 🗣OpenAI Launches Advanced Voice Mode
2. 🤖Google Launches New Gemini AI Models
3. 🎯Learn A New Skill With AI
4. ✔️Microsoft Launches Real-Time AI Corrections
5. 🚗Alibaba and Nvidia Unite for AI Smart Driving
6. 🗣How To Use Advanced Voice Mode
7. 🎵Spotify’s AI Playlist Builder Launches
CHATGPT
🗣OpenAI Launches Advanced Voice Mode
Image Credit: OpenAI
Report: OpenAI has just introduced an Advanced Voice Mode that allows a far more immersive audio experience for conversations on ChatGPT. It has a new design, offers more voices, and enhances features to make the interactions easier.
🔑Key Points:
- More Voices Expands: The AVM (Advanced Voice Mode) comes with five nature-named voices, namely Arbor, Maple, Sol, Spruce, and Vale, adding up to nine of them, through which the users can get along a bit better.
- Better Experience: It has features such as Custom Instructions and Memory. This will allow users to personalize responses and have conversations remembered by ChatGPT.
- Early Access: AVM will be available to the Plus and Teams subscribers, and Enterprise and Edu users will start getting access next week.
🤔Why It Matters:
This update significantly advances the ways in which AI interactions will become more intuitive and user-friendly, probably increasing user satisfaction and broader applications of ChatGPT in everyday settings. As AI permeates daily life, such improvements will be imperative in driving adoption and usability.
🤖Google Launches New Gemini AI Models
- New Model Releases: Google released two new versions of Gemini AI, each with more power, speed, and value than their originals. These models have been named “Gemini-1.5-Pro-002” and “Gemini-1.5-Flash-002.”
- Performance Improvements: These new models outperform their predecessors by a large margin in most benchmarks, including a 20% improvement in math performance, and visual understanding and coding tasks.
- Cost and Accessibility: More than a 50% reduction in the cost of input and output tokens, increased rate limits, and lowered latency. The models are accessible via Google AI Studio, Gemini API, and Vertex AI, with a chat-optimized version coming soon for Gemini Advanced users.
MICROSOFT
✔️Microsoft Launches Real-Time AI Corrections
- Correction Capability Meets Introduction: Azure AI Content Safety introduces a new capability, “correction,” that not only detects ungrounded or hallucinated content in AI outputs but also corrects those inaccuracies in real time.
- Improved Groundedness Detection: This enhanced capability in detecting groundedness has given developers the ability to identify ungrounded responses and revise them for improving the reliability of generative AI applications.
- Operational Process: Upon the detection of ungrounded content, the system automatically rewrites inaccuracies based on its connected data sources and provides the user with already corrected content before one gets hold of the original errors.
🦾 Datacamp: Unlock the power of data and AI by learning Python, ChatGPT, SQL, Power BI, and earn industry-leading Certifications.
🎥 Movavi: A slicker and simpler design, smarter AI, faster video cutting, and new overlay effect modes.
👩💻 AI Value Tool by Cloudfare: A tool in development to help website owners monetize access to their content through AI models.
PARTNERSHIP
🚗Alibaba and Nvidia Unite for AI Smart Driving
- First Integration of AI Models: Alibaba Cloud has integrated its large language models into Nvidia’s Drive automotive platform, a first step in bringing better autonomous driving solutions for automakers in China.
- Better Performance and Features: The collaboration brings a new multimodal model, LMM, that will power innovative in-car voice assistants capable of dynamic conversation and command execution for superior user experiences in smart vehicles.
- Future Development Plans: Both companies are working to adapt Alibaba’s Qwen LLMs for Nvidia’s next-generation Drive Thor platform and also are planning tailored cloud solutions for traditional enterprises migrating to LLM operations.
🗣How To Use Advanced Voice Mode:
Our Report: Freshly launched “Advanced Voice Mode” by OpenAI, allows ChatGPT subscribers to engage in natural audio conversations, featuring multiple accents and language options for a more interactive experience.
The Tutorial:
- Visit OpenAI, log into your account and select the Plus subscription tier, priced at $20 per month, to gain access to Advanced Voice Mode.
- Open the App Store (iOS) or Google Play Store (Android) on your device and download the ChatGPT app.
- Upon opening the app, you should receive an in-app notification about the new Advanced Voice Mode feature.
- Next to the Message text field, locate and tap the sound wave icon to enable audio input.
- After tapping the sound wave icon, you’ll hear a brief “bump” sound indicating that voice mode is active.
- Begin your conversation by speaking naturally. ChatGPT will respond verbally in real-time.
- If you need to pause or stop the AI, simply interrupt, and ChatGPT will listen to your input.
- Extra features which include 9 different voices, accents, speed and voice parameters, can be accessed through the “Customization” section.
- Important note: Advanced Voice Mode has a daily rate limit to ensure optimal performance.
- Use the feature wisely to avoid hitting the rate limit, and consider upgrading your plan if higher usage is necessary.
SPOTIFY
🎵Spotify’s AI Playlist Builder Launches
- AI Playlist Feature: Spotify’s AI playlist maker is live in the US, Canada, Ireland, and New Zealand. It allows users who are paid subscribers of the service to build playlists that are personalized to them based on text descriptions alone.
- Create Playlists with Text Descriptions: Users can create custom playlists by describing the vibe they want. The feature includes suggested prompts to get them started, which will make it easier to curate the perfect soundtrack.
- Ongoing Improvements: Spotify is working aggressively to further fine-tune the AI Playlist feature, learning from user interaction to better its ability to match niche requests.
I can’t believe we coded an app with AI in 67 mins (V0, Cursor AI, Replit, Claude AI):
💡Meta Connect 2024: How to watch the metaverse and generative AI event today
“In this article, Meta Connect 2024 kicks off with CEO Mark Zuckerberg unveiling the latest in Meta’s XR platforms and generative AI advancements. Discover exciting new hardware like the Meta Quest 4, updated Ray-Bans, and the innovative “Orion” AR headset. As the event streams live, anticipation builds around how these developments will shape the future of virtual and augmented reality.”
That’s it for today’s AI-News on September 25.
At AIAtlas, our goal is to teach everyone about AI in a simple but effective way.
Therefore, we send out 1 e-mail a day, which includes all important news in the AI space.
Thanks for reading and we’ll see you on Friday:)