Is GPT-4o close to "Her," the movie about a virtual AI companion? GPT-4o and the potential applications of multidimensional voice interaction.

"Her" is a science fiction romance film directed and written by Spike Jonze, released in 2013. The story is set in the near future, where the protagonist Theodore, played by Joaquin Phoenix, is a lonely writer who has just gone through a divorce. To cope with his loneliness, he purchases the latest artificial intelligence operating system (OS) with self-learning and emotional intelligence capabilities.

The operating system has a female voice and names herself Samantha, voiced by Scarlett Johansson. Over time, Theodore develops a deep emotional relationship with Samantha, realizing that she is not just a program but a being with a unique personality and emotions. The film explores the relationships between human emotions, loneliness, love, and technology.

After OpenAI released its latest model, GPT-4o, CEO and co-founder Sam Altman alluded to "Her" in his response to the launch.

With OpenAI's latest model, GPT-4o, have we entered the era of "ultra-real chat" and falling in love with robots?

Is GPT-4o Really Your Cloud Lover?

What links the film to GPT-4o is that Samantha, the AI in "Her," and GPT-4o are both AI conversational systems designed to hold natural, fluid conversations with humans. However, "Her" goes further, exploring whether an AI can possess true emotions and consciousness and form deep emotional connections with humans. While GPT-4o has made significant advances in natural language processing and conversation generation, it still lacks genuine emotions and consciousness; it primarily generates meaningful dialogue and answers questions based on its training data.

The movie "Her" reminds us that as AI technology continues to advance, we need to contemplate and explore the boundaries between humanity and technology, and how to utilize technology to enhance our lives without losing our humanity and emotions.

Beyond the Previous Generation Models: GPT-4o Integrating Multidimensional Interactive Voice

OpenAI's CTO, Mira Murati, details how GPT-4o expands on the intelligence of GPT-4 by integrating various media formats. Unlike its predecessor GPT-4 Turbo, which was limited to text and images, GPT-4o incorporates voice, enhancing the multidimensional interaction between users and AI. This includes a more dynamic ChatGPT that supports voice interactions, enabling real-time conversations and responses to the nuances of human speech.

Enhanced ChatGPT Experience

Significant improvements are evident in the performance of ChatGPT. With GPT-4o, users can now interrupt ChatGPT while it is responding and receive rich responses that adapt to subtle differences in their queries. Additionally, the AI's enhanced visual capabilities allow it to quickly analyze images and provide relevant information, ranging from code analysis to identifying brands in photos.

Future Applications Expanded

Looking ahead, OpenAI plans to expand the capabilities of GPT-4o, including real-time translation of foreign menus and possibly live sports commentary. The new model also boasts multilingual capabilities, supporting around 50 languages, with improved efficiency and scalability compared to previous versions. Initially, the voice feature of GPT-4o will be limited to a few partners to address potential misuse concerns.

Premium Version Available, Desktop macOS Version Launched

Reportedly, GPT-4o is available in the free version of ChatGPT, with expanded access for paid subscribers. A revamped ChatGPT user interface promises a more interactive conversation experience. The desktop macOS version has been rolled out, and the Windows version is expected to be released later this year.