Two GPT-4os interacting and singing

3 minutes · New to AI · ChatGPT & LLMs

OpenAI. Two instances of voice mode talking to each other, one of which has camera access to describe the room. Three minutes long and the most efficient way to internalise what makes voice mode different from old "press the microphone, wait, listen" interfaces — interruption, tone, music, real-time vision, all in one clip.

AI Expert note

Keep this as a short intuition pump only. Do not treat it as evidence that a production voice agent can safely handle real users without disclosure, logging boundaries, fallback and human escalation.

What you should get from this

See multimodal voice interaction quickly, especially interruption, tone and camera-aware conversation.

Watch or know first

Know that the behavior shown in the demo may differ from what the product offers in your region and on your account tier.

Watch next

Continue through the same learning path with the next curated companion videos.

Live demo of GPT-4o vision capabilities

Four minutes of someone holding up a handwritten linear equation to the camera and ChatGPT tutoring them through it without giving the answer.

"Generative AI" is not what you think it is

A developer-essayist works through the "AI is just slop / AI is magical / AI is theft" trio of myths with patience and code on the screen.

Sam Altman | This Past Weekend w/ Theo Von #599

Understand that chatbot conversations are not automatically private, privileged or safe for sensitive business details.

Related videos

Andrej Karpathy: Software Is Changing (Again)

Deep Dive into LLMs like ChatGPT

The New, Smartest AI: Claude 3 – Tested vs Gemini 1.5 + GPT-4

How I use LLMs

Take it further

Hand-picked external courses that go deeper on this topic.

Coursera · DeepLearning.AI

Generative AI for Everyone

Hands-on time inside an LLM, learning to prompt deliberately and recognise where generative AI is genuinely useful versus where it's a trap. Calm, no-hype teaching: the perfect bridge from "I've tried ChatGPT once" to "I use it every day with confidence."

Beginner · ~5 hours · Verified 25 days ago

Coursera · DeepLearning.AI + AWS

Generative AI with Large Language Models

Antje Barth · Shelbee Eigenbrode · Mike Chambers

When practitioners ask "what should I take if I'm serious about building with LLMs?", this is the answer. Mathematically honest without being a research paper; AWS-flavoured deployment chapters stay useful even if you'll never touch SageMaker.

Advanced · ~16 hours · Verified 25 days ago

Anthropic Academy

MCP: Build Rich-Context AI Apps with Anthropic

MCP is the protocol that's quietly replacing one-off tool integrations across the AI tooling ecosystem. Learn it from the source. By the end you'll have built and deployed your own MCP server, connected an LLM client to it, and understood why this standard is the closest thing the field has to USB-C.

Intermediate · ~3 hours · Verified 25 days ago
