Here are the biggest AI launches of 2024 and what they mean for the future of work
1. Multimodality (Google Stream Realtime / OpenAI Video Screenshare):
Multimodal AI will be one of the most significant changes in how knowledge workers work.
You can now share live videos or screen share, so AI can be your personal assistant, helping you complete tasks through text, audio, or visual.
I’ve done countless demos with Google Stream Realtime; it will have a huge impact on sales, customer support/success, and education.
2. AI Search (ChatGPT Search / Gemini Flash):
I’ve discussed this for 2 years; AI will change how we retrieve information. Google’s blue links were great, but AI is much better. It does the hard work for you. It searches across the links and gives you a clear and concise answer.
We’re already seeing the impact. I’ve seen companies lose 10% to 30% of their organic traffic in certain markets.
ChatGPT launched search engine, and Google’s updated Gemini model continues to make AI search the internet’s front door.
3. Workflow AI Agents (Google’s Project Mariner / Claude’s Computer Use):
We’ve seen leaders of large tech companies predict AI will eventually be able to do anywhere between 30% to 80% of all current knowledge work.
So how do we get there?
Eventually, AI will ingest data from your browser and hardware device to learn what you do and then do it for you.
Google’s Project Mariner and Claude’s Computer use gave us glimpses of this future.
4. Text to Video (OpenAI Sora / Google’s Veo):
I predicted 2024 would be the year of text-to-video models.
And December saved my prediction :). We got OpenAI’s Sora and Google’s Veo, which was a surprise.
Applying video in more creative ways across your business will be a huge unlock. These models showcase what the future could look like.
The big surprise? It looks like Google has outshone OpenAI, with Veo getting much better reviews than Sora.
A couple of bonuses (smaller but still very cool)
Claude’s Artifacts and Project features were awesome. It allows you to easily create a multitude of different AI assistants to help you with varying tasks across your life.
You can see how good they were with OpenAI, copying them with Canvas and Projects.
Also, I couldn’t end the post without calling out Claude’s (and Kieran’s) launch of writing styles available right from the Claude interface.
Strap in for 2025; it will be a wild year for AI with Elon due to the launch Grok 3, a model people believe will be a huge leap in intelligence from current models.