A week in Generative AI: Meta, Mira & Voice
News for the week ending 29th September 2024
It’s been a big week, with Meta’s Connect 2024 event, where they announced the multimodal Llama 3.2. Mira Murati announced that she is leaving OpenAI, and we also saw the release of ChatGPT’s Advanced Voice Mode everywhere except Europe.
On the ethics front, Meta and Apple have both snubbed the EU’s AI pact, Mark Zuckerberg claimed that publishers overestimate their value for AI training, and some research shows that using GPT-4 to generate 100 words consumes up to 3 bottles of water.
There are also some good long reads: The Guardian on humanoid robots, Sam Altman on The Intelligence Age, and a great article from The New York Times covering what Jony Ive has been up to, in which he shares that he’s working with Sam Altman on some new AI-powered hardware.
Meta Connect 2024
The big event of the week was Meta’s Connect 2024 where they unveiled lots of new products and inevitably talked a lot about AI! There’s some good coverage of everything they announced over at The Verge.
On the AI front, Meta announced Llama 3.2, which is now multimodal and can understand images. This allows it to interpret charts and graphs, caption images, and pinpoint objects in pictures given a simple description. Unfortunately, Llama 3.2 can’t be accessed in Europe, with Meta blaming the “unpredictable” nature of the EU’s regulatory environment.
Meta AI is also getting celebrity voices. It’s not a touch on OpenAI’s Advanced Voice Mode, but there are approved and endorsed clones of the voices of Awkwafina, Dame Judi Dench, John Cena, Keegan-Michael Key, and Kristen Bell that users can choose from. I’m sure it’s not a shock to hear that Meta has paid millions to license these celebrity voices, and they’re making a big bet that this superficial customisation will compete with the better product experience from Advanced Voice Mode. I’m not so sure…
OpenAI CTO Mira Murati says she’s leaving firm to do her “own exploration”
I have a lot of respect for Mira Murati, and I think that, of all the leadership team at OpenAI, she came out of last year’s fiasco over the firing and rehiring of Sam Altman with her head held high. She also did a great job of leading the Spring Event earlier this year, bringing in her team, and showcasing some great new features that were in the pipeline. It’s sad to see her leaving, but I’m excited for what she will do next.
It looks like Mira leaves on great terms, with Sam Altman tweeting an effusive message and also announcing some other leadership changes. It’s been very fluid at the top of OpenAI this year, with Greg Brockman taking extended leave, Ilya Sutskever leaving, and many other senior executives departing the business.
This all comes amid reports that OpenAI is to remove non-profit control and give Sam Altman equity in the business for the first time as they change from being a research outfit to more of a product business.
One thing’s for sure, OpenAI is a very interesting company to watch!
OpenAI rolls out Advanced Voice Mode with more voices and a new look
Advanced Voice Mode, first previewed back in May at OpenAI’s Spring Event, has been released this week. It’s now available to Plus and Team subscribers, but unfortunately it’s not available in the EU, Switzerland, Iceland, Norway, or Liechtenstein.
Luckily it is available in the UK, so I’ve been able to test it and it’s a lot of fun! I chose the “Sol” voice, which was described as “savvy and relaxed”. There was no noticeable lag, I was able to interrupt it while it was speaking, and I could get it to SHOUT and whisper as well as sound excited. It leads to a much more dynamic and intuitive experience, very close to speaking to a real person.
Advanced Voice Mode doesn’t have the ability to detect emotions in your voice yet, but I think that’s something that will be coming with the video features that were also shown off at the event in May. I also love the fact that they’ve included a couple of British voices in there - “Arbor” is described as “easy going and versatile” (although he sounds a bit like an east-end gangster to me) and “Vale” is described as “bright and inquisitive”!
Overall, I really like it - it’s much better than the original voice mode, but at this point I don’t think that will mean that I will use it much more than the original voice mode. It’s great for hands-free interactions, but it really needs to underpin my smartphone’s assistant to be of more use, so I can invoke it without having to pick up my device. This of course will be coming, and I’m sure this type of technology will be powering the new Siri experience that is coming to iPhones later this year.
Google’s NotebookLM adds attachments, YouTube summaries, and audio overviews
Google’s NotebookLM has been an “experiment” for a while now, but they’ve just added a host of new features that make it worth checking out. First and foremost, it’s a simple browser-based note-taking app, but it has a lot of Generative AI built into it now.
As it stands, NotebookLM can automatically create notes from lots of different types of documents and YouTube videos. It can summarise them all for you and allow you to ask questions about the content. There’s also an interesting feature where it basically creates a podcast episode about the source material, which is very nice.
For me, this is a great tool and it’s great to see that we’re evolving beyond just a basic chat interface for interacting with large language models. This isn’t a note-taking app that has bolted a large language model on. This is a note-taking app that’s been built around a large language model and it’s a great example of the types of software we’re going to be seeing in the coming years as generative AI technology starts to mature.
Robot hand can detach from arm, crawl over to objects, and pick them up
This is kind of cute, kind of creepy, all at the same time. Shame they didn’t save the video for Halloween and ham it up a little more!
AI Ethics News
AI could be an existential threat to publishers - that’s why Mumsnet is fighting back
Mark Zuckerberg: creators and publishers “overestimate the value” of their work for training AI
Using GPT-4 to generate 100 words consumes up to 3 bottles of water
Re-opened Three Mile Island will power AI data centers under new deal
Social workers in England begin using AI system to assist their work
OpenAI Pitched White House on Unprecedented Data Center Buildout
Cloudflare’s new marketplace will let websites charge AI bots for scraping
Long Reads
OpenAI - Building OpenAI o1
The Guardian - Why aren’t humanoids in our homes yet?
Sam Altman - The Intelligence Age
Stratechery - Enterprise Philosophy and The First Wave of AI
The New York Times - After Apple, Jony Ive Is Building an Empire of His Own
“The future is already here, it’s just not evenly distributed.”
William Gibson