A week in Generative AI: Meta, Mira & Voice
News for the week ending 29th September 2024
Itās been a big week this week with Metaās Connect 2024 event, where they announced the multimodal Llama 3.2. Mira Murati announced that she is leaving OpenAI and we also saw the release of ChatGPTās Advanced Voice Mode everywhere except Europe.
On the Ethics front Meta and Apple have both snubbed the EUās AI pact, Mark Zuckerberg claimed that publishers overestimate their value for AI training and some research shows that Using GPT-4 to generate 100 words consumes up to 3 bottles of water.
There are also some good long reads from The Guardian about humanoid robots, Sam Altman on The Intelligence Age, and The New York Times has a great article covering what Jony Ive has been up to where he shares heās working with Sam Altman on some new AI-powered hardware.
Meta Connect 2024
The big event of the week was Metaās Connect 2024 where they unveiled lots of new products and inevitably talked a lot about AI! Thereās some good coverage of everything they announced over at The Verge.
On the AI front, Meta announced Llama 3.2, which is now multimodal and can understand images. This allows it to interpret charts and graphs, caption images, and pinpoint objects in pictures given a simple description. Unfortunately, Llama 3.2 canāt be accessed in Europe, with Meta blaming the āunpredictableā nature of the EUās regulatory environment.
Meta AI is also getting celebrity voices. Itās not a touch on OpenAIās Advanced Voice Mode, but there are approved and endorsed clones of the voices of Awkwafina, Dame Judi Dench, John Cena, Keegan-Michael Key, and Kristen Bell that users can choose from. Iām sure itās not a shock to hear that Meta has paid millions to license these celebrity voices and theyāre making a big bet that this superficial customisation will compete with the better product experience from Advanced Voice Mode. Iām not so sureā¦
OpenAI CTO Mira Murati says sheās leaving firm to do her āown explorationā
I have a lot of respect for Mira Murati and I think of all the leadership team at OpenAI she came out of the firing/hiring of Sam Altman fiasco last year with her head held high. She also did a great job of leading the Spring Event earlier this year, bringing in her team, and showcasing some great new features that were in the pipeline. Itās sad to see her leaving, but Iām excited for what she will do next.
It looks like Mira leaves on great terms, with Sam Altman tweeting an effusive message and also announcing some other leadership changes. Itās been very fluid at the top of OpenAI this year, with Greg Brockman taking extended leave, Ilya Sutskever leaving and many other senior executives leaving the business.
This all comes amongst reports that OpenAI is to remove non-profit control and give Sam Altman equity in the business for the first time as they change from being a research outfit to more of a product business.
One thingās for sure, OpenAI is a very interesting company to watch!
OpenAI rolls out Advanced Voice Mode with more voices and a new look
First previewed back in May at OpenAIās Spring Event, OpenAI have released Advanced Voice Mode this week. Itās now available to Plus and Team subscribers, but unfortunately itās not available in the EU, , Switzerland, Iceland, Norway, or Liechtenstein.
Luckily it is available in the UK, so Iāve been able to test it and itās a lot of fun! I chose the āSolā voice, which was described as āsavvy and relaxedā. There was no noticeable lag, I was able to interrupt it speaking, and I could get it to SHOUT and whisper as well as sound excited. It leads to a much more dynamic and intuitive experience, very close to speaking to a real person.
Advanced Voice Mode doesnāt have the ability to detect emotions in your voice yet, but I think thatās something that will be coming with the video features that were also shown off at the event in May. I also love the fact that theyāve included a couple of British voices in there - āArborā is described as āeasy going and versatileā (although he sounds a bit like an east-end gangster to me š¤) and āValeā is described as ābright and inquisitiveā!
Overall, I really like it - much better than the original voice mode, but at this point I donāt think that will mean that it will use it much more than the original voice mode. Itās great for handsfree interactions, but it really needs to underpin my smartphoneās assistant to be of more use so I can invoke it without having to pick up my device. This of course will be coming, and Iām sure this type of technology will be powering the new Siri experience that is coming to iPhones later this year.
Googleās NotebookLM adds attachments, YouTube summaries, and audio overviews
Googleās NotebookLM has been an āexperimentā for a while now but theyāve just added a host of new features that make it worth checking out. First and foremost, itās a simple browser-based note taking app, but it has a lot of Generative AI built into it now.
As it stands, NotebookLM can automatically create notes from lots of different types of documents and YouTube videos. It can summarise them all for you and allow to ask questions about the content. Theyāve also got an interesting feature where they basically create a podcast episode about the source material, which is very nice.
For me, this is a great tool and itās great to see that weāre evolving beyond just a basic chat interface for interacting with large language models. This isnāt a note-taking app that has bolted a large language model on. This is a note-taking app thatās been build around a large language model and itās a great example of the types of software weāre going to be seeing in the coming years as generative AI technology starts to mature.
Robot hand can detach from arm, crawl over to objects, and pick them up
This is kind of cute, kind of creepy, all at the same time. Shame they didnāt save the video for halloween and ham in up a little more!
AI Ethics News
AI could be an existential threat to publishers - thatās why Mumsnet is fighting back
Mark Zuckerberg: creators and publishers āoverestimate the valueā of their work for training AI
Using GPT-4 to generate 100 words consumes up to 3 bottles of water
Re-opened Three Mile Island will power AI data centers under new deal
Social workers in England begin using AI system to assist their work
OpenAI Pitched White House on Unprecedented Data Center Buildout
Cloudflareās new marketplace will let websites charge AI bots for scraping
Long Reads
OpenAI - Building OpenAI o1
The Guardian - Why arenāt humanoids in our homes yet?
Sam Altman - The Intelligence Age
Stratechery - Enterprise Philosophy and The First Wave of AI
The New York Times - After Apple, Jony Ive Is Building an Empire of His Own
āThe future is already here, itās just not evenly distributed.ā
William Gibson