AIExplained

Is o1 No Longer a LLM? LeCun + New 'LRM' paper explained (+ exclusive interview clips)

Added 2024-09-24 14:05:00 +0000 UTC

A new paper dropped in the last few days, and it's a good one. LLMs can now be said to plan, and I have all the analysis as well as exclusive clips from my interview with the lead author. And I don't believe anyone else has reported that this breakthrough performance from o1 now exceeds average human performance on this core task.

Link for Off-line Viewing and Download: https://drive.google.com/file/d/1j6pRKnVcEywTodONO3L2zrerIvIW6l8g/view?usp=sharing

LeCun Tweet: https://x.com/ylecun/status/1832860107925024789

LRM Paper: https://arxiv.org/pdf/2409.13373

Original Paper: https://proceedings.neurips.cc/paper_files/paper/2023/file/efb2072a358cefb75886a315a6fcf880-Paper-Conference.pdf

o1 Calculations: https://x.com/yuntiandeng/status/1836114401213989366

Rao Analysis: https://x.com/rao2z/status/1838248409146507353

Fast Downward System: https://arxiv.org/pdf/1109.6051

Comments

When do you think true AGI will arrive?

Andrew Salinas

2024-09-30 02:50:35 +0000 UTC

I loved the video and now feel excited and a bit nervous about where this new LRM domain is heading. I have one question that is still unclear to me. It seems that LRMs do not improve performance outside of settings with a provably correct answer, as in math or science, on which RL can be performed. I would have expected these abilities to increase as well, given that the reasoning strategies were reinforced. But maybe the issue is actually how to evaluate such a setting. I am thinking of tasks like "plan my next vacation, I like x, y, z and have $3,000 or so", where there is no right answer for the final location, only a question of how well the model derived its answer from the initial conditioning (the user's preferences). Sorry for the wall of text.

Pacert

2024-09-27 06:26:12 +0000 UTC

Good video. Will it make its way onto YouTube?

Doug97

2024-09-26 15:26:05 +0000 UTC

Great video again! I have just watched your video on Situational Awareness. Would love to hear whether your analysis of Leopold's claims has changed with the release of o1?

Martin Fjeldbonde

2024-09-26 13:20:49 +0000 UTC

I am glad that the researchers pointed out the cost and time constraints with o1, but I think some leniency is also due. New technologies and frontiers were never meant to be efficient right away. So who knows what this will look like once the chips have been improved, the micro-nuclear plants built, and the models scaled up. This decade will not be forgotten.

Jonathan Kirk

2024-09-25 22:25:19 +0000 UTC

I was talking with Beth Rudden from Bast AI on hybrid symbolic/LLM systems for critical work. Would be worth talking/interviewing her.

Billy N

2024-09-25 20:48:42 +0000 UTC

It's worthwhile to review Andrej Karpathy's notion, sketched 10 months ago in a YouTube talk (starting at 42:15), of an "LLM OS" with an LLM "kernel process" making use of tools: https://youtu.be/zjkBMFhNj_g?si=kzFx73VBJ-aXPdvy&t=2535

Tom English

2024-09-25 19:19:09 +0000 UTC

I think I will use o1 models more often (as part of an agent's workflow) when APIs become easier to access.

Michal Babula

2024-09-25 11:41:07 +0000 UTC

Don't crack out 2001 unless it's worth it!

Philip

2024-09-25 09:06:21 +0000 UTC

Loving the background music to this video :)

Michael Cho

2024-09-25 06:49:00 +0000 UTC

You know it's serious when Philip highlights words like "quantum improvement" in the first 5 minutes of his video, and doesn't specify any caveats until much later!

Erik

2024-09-25 00:12:59 +0000 UTC

Thanks for another fantastic exclusive to your old and new Patreons.

Lee FRASER

2024-09-24 20:18:14 +0000 UTC

Now we (likely) know what Ilya (and the board?) saw.

Gabor Melli

2024-09-24 20:07:08 +0000 UTC

I had a wish come true here: perspectives on planning :)

Nikolai NM

2024-09-24 19:34:13 +0000 UTC

Next-gen LRM + competent tool use is going to be quite powerful.

Alexis Olson

2024-09-24 17:25:22 +0000 UTC

Yeah, which boils down to the richness of the training data

Philip

2024-09-24 16:14:27 +0000 UTC

I agree totally

Philip

2024-09-24 16:14:14 +0000 UTC

I did, yeah, at like -30 decibels to normal vol.

Philip

2024-09-24 16:13:55 +0000 UTC

Sneaky Philip

Christian Hendriksen

2024-09-24 15:44:27 +0000 UTC

If reasoning can be scaled using inference cost, does the bottleneck become the world-model?

Barnaby Golden

2024-09-24 15:19:43 +0000 UTC

I have the impression that most people don't realize what the new capabilities of o1 are. Thank you for pointing them out with very good examples. I wonder what the limit of LRM will be.

SteveHaupt

2024-09-24 14:48:37 +0000 UTC

I loved this humor. A video about planning, yet Phil’s videos are unplanned :-)

André Thieme

2024-09-24 14:43:26 +0000 UTC

Was I just hearing things or did you actually start playing the 2001 song on a very low volume? Wow.

Norfuer

2024-09-24 14:26:10 +0000 UTC
