A new paper from the last few days has dropped, and it's a good one. LLMs can now be said to plan, and I have all the analysis as well as exclusive clips from my interview with the lead author. And I don't believe any one else has reported that this breakthrough performance from o1 now exceeds average human performance for this core task.
Link for Off-line Viewing and Download: https://drive.google.com/file/d/1j6pRKnVcEywTodONO3L2zrerIvIW6l8g/view?usp=sharing
LeCun Tweet: https://x.com/ylecun/status/1832860107925024789
LRM Paper: https://arxiv.org/pdf/2409.13373
Original Paper: https://proceedings.neurips.cc/paper_files/paper/2023/file/efb2072a358cefb75886a315a6fcf880-Paper-Conference.pdf
o1 Calculations: https://x.com/yuntiandeng/status/1836114401213989366
Rao Analysis: https://x.com/rao2z/status/1838248409146507353
Fast Downward System: https://arxiv.org/pdf/1109.6051
Andrew Salinas
2024-09-30 02:50:35 +0000 UTCPacert
2024-09-27 06:26:12 +0000 UTCDoug97
2024-09-26 15:26:05 +0000 UTCMartin Fjeldbonde
2024-09-26 13:20:49 +0000 UTCJonathan Kirk
2024-09-25 22:25:19 +0000 UTCBilly N
2024-09-25 20:48:42 +0000 UTCTom English
2024-09-25 19:19:09 +0000 UTCMichal Babula
2024-09-25 11:41:07 +0000 UTCPhilip
2024-09-25 09:06:21 +0000 UTCMichael Cho
2024-09-25 06:49:00 +0000 UTCErik
2024-09-25 00:12:59 +0000 UTCLee FRASER
2024-09-24 20:18:14 +0000 UTCGabor Melli
2024-09-24 20:07:08 +0000 UTCNikolai NM
2024-09-24 19:34:13 +0000 UTCAlexis Olson
2024-09-24 17:25:22 +0000 UTCPhilip
2024-09-24 16:14:27 +0000 UTCPhilip
2024-09-24 16:14:14 +0000 UTCPhilip
2024-09-24 16:13:55 +0000 UTCChristian Hendriksen
2024-09-24 15:44:27 +0000 UTCBarnaby Golden
2024-09-24 15:19:43 +0000 UTCSteveHaupt
2024-09-24 14:48:37 +0000 UTCAndré Thieme
2024-09-24 14:43:26 +0000 UTCNorfuer
2024-09-24 14:26:10 +0000 UTC