I quiz Google DeepMind Principal Scientist Prof. Tim Rocktäschel on AGI to ASI timelines, promptbreeding, GDP 2x-ing, Gemini 2 and automating science.
https://geni.us/ArtificialIntelligence
https://en.wikipedia.org/wiki/Summa_Technologiae
Laura Ruis - Procedural Knowledge in Pretraining Drives Reasoning in Large Language Models - https://arxiv.org/pdf/2411.12580
MLE-Bench: https://arxiv.org/pdf/2410.07095
AIDE - https://www.weco.ai/blog/technical-report
Reversal Curse: https://arxiv.org/abs/2309.12288
Reasoning or Reciting? Counter-factuals: https://arxiv.org/abs/2307.02477
Rainbow Teaming - LLM-based adversarial prompt generation - https://arxiv.org/abs/2402.16822
Sakana AI: Towards Fully Automated Open-Ended Scientific Discovery - https://arxiv.org/abs/2408.06292
ChemCrow: Augmenting large language models with chemistry tools - https://arxiv.org/pdf/2304.05376
BALROG: Benchmarking Agentic LLM and VLM Reasoning on Games - https://arxiv.org/pdf/2411.13543
Promptbreeder - Self-referential LLM prompt evolution - https://arxiv.org/abs/2309.16797
Chain-of-Thought Prompting - Improving LLM reasoning through prompting - https://arxiv.org/abs/2201.11903
Many-shot In-Context Learning: https://arxiv.org/abs/2404.11018
NetHack Learning Environment - RL research in procedurally generated game - https://arxiv.org/abs/2006.1376
GENIE - Generative interactive environment from unlabelled videos - https://arxiv.org/abs/2402.15391