Two recent papers (a DeepMind and Anthropic tag-team) and a failed $10k bet have reminded people not to underestimate what models can learn from the data you give them in the prompt. Let me show you how this can be harnessed to get better results, even if you don’t have great demonstrations at hand. Then we’ll see the bet that lost a start-up founder $10k but hopefully won’t fool you. We’ll end with Anthropic’s jailbreaking revelations and why even they think such jailbreaks may not be stoppable, which is relevant for any user or business.
Downloadable Video File for Off-line Viewing: https://drive.google.com/file/d/1DVw-9c1h5BcFjJf_1EEMAo5GxEa1bx8l/view?usp=sharing
Many-shot In-Context Learning, Google DeepMind: https://arxiv.org/pdf/2404.11018.pdf
Many-shot Jailbreaking: https://cdn.sanity.io/files/4zrzovbb/website/af5633c94ed2beb282f6a53c595eb437e8e7b630.pdf
Anthropic Jailbreaking Post: https://www.anthropic.com/research/many-shot-jailbreaking
$10k Challenge: https://twitter.com/VictorTaelin/status/1776248021858111542
Winning Prompt: https://twitter.com/futuristfrog/status/1778109834509832462
https://gist.github.com/GameDevGitHub/ffd826a4296008ba10d0df893f6fde62