robertskmiles posts from patreon

Early Access: 6 New AI Safety Talks

Lots of stuff in the pipeline, but to start with here's a series of 6 new talks from Evan Hubinger (author of Risks From Learned Optimization, the Mesa-Optimizers paper), going through a load of interesting AGI Safety Ideas.

https://www.youtube.com/watch?v=NmDRFwRczVQ&list=PLtlVeM84bZ6RLSR6oaQnbZ7FSwb-hkapx

Probably Patreon will just embed the first...

2023-05-10 15:12:29 +0000 UTC View Post

Computerphile Video: ChatGPT

I made a new video with Computerphile, about ChatGPT!

https://www.youtube.com/watch?v=viJt_DXTfwA

Language Models, Simulacra, Reinforcement Learning from Human Feedback, Scaling and Inverse Scaling, Sycophancy and Deception and Instrumental Convergence!

2023-02-01 16:44:05 +0000 UTC View Post

Video: Why AI Systems Lie, and What We Can Do About It

New main channel video! I'm still doing these!

This one's about how it's surprisingly hard to make AI systems that consistently tell the truth, and the safety problem that poses.

https://youtu.be/w65p_IIp6JY

I'm also doing a watch party for this video on the Discord at 9pm UK time (about 45 minutes from now). https://discord.gg/3hrvdsdZ

2022-12-05 20:19:02 +0000 UTC View Post

Podcast Guest Appearance: The Inside View

Michael Trazzi interviewed me for his podcast The Inside View! We talked about what you'd expect, I suppose: YouTube, AI, Stampy, all that good stuff. He also gave me a t-shirt?

https://youtu.be/DyZye1GZtfk

2022-08-19 22:25:12 +0000 UTC View Post

Second Channel Video: Why Don't I Give Specifics?

You might notice I don't talk about specific AI takeover scenarios, with hacking, bribery, nanotechnology etc. Why is that?

https://youtu.be/JVIqp_lIwZg

2022-08-15 00:57:12 +0000 UTC View Post

Behind The Scenes: Unboxing a Gimbal Camera

I got a new camera for filming low-friction videos, and took it to the zoo! It takes a while to get it set up; the test footage starts at 9:20.

https://www.youtube.com/watch?v=P0iBe_ts_ds

2022-07-19 18:09:25 +0000 UTC View Post

Second Channel/Short Video: Would You Help an AI to Escape?

I'M NOT DEAD! I did get COVID though. But I'm ok now.

I'm still working on the next main channel video, which turned out to be much harder than I expected. In the meantime here is a short video about current events (which aren't even that current now that I've got the video made). This video is an experiment in shooting footage to publish in both vertical and horizontal formats at the same time on different platforms. So I tried to shoot in 4K 16:9 and then also crop down to 9:16, but w...

2022-07-07 15:18:13 +0000 UTC View Post

Video: Second Research Talk by John Wentworth

This is a second talk following up the one I posted here: https://www.patreon.com/posts/62888854

This is somehow even lower production value than the last one, since the camera was stuck in a weird mode where it kept trying to adjust the focus and exposure. But the content is just as interesting if not more so, so I'm posting it anyway!

2022-02-22 23:58:55 +0000 UTC View Post

Video: Research Talk by John Wentworth

I'm at a research retreat right now with a few independent alignment researchers! The other day John Wentworth presented a talk giving his take on the alignment problem, and we filmed it just to have a record. I took photos of the flipchart after and edited them in. So the result is long, has relatively low production values, and gets technical in places, but it covers some really fascinating material so I thought at least some of you would be interested!

2022-02-22 06:06:55 +0000 UTC View Post

Early Access Short: Win $50k for Solving a Single AI Problem?

https://youtu.be/HYtJdflujjc
A quick short to let people know that The Alignment Research Center is offering prizes of up to $50k for proposals for their Eliciting Latent Knowledge problem!
It's a very interesting report, and I hope to get a full video made about it, but I couldn't get that done before the prize deadline.

The report:

2022-02-08 06:45:01 +0000 UTC View Post

Early Access 2nd Channel Video: There's No Rule That Says We'll Make It

I get frustrated sometimes. A long time ago I made this video rant about it, but the production quality was far too low to publish it so I only released it to patrons. I've now (largely thanks to urging from Nick and Nicole) gone back and re-made it properly, and this is that.

https://youtu.be/JD_iA7imAPs

2022-01-15 13:40:46 +0000 UTC View Post

Bootcamp to learn ML for Alignment

Some friends of mine are running a (free) bootcamp "to bring people interested in AI Alignment up-to-speed with the state of modern ML engineering", and they thought my Patreon would be a quality source of candidates. This is a great way to get started learning the skills to do actual AI Safety work!

The deadline to apply is *extremely* soon though, November 15th, so apply right now if you're going to!

2021-11-14 21:03:35 +0000 UTC View Post

Video: Experimental TikTok about GitHub Copilot

Who here is on TikTok? I'm experimenting with it a bit, here's an attempt at a video about misalignment and GitHub Copilot!

My impression so far is:
- 3 minutes is a little limiting, but not bad at all
- the on-device editing tools are no good
- but, the shorter format has lower expectations and might make it easier to make a lot more videos with less perfectionism

What do you think?

https://vm.tik...

2021-11-13 17:26:43 +0000 UTC View Post

Behind the Scenes: Unboxing a new audio recorder! Zoom F2-BT

I got a new audio recorder and microphone, which works unlike any other I've used!
Thank you for your support :)

https://www.youtube.com/watch?v=sMYNjttnY4w

2021-10-22 11:48:07 +0000 UTC View Post

Early Access Video: We Were Right! Real Inner Misalignment

Some people ran real versions of the thought experiments from the 'Mesa-Optimisers' videos! What they found won't shock you (if you've been paying attention)

https://youtu.be/zkbPdEHEyEI

Also UPCOMING EVENT TODAY, we'll have another Live Watch Party in one hour (at 10pm UK time, 2pm pacific, 5pm eastern). We'll watch the new video at the same time here:

2021-10-08 20:00:03 +0000 UTC View Post

New(ish) Video: Intro to AI Safety, Remastered

I spent the last 3 weeks or so in the USA, doing various things which have given me a lot of video fuel but no actual videos, so here's a remaster of an existing second channel video:

A while back I gave a talk at "AI and Politics" in London, which I think was a decent introduction to AI Safety. I published a quick version of it on my second channel, which probably a lot of you have seen, being patrons, but most people didn't. So I've remastered that video, with better editing, better gra...

2021-06-22 15:36:58 +0000 UTC View Post

Early Access Video: Deceptive Misaligned Mesa-Optimisers? It's More Likely Than You Think...

The previous video explained why it's *possible* for trained models to end up with the wrong goals, even when we specify the goals perfectly. This video explains why it's *likely*.

https://youtu.be/IeWljQw3UgQ

Also UPCOMING EVENT TODAY, we'll have another Live Watch Party in one hour (at 10pm UK time, 2pm pacific, 5pm eastern). We'll watch the new video at the same time here:

2021-05-22 20:01:00 +0000 UTC View Post

I'm Looking to Hire a Video Editor!

If you're good at video editing, especially using open source software like Kdenlive and Blender, consider applying, to help me make more videos more quickly! :)

http://www.nonlinearfund.org/videoeditor.html

(Application deadline is April 2nd)

2021-03-27 18:45:57 +0000 UTC View Post

Early Access Video: The OTHER AI Alignment Problem: Mesa-Optimizers and Inner Alignment

This "Alignment" thing turns out to be even harder than we thought.
https://youtu.be/bJLcIBixGj8

Also UPCOMING EVENT TODAY, we will have an experimental Live Watch Party on the Discord in one hour (at 9pm UK time, 1pm pacific, 4pm eastern). We'll watch the new video at the same time here:
https://sync-tube.d...

2021-02-13 20:03:36 +0000 UTC View Post

Early Access Video: Quantilizers: AI That Doesn't Try Too Hard

How do you get an AI system that does better than a human would, without doing anything a human wouldn't?

A follow-up to "Maximizers and Satisficers": https://youtu.be/Ao4jwLwT36M

The Paper: https://intelligence.org/files/QuantilizersSaferAlternative.pdf

2020-12-05 15:48:51 +0000 UTC View Post

BTS Video: New Camera! Unboxing, Testing, and Modification

My camera died in the middle of shooting the new video! So I had to get a new one - this is that.

Thanks for making it possible for me to just buy new equipment when I need it!

https://youtu.be/JgmHftE4Bs0

2020-10-05 16:28:15 +0000 UTC View Post

Patreon Exclusive: Discord Server

I'm starting a Discord Server! It's been running for a little bit for patrons at the 'Advisor', 'Hero' and 'Personal thank you in a video' support levels, and now I'm opening it up a little wider, to include the 'Thank you in videos' tier.

I made a video explaining the idea of the Discord, please do watch it before joining! https://discord.gg/numraBv

2020-09-22 16:10:01 +0000 UTC View Post

Early Access Video: Sharing the Benefits of AI: The Windfall Clause

AI might create enormous amounts of wealth, but how is it going to be distributed?

https://youtu.be/7i_f4Kbpgn4

There's also a patreon-exclusive video of the full conversation I had with the author of this paper, Cullen O'Keefe. When it's done uploading, that will be here: https://youtu.be/f2HunLXhPEQ

2020-07-03 14:56:34 +0000 UTC View Post

Announcement: Recording for Computerphile about GPT-3

Just thought I should let patrons know that we've had a go at recording Computerphile remotely, talking about OpenAI's new GPT-3 model.

I wanted to include a photo but there's really nothing to show since it's just me at my regular desk with Sean on the phone :/

Anyway hopefully that will be published soon, I'll link it here when it's out!

2020-06-17 10:40:58 +0000 UTC View Post

Early Access Video: 10 Reasons to Ignore AI Safety

Why do some ignore AI Safety? Let's look at 10 reasons people give (adapted from Stuart Russell's list).

https://www.youtube.com/watch?v=9i1WlcCudpU

Enjoy!

2020-05-31 15:54:50 +0000 UTC View Post

Answering your AI Safety Questions (Part 2)

Answering the other half of the questions you asked: Moral philosophy, biological intelligence enhancement, nukes, regulating compute clusters, lethal autonomous weapons, etc!

https://www.youtube.com/watch?v=0mxFgJGjk0E

2020-05-07 19:12:16 +0000 UTC View Post

Early Access Video: 9 Examples of Specification Gaming

AI systems do what you say, and it's hard to say exactly what you mean.

Let's look at a list of real life examples of specification gaming!

https://youtu.be/nKJlF-olKmg

2020-04-25 20:27:15 +0000 UTC View Post

Patron Exclusive (obviously): Quarantine Haircut

I shot a video, so I had to cut my hair first. For some reason I filmed it, and now you can watch that. This is what we have become. This is The New Normal.

Seriously though this is just me cutting my hair, feel free to skip it if that sounds like it's not your preferred kind of weird.

This was in preparation for a proper video which is coming out extremely soon.

https://youtu.be/2DTexidHa6o

2020-04-25 20:25:26 +0000 UTC View Post

Answering your AI Safety Questions (1)

You asked, I answered :)

I ended up shooting a lot of footage for this, so this video is part 1 of 2.

Let me know if you want to see more Q+A videos in future!

https://youtu.be/Q6jwEiyUmi0

2020-03-10 12:19:16 +0000 UTC View Post

I'm at VidCon London

In case any of you are as well, look me up on the networking app

2020-02-21 14:53:10 +0000 UTC View Post