Robert Miles AI Safety channel

AI Ruined My Year

AI Ruined My Year

2 สัปดาห์ที่ผ่านมา
45:59

Why Does AI Lie, and What Can We Do About It?

Why Does AI Lie, and What Can We Do About It?

1 ปีที่แล้ว
9:24

We Were Right! Real Inner Misalignment

We Were Right! Real Inner Misalignment

2 ปีที่แล้ว
11:47

Intro to AI Safety, Remastered

Intro to AI Safety, Remastered

2 ปีที่แล้ว
18:05

Deceptive Misaligned Mesa-Optimisers? It's More Likely Than You Think...

Deceptive Misaligned Mesa-Optimisers? It's More Likely Than You Think...

3 ปีที่แล้ว
10:20

The OTHER AI Alignment Problem: Mesa-Optimizers and Inner Alignment

The OTHER AI Alignment Problem: Mesa-Optimizers and Inner Alignment

3 ปีที่แล้ว
23:24

Quantilizers: AI That Doesn't Try Too Hard

Quantilizers: AI That Doesn't Try Too Hard

3 ปีที่แล้ว
9:54

Sharing the Benefits of AI: The Windfall Clause

Sharing the Benefits of AI: The Windfall Clause

3 ปีที่แล้ว
11:44

10 Reasons to Ignore AI Safety

10 Reasons to Ignore AI Safety

4 ปีที่แล้ว
16:29

9 Examples of Specification Gaming

9 Examples of Specification Gaming

4 ปีที่แล้ว
9:40

Training AI Without Writing A Reward Function, with Reward Modelling

Training AI Without Writing A Reward Function, with Reward Modelling

4 ปีที่แล้ว
17:52

AI That Doesn't Try Too Hard - Maximizers and Satisficers

AI That Doesn't Try Too Hard - Maximizers and Satisficers

4 ปีที่แล้ว
10:22

Is AI Safety a Pascal's Mugging?

Is AI Safety a Pascal's Mugging?

5 ปีที่แล้ว
13:41

A Response to Steven Pinker on AI

A Response to Steven Pinker on AI

5 ปีที่แล้ว
15:38

How to Keep Improving When You're Better Than Any Teacher - Iterated Distillation and Amplification

How to Keep Improving When You're Better Than Any Teacher - Iterated Distillation and Amplification

5 ปีที่แล้ว
11:32

Why Not Just: Think of AGI Like a Corporation?

Why Not Just: Think of AGI Like a Corporation?

5 ปีที่แล้ว
15:27

Safe Exploration: Concrete Problems in AI Safety Part 6

Safe Exploration: Concrete Problems in AI Safety Part 6

5 ปีที่แล้ว
13:46

Friend or Foe? AI Safety Gridworlds extra bit

Friend or Foe? AI Safety Gridworlds extra bit

5 ปีที่แล้ว
3:47

AI Safety Gridworlds

AI Safety Gridworlds

6 ปีที่แล้ว
7:23

Experts' Predictions about the Future of AI

Experts' Predictions about the Future of AI

6 ปีที่แล้ว
6:47

Why Would AI Want to do Bad Things? Instrumental Convergence

Why Would AI Want to do Bad Things? Instrumental Convergence

6 ปีที่แล้ว
10:36

Superintelligence Mod for Civilization V

Superintelligence Mod for Civilization V

6 ปีที่แล้ว
1:04:40

Intelligence and Stupidity: The Orthogonality Thesis

Intelligence and Stupidity: The Orthogonality Thesis

6 ปีที่แล้ว
13:03

Scalable Supervision: Concrete Problems in AI Safety Part 5

Scalable Supervision: Concrete Problems in AI Safety Part 5

6 ปีที่แล้ว
5:03

AI Safety at EAGlobal2017 Conference

AI Safety at EAGlobal2017 Conference

6 ปีที่แล้ว
5:30

AI learns to Create ̵K̵Z̵F̵ ̵V̵i̵d̵e̵o̵s̵ Cat Pictures: Papers in Two Minutes #1

AI learns to Create ̵K̵Z̵F̵ ̵V̵i̵d̵e̵o̵s̵ Cat Pictures: Papers in Two Minutes #1

6 ปีที่แล้ว
5:20

What can AGI do? I/O and Speed

What can AGI do? I/O and Speed

6 ปีที่แล้ว
10:41

What Can We Do About Reward Hacking?: Concrete Problems in AI Safety Part 4

What Can We Do About Reward Hacking?: Concrete Problems in AI Safety Part 4

6 ปีที่แล้ว
9:38

Reward Hacking Reloaded: Concrete Problems in AI Safety Part 3.5

Reward Hacking Reloaded: Concrete Problems in AI Safety Part 3.5

6 ปีที่แล้ว
7:32

The other

The other "Killer Robot Arms Race" Elon Musk should worry about

6 ปีที่แล้ว
5:51