AI “godfather” Yoshua Bengio has joined a UK project to prevent AI catastrophes

Yoshua Bengio, a Turing Award winner who is considered one of the “godfathers” of modern AI, is throwing his weight behind a project funded by the UK government to embed safety mechanisms into AI systems.

The project, called Safeguarded AI, aims to build an AI system that can check whether other AI systems deployed in critical areas are safe. Bengio is joining the program as scientific director and will provide critical input and scientific advice. The project, which will receive £59 million over the next four years, is being funded by the UK’s Advanced Research and Invention Agency (ARIA), which was launched in January last year to invest in potentially transformational scientific research. 

Safeguarded AI’s goal is to build AI systems that can offer quantitative guarantees, such as a risk score, about their effect on the real world, says David “davidad” Dalrymple, the program director for Safeguarded AI at ARIA. The idea is to supplement human testing with mathematical analysis of new systems’ potential for harm. 

The project aims to build AI safety mechanisms by combining scientific world models, which are essentially simulations of the world, with mathematical proofs. These proofs would include explanations of the AI’s work, and humans would be tasked with verifying whether the AI model’s safety checks are correct. 
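The program's actual tooling is unpublished, but the underlying idea can be sketched in miniature: run a proposed action through a simulated world model many times, estimate the probability of reaching an unsafe state, and let a "gatekeeper" approve the action only if that risk stays within a budget. Everything below (the `gatekeeper` function, the toy world model, the 1% risk budget) is an illustrative invention, not ARIA's design, and Monte Carlo estimation here stands in for the formal proofs the project envisions.

```python
import random


def estimate_risk(world_model, action, trials=10_000, seed=0):
    """Estimate the probability that `action` leads to an unsafe
    state by sampling many runs of a (toy) world model."""
    rng = random.Random(seed)
    failures = sum(world_model(action, rng) for _ in range(trials))
    return failures / trials


def gatekeeper(world_model, action, risk_budget=0.01):
    """Approve the action only if its estimated risk is within budget."""
    risk = estimate_risk(world_model, action)
    return {"action": action, "risk": risk, "approved": risk <= risk_budget}


# Toy world model: an action's "aggressiveness" raises the chance
# of ending in an unsafe state on any given simulated run.
def toy_world_model(action, rng):
    return rng.random() < action["aggressiveness"] * 0.1


cautious = gatekeeper(toy_world_model, {"aggressiveness": 0.05})
reckless = gatekeeper(toy_world_model, {"aggressiveness": 0.9})
print(cautious["approved"], reckless["approved"])  # True False
```

A real gatekeeper would replace the sampled estimate with a machine-checkable proof that the risk bound holds, which is precisely the harder step the program is funded to attempt.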

Bengio says he wants to help ensure that future AI systems cannot cause serious harm. 

“We’re currently racing toward a fog behind which might be a precipice,” he says. “We don’t know how far the precipice is, or if there even is one, so it might be years, decades, and we don’t know how serious it could be … We need to build up the tools to clear that fog and make sure we don’t cross into a precipice if there is one.”  

Science and technology companies don’t have a way to give mathematical guarantees that AI systems are going to behave as programmed, he adds. This unreliability, he says, could lead to catastrophic outcomes. 

Dalrymple and Bengio argue that current techniques to mitigate the risk of advanced AI systems—such as red-teaming, where people probe AI systems for flaws—have serious limitations and can’t be relied on to ensure that critical systems don’t go off-piste. 

Instead, they hope the program will provide new ways to secure AI systems that rely less on human efforts and more on mathematical certainty. The vision is to build a “gatekeeper” AI, which is tasked with understanding and reducing the safety risks of other AI agents. This gatekeeper would ensure that AI agents functioning in high-stakes sectors, such as transport or energy systems, operate as we want them to. The idea is to collaborate with companies early on to understand how AI safety mechanisms could be useful for different sectors, says Dalrymple. 

The complexity of advanced systems means we have no choice but to use AI to safeguard AI, argues Bengio. “That’s the only way, because at some point these AIs are just too complicated. Even the ones that we have now, we can’t really break down their answers into human, understandable sequences of reasoning steps,” he says. 

The next step—actually building models that can check other AI systems—is also where Safeguarded AI and ARIA hope to change the status quo of the AI industry. 

ARIA is also offering funding to people or organizations in high-risk sectors such as transport, telecommunications, supply chains, and medical research to help them build applications that might benefit from AI safety mechanisms. ARIA is offering applicants a total of £5.4 million in the first year, and a further £8.2 million in a second year. The deadline for applications is October 2. 

The agency is also casting a wide net for people who might be interested in building Safeguarded AI’s safety mechanism through a nonprofit organization. ARIA is eyeing up to £18 million to set this organization up and will be accepting funding applications early next year. 

The program is looking for proposals to start a nonprofit with a diverse board that encompasses lots of different sectors in order to do this work in a reliable, trustworthy way, Dalrymple says. This is similar to what OpenAI was initially set up to do before changing its strategy to be more product- and profit-oriented. 

The organization’s board will not just be responsible for holding the CEO accountable; it will even weigh in on decisions about whether to undertake certain research projects, and whether to release particular papers and APIs, he adds.

The Safeguarded AI project is part of the UK’s mission to position itself as a pioneer in AI safety. In November 2023, the country hosted the very first AI Safety Summit, which gathered world leaders and technologists to discuss how to build the technology in a safe way. 

While the funding program has a preference for UK-based applicants, ARIA is looking for global talent that might be interested in coming to the UK, says Dalrymple. ARIA also has an intellectual-property mechanism for funding for-profit companies abroad, which allows royalties to flow back to the country. 

Bengio says he was drawn to the project by its potential to promote international collaboration on AI safety. He chairs the International Scientific Report on the Safety of Advanced AI, which involves 30 countries as well as the EU and UN. A vocal advocate for AI safety, he has been part of an influential lobby warning that superintelligent AI poses an existential risk. 

“We need to bring the discussion of how we are going to address the risks of AI to a global, larger set of actors,” says Bengio. “This program is bringing us closer to this.” 
