2 OpenAI researchers working on safety and governance have quit

  • OpenAI researchers Daniel Kokotajlo and William Saunders recently left the company behind ChatGPT.

  • Kokotajlo said on a forum he doesn't think OpenAI will "behave responsibly around the time of AGI."

  • Kokotajlo was on the governance team, and Saunders worked on the Superalignment team at OpenAI.

Two OpenAI employees who worked on safety and governance recently resigned from the company behind ChatGPT.

Daniel Kokotajlo left last month, and William Saunders departed OpenAI in February. The timing of their departures was confirmed by two people familiar with the situation. The people asked to remain anonymous to discuss the departures, but their identities are known to Business Insider.

Kokotajlo, who worked on the governance team, is listed as an adversarial tester of GPT-4, which launched in March last year. Saunders had worked since 2021 on the Alignment team, which later became the Superalignment team.

Kokotajlo wrote on his profile page on the online forum LessWrong that he quit OpenAI "due to losing confidence that it would behave responsibly around the time of AGI."

In a separate post on the platform in April, he expanded on one of the reasons behind his decision while weighing in on a discussion about pausing the development of artificial general intelligence (AGI).

"I think most people pushing for a pause are trying to push against a 'selective pause' and for an actual pause that would apply to the big labs who are at the forefront of progress," Kokotajlo wrote.

He added: "The current overton window seems unfortunately centered around some combination of evals-and-mitigations that is at IMO high risk of regulatory capture (i.e. resulting in a selective pause that doesn't apply to the big corporations that most need to pause!) My disillusionment about this is part of why I left OpenAI."

Saunders said in a comment on his LessWrong profile page that he resigned in February after three years at the ChatGPT maker.

The Superalignment team, initially led by Ilya Sutskever and Jan Leike, is tasked with building safeguards to prevent AGI from going rogue.

Sutskever and Leike have previously predicted that AGI could arrive within a decade. It's not clear if Sutskever is still at OpenAI following his participation in the brief ousting of Sam Altman as CEO last year.

Saunders was also a manager of the interpretability team, which researches how to make AGI safe and examines how and why models behave the way they do. He has co-authored several papers on AI models.

Kokotajlo's and Saunders' resignations come amid other departures at OpenAI. Two executives, Diane Yoon and Chris Clark, quit last week, The Information reported. Yoon was the VP of people, and Clark was head of nonprofit and strategic initiatives.

OpenAI also parted ways with researchers Leopold Aschenbrenner and Pavel Izmailov, according to another report by The Information last month.

OpenAI, Kokotajlo, and Saunders did not respond to requests for comment from Business Insider.

Do you work for OpenAI? Got a tip? Contact this reporter at jmann@businessinsider.com from a nonwork device.

