Red teamers hurdle AI guardrails

Demonstrates 'importance of including scientists in AI quality and safety assessments,' Royal Society

John Leonard

06 November 2023 • 2 min read

An experiment conducted by the Royal Society and Humane Intelligence revealed significant vulnerabilities in Large Language Models (LLMs) when generating scientific misinformation.

Forty UK postgraduates studying health and climate sciences were divided into teams and given personas: Good Samaritan, Profiteer, Attention Hacker and Coordinated Influence Operator. Their task ...


