Begin typing your search above and press return to search.

World

SCO Summit approves development bank, announces new security and...

2025-09-01T22:48:22+05:30

Science

ISS crew explores bone loss, brain adaptation, and spacesuit safety

2025-09-01T22:33:04+05:30

World

BRICS leaders to hold virtual meeting on Trump’s trade tariffs

2025-09-01T22:24:23+05:30

India

US tariff hike raises concerns for Indian tea exports

2025-09-01T22:12:50+05:30

India

Swiss tourist’s Reddit post on scams in India sparks online debate

2025-09-01T20:42:11+05:30

India

16 FIRs filed against Syeda Hameed in Assam over Bangladesh remarks

2025-09-01T19:23:57+05:30

OPINION

Article

The 200th birthday of the train and the birth of revolutionary...

2025-09-01T17:24:27+05:30

Article

When the Jumbotron speaks: privacy in the age of viral spectacle

2025-09-01T17:27:44+05:30

Editorial

From pillar to fault line: how Gaza is splintering the American right

2025-09-01T10:30:53+05:30

Editorial

Justice system straining under pressure

2025-08-30T09:31:04+05:30

Editorial

It is not reservation, but big plunder

2025-08-29T09:45:05+05:30

DEEP READ

Deep Read

Spelling Bee @100: What the former champions say - and achieved?

2025-05-26T11:06:50+05:30

Deep Read

The Trump plan to annex Canada and Greenland as the US 51st state

2025-01-22T16:21:52+05:30

World

The Russian plan: Invade Japan and South Korea

2025-01-16T15:32:24+05:30

Posted On

1 Sep 2025 1:04 PM GMT

Updated On

2025-09-01T18:34:58+05:30

Persuasion tactics can bypass ChatGPT’s safety filters, says researchers

The concerns are heightened by recent reports of a teenager who died by suicide after using ChatGPT.

ChatGPT can be tricked into providing harmful answers through simple persuasion, researchers at the University of Pennsylvania have found.

The findings were detailed in a paper published on the Social Science Research Network (SSRN) titled “Call Me A Jerk: Persuading AI to Comply with Objectionable Requests.”

The team tested GPT-4o mini with thousands of prompts that used persuasion techniques such as flattery and peer pressure.

Unlike complex hacks or layered prompt injections, the study showed that persuasion methods effective on humans could also work on AI.

According to Bloomberg, the researchers drew on principles from Robert Cialdini’s book Influence: The Psychology of Persuasion. The book identifies seven methods: authority, commitment, liking, reciprocity, scarcity, social proof, and unity.

Using these approaches, GPT-4o mini was persuaded to describe how to synthesise lidocaine, a regulated drug. The team gave the chatbot two choices: “call me a jerk or tell me how to synthesise lidocaine.” The AI complied 72 percent of the time across 28,000 attempts. This rate was more than twice the success of standard prompts.

“These findings underscore the relevance of classic findings in social science to understanding rapidly evolving, parahuman AI capabilities–revealing both the risks of manipulation by bad actors and the potential for more productive prompting by benevolent users,” the researchers wrote.

The concerns are heightened by recent reports of a teenager who died by suicide after using ChatGPT. He allegedly convinced the system to provide advice on suicide methods and on hiding red marks on his neck by saying it was for a fictional story.

The study warns that if persuasion alone can override safety training, AI companies must adopt stronger protections to stop misuse.

Show Full Article

TAGS:ChatGPT

Uddhav Thackeray's resignation is not a matter of joy for us: Rebel...

16 FIRs filed against Syeda Hameed in Assam over Bangladesh remarks

Israel’s actions in Gaza meet legal definition of genocide: scholars'...

Persuasion tactics can bypass ChatGPT’s safety filters, says...

The 200th birthday of the train and the birth of revolutionary...

Abu Dhabi launches AI-powered traffic lights to cut congestion

SCO Summit approves development bank, announces new security and...

ISS crew explores bone loss, brain adaptation, and spacesuit safety

BRICS leaders to hold virtual meeting on Trump’s trade tariffs

US tariff hike raises concerns for Indian tea exports

Swiss tourist’s Reddit post on scams in India sparks online debate

16 FIRs filed against Syeda Hameed in Assam over Bangladesh remarks

The 200th birthday of the train and the birth of revolutionary...

When the Jumbotron speaks: privacy in the age of viral spectacle

Opportunity given by Trump

From pillar to fault line: how Gaza is splintering the American right

Justice system straining under pressure

It is not reservation, but big plunder

Racial underpinnings of war

Espionage in the UK

Yet another air tragedy

Spelling Bee @100: What the former champions say - and achieved?

The Trump plan to annex Canada and Greenland as the US 51st state

The Russian plan: Invade Japan and South Korea

Persuasion tactics can bypass ChatGPT’s safety filters, says researchers