The latest artificial intelligence models aren't just remarkably good at software engineering; new research shows they're getting ever better at finding bugs in software, too.
AI researchers at UC Berkeley tested how well the latest AI models and agents could find vulnerabilities in 188 large open source codebases. Using a new benchmark called CyberGym, the AI models identified 17 new bugs, including 15 previously unknown, or “zero-day,” vulnerabilities. “Many of these vulnerabilities are critical,” says Dawn Song, a professor at UC Berkeley who led the work.
Many experts expect AI models to become formidable cybersecurity weapons. An AI tool from startup Xbow has crept up the ranks of HackerOne’s leaderboard for bug hunting and currently sits in top place. The company recently announced $75 million in new funding.
Song says that the coding skills of the latest AI models, combined with their improving reasoning abilities, are starting to change the cybersecurity landscape. “This is a pivotal moment,” she says. “It actually exceeded our general expectations.”
As the models continue to improve, they will automate the process of both finding and exploiting security flaws. This could help companies keep their software safe but might also aid hackers in breaking into systems. “We didn’t even try that hard,” Song says. “If we ramped up on the budget, allowed the agents to run for longer, they could do even better.”
The UC Berkeley team tested conventional frontier AI models from OpenAI, Google, and Anthropic, as well as open source offerings from Meta, DeepSeek, and Alibaba, combined with several agents for finding bugs, including OpenHands, Cybench, and EnIGMA.
The researchers used descriptions of known software vulnerabilities from the 188 software projects. They then fed the descriptions to the cybersecurity agents powered by frontier AI models to see if they could identify the same flaws for themselves by analyzing new codebases, running tests, and crafting proof-of-concept exploits. The team also asked the agents to hunt for new vulnerabilities in the codebases on their own.
Through this process, the AI tools generated hundreds of proof-of-concept exploits, and from these exploits the researchers identified 15 previously unseen vulnerabilities and two vulnerabilities that had previously been disclosed and patched. The work adds to growing evidence that AI can automate the discovery of zero-day vulnerabilities, which are potentially dangerous (and valuable) because they may provide a way to hack live systems.
AI nonetheless seems destined to become an important part of the cybersecurity industry. Security expert Sean Heelan recently discovered a zero-day flaw in the widely used Linux kernel with help from OpenAI’s reasoning model o3. Last November, Google announced that it had found a previously unknown software vulnerability using AI through a program called Project Zero.
Like other parts of the software industry, many cybersecurity companies are enamored with the potential of AI. The new work indeed shows that AI can find new flaws automatically, but it also highlights the technology’s remaining limitations. The AI systems were unable to find most of the flaws and were stumped by especially complex ones.