Upcoming Webinar
Understand How Your Peers Approach AD Forest Recovery
Join this webinar and not only learn about what your peers are doing, but also learn about a new patent-pending modern approach to AD forest recovery from Cayosoft.
New episode!
Unlocking the Secrets of App Security!
In this episode of UnplugIT, Stephen Rose talks to IT expert and technology engineer Sean Hurley about how to secure apps in the cloud.
Essential Eight + Microsoft 365 Backup Compliance
This download allows you to demonstrate exactly how the technology you use keeps customers covered and compliant.
UK Ransomware Guidelines + M365 Backup Compliance
This download allows you to demonstrate exactly how the technology you use keeps customers covered and compliant.

Microsoft Releases Internal Security Tool ‘PyRIT’ to Protect Generative AI Systems

Key Takeaways:

Microsoft’s PyRIT (Python Risk Identification Toolkit) is a newly released tool to help security teams mitigate security risks within generative AI systems.

The toolkit offering a practical solution to automate routine tasks and enhance risk detection and mitigation processes.

PyRIT is not intended to replace manual red teaming and it complements existing expertise by providing a streamlined approach to risk assessment.

Last week, Microsoft introduced its Python Risk Identification Toolkit for generative AI (PyRIT) The new tool provides security teams and machine learning engineers with tools to identify and mitigate risks within AI systems.

In 2022, Microsoft introduced PyRIT to help its AI red team identify security risks within its generation AI systems, such as Copilot. A red team is a group of skilled professionals responsible for simulating cyberattacks on a corporate network or infrastructure. The primary goal is to detect security vulnerabilities and improve security measures.

Microsoft emphasized that the red-teaming process for these systems differs from classical AI or traditional software. This is because Microsoft must account for both responsible AI risks and typical security risks. Consequently, analyzing these various risks can be a slow, tedious, and time-consuming process.

How does the PyRIT toolkit work?

The PyRIT toolkit is designed to help security teams automate time-consuming routine tasks, such as the creation of malicious prompts in bulk. It comprises five primary components: targets, datasets, scoring engine, attack strategies, and memory.

Additionally, the PyRIT toolkit offers two distinct attack styles. In the single-turn strategy, PyRIT sends a combination of harmful prompts to a target generative AI system prior to scoring the response. The second strategy, dubbed “multiturn,” involves sending a set of malicious prompts and then scoring the response. PyRIT sends a new prompt to the AI system depending on the previous score.

Microsoft Releases Internal Security Tool 'PyRIT' to Protect Generative AI Systems

Microsoft says that the PyRIT toolkit does not serve as a substitute for manual red teaming of generative AI systems. Instead, it is intended to complement the expertise and skills of the existing red team.

“For instance, in one of our red teaming exercises on a Copilot system, we were able to pick a harm category, generate several thousand malicious prompts, and use PyRIT’s scoring engine to evaluate the output from the Copilot system all in the matter of hours instead of weeks,” said Microsoft explained.

The PyRIT toolkit is available to download for customers on GitHub, with Microsoft offering a range of demos to facilitate user familiarization. The company also plans to host a webinar on Mar 05 2024, to guide participants about how to use PyRIT for red teaming generative AI systems. If you’re interested, you can register for the upcoming webinar on Microsoft’s website.

by Rabia Noureen
Feb 27, 2024

Rabia comes from a solid IT background and has been writing professionally about Microsoft products and other technology for four years. Rabia has also written for OnMSFT.com as well as Windows Report. She is always up to date on the latest trends in...

Streamlining SaaS Governance: How Nudge Security Simplifies Compliance and Security Management for Cloud Apps

Oct 30, 2023
Nov 03, 2023
The Ultimate Guide to Web Application Firewalls (WAF)

Jun 20, 2023
The Dirty Truth About IT Offboarding Automation

Jul 21, 2023
Aug 25, 2023

Take our survey

PETRI NEWSLETTERS

Join Petri Insider

Create a free account today to participate in forum conversations, comment on posts and more.