Copied


Microsoft's AI Red Team Adopts Hacker Mindset to Enhance Security

Darius Baruo   Jul 25, 2024 00:47 0 Min Read


Generative AI’s new capabilities come with new risks, spurring a novel approach to how Microsoft's AI Red Team works to identify and reduce potential harm, according to news.microsoft.com.

Origins of Red Teaming

The term “red teaming” was coined during the Cold War, when the U.S. Defense Department conducted simulation exercises with red teams acting as the Soviets and blue teams acting as the U.S. and its allies. The cybersecurity community adopted the language a few decades ago, creating red teams to act as adversaries trying to break, corrupt, or misuse technology — with the goal of finding and fixing potential harms before any problems emerged.

Formation of Microsoft's AI Red Team

In 2018, Siva Kumar formed Microsoft’s AI Red Team, following the traditional model of pulling together cybersecurity experts to proactively probe for weaknesses, just as the company does with all its products and services. Meanwhile, Forough Poursabzi led researchers from around the company in studies from a responsible AI lens, examining whether the generative technology could be harmful — either intentionally or due to systemic issues in models that were overlooked during training and evaluation.

Collaboration for Comprehensive Risk Assessment

The different groups quickly realized they’d be stronger together and joined forces to create a broader red team that assesses both security and societal-harm risks alongside each other. This new team includes a neuroscientist, a linguist, a national security specialist, and numerous other experts with diverse backgrounds.

Adapting to New Challenges

This collaboration marks a significant shift in how red teams operate, integrating a multidisciplinary approach to tackle the unique challenges posed by generative AI. By thinking like hackers, the team aims to identify vulnerabilities and mitigate risks before they can be exploited in real-world scenarios.

This initiative is part of Microsoft’s broader effort to deploy AI responsibly, ensuring that new capabilities do not come at the expense of safety and societal well-being.


Read More
The Hong Kong Monetary Authority has issued a warning about a fraudulent website posing as OCBC Bank (Hong Kong) Limited, urging public vigilance.
BitMEX has changed the Mark Method for NILUSDTH25 and REDUSDTZ25 to Fair Price marking, effective March 25, 2025, enhancing price accuracy.
BitMEX introduces NILUSDT perpetual swaps, offering traders up to 50x leverage. This new listing enhances trading options on the platform.
Bitcoin remains vulnerable to downward pressure due to tight liquidity conditions and weak investor sentiment, with ETF outflows and cautious market behavior persisting.
Vodafone implements AI-driven solutions using LangChain and LangGraph to optimize data operations and improve performance metrics monitoring and information retrieval across its data centers.
BitMEX announces the introduction of NILUSDT perpetual swap listing, offering traders up to 50x leverage. The NIL token will be available for trading starting March 25, 2024.
Cronos (CRO) Labs has appointed Mirko Zhao as its new leader, succeeding Ken Timsit. Zhao aims to enhance the blockchain’s growth and community engagement.
Cronos (CRO) Labs announces Mirko Zhao as the new Head of Product and Engineering, succeeding Ken Timsit, to lead the blockchain ecosystem's innovative growth.