What's Happening?
Gray Swan, a Pittsburgh-based AI startup, employs an army of 15,000 hackers to pressure test AI models like Claude, GPT-5, and Gemini for safety vulnerabilities. Founded by Carnegie Mellon University professors Matt Fredrikson and Zico Kolter, the company
has become a key player in AI security, working with major labs such as OpenAI, Anthropic, and Google Deepmind. The startup uses a platform called Arena to conduct security challenges, where hackers attempt to exploit AI models to identify weaknesses. Gray Swan has raised $40 million in Series A funding, bringing its valuation to $200 million. The company uses data from these challenges to train its AI agent, Shade, which continuously tests systems for vulnerabilities.
Why It's Important?
As AI systems become more integrated into various sectors, ensuring their security is paramount. Gray Swan's approach to using human hackers to identify vulnerabilities before they can be exploited is crucial in preventing misuse of AI technologies. The startup's work is particularly significant given the potential for AI models to be used in harmful ways, such as planning illegal activities or leaking sensitive information. By collaborating with major AI labs, Gray Swan is helping to set industry standards for AI safety and security, which could influence future regulations and best practices in the field.
What's Next?
With its recent funding, Gray Swan plans to expand its customer base beyond frontier labs to include large enterprises that need to secure their AI products. The company aims to enhance its AI agent, Shade, and its monitoring software, Cygnal, to better detect and prevent harmful AI outputs. As AI technologies continue to evolve, Gray Swan's role in safeguarding these systems will likely grow, potentially leading to new partnerships and innovations in AI security.











