CrowdStrike and Meta have launched CyberSOCEval, an open-source benchmark suite to evaluate AI models in security operations centers, helping businesses choose the right AI-powered cybersecurity tools for their needs.
The cybersecurity landscape is transforming with artificial intelligence as both a threat and a defense mechanism. As AI enables cybercriminals with advanced tactics, organizations are integrating AI into their security frameworks to counter these dangers, sparking a digital arms race.
CyberSOCEval addresses a critical gap by providing standardized tests for large language models (LLMs) on essential cybersecurity tasks, including incident response, threat analysis comprehension, and malware testing. According to CrowdStrike, “Without clear benchmarks, it’s difficult to know which systems, use cases, and performance standards deliver a true AI advantage against real-world attacks.”
By formalizing evaluations for real-world applications, CyberSOCEval offers organizations a transparent view of each model’s strengths and weaknesses. For AI developers, the framework provides insights into enterprise usage patterns, potentially fostering more tailored and effective models and accelerating innovation.
The benefits of AI in cybersecurity are evident in practical deployments. A recent survey by Mastercard and the Financial Times’ Longitude revealed that financial services firms have saved millions by implementing AI-powered tools to combat AI-enabled fraud, highlighting the tangible return on investment.
Meta’s involvement underscores its commitment to open-source AI principles, allowing developers free access to model weights and source code. The partnership with CrowdStrike exemplifies Meta’s strategy to expand open-source resources in cybersecurity.
Vincent Gonguet, Director of Product for GenAI at Meta’s Superintelligence Labs division, stated, “With these benchmarks in place, and open for the security and AI community to further improve, we can more quickly work as an industry to unlock the potential of AI in protecting against advanced attacks, including AI-based threats.”
The launch comes at a pivotal time, as businesses face mounting pressure from AI-augmented cyber threats. CyberSOCEval’s open-source nature democratizes access, empowering smaller organizations to assess and adopt cutting-edge tools.
The benchmark suite is available for immediate download on GitHub, with comprehensive details and documentation accessible on the project’s dedicated website. Early adopters can begin testing LLMs right away, contributing feedback to refine the framework further.




