LLMCluster – Fuzzing crash triage using generative AI
Fuzzing is a common security and testing technique using mutated input against systems under test to identify crashes that could be exploited against the program. Fuzzing campaigns can run over long timespans and generate significant data. Crash triage, the analysis of fuzzing crash data, is a human-intensive endeavor following the fuzzing campaign. Triage requires the analyst to deduplicate or bucket similar crashes and to explore the impact that a given bucket of crashes has on the system under test. In our work, we explore the usage of large language models hosted on NARWHAL and NAUTILUS to simplify the bucketing of crashes in API Fuzzers. We present LLMCluster, a novel framework for crash bucketing and compare it to previous bucketing algorithms.
IMPACT
Accomplishment: Led a team of 3 engineers to develop a framework to analyze crash data using LLM. Result: Provided insights into further usage of LLMs in the software engineering/software security lifecycle. Applied research necessary to integrate further algorithms into fuzzing capabilities.
PRESENTER
Pack, Derik
derik.l.pack.civ@us.navy.mil
843-218-5015Naval Information Warfare Center Atlantic
CO-AUTHOR(S)
Leclerc, Anthony
anthony.p.leclerc.civ@us.navy.milCATEGORY
GPU usage for HPC
SECONDARY CATEGORY
AI/ML for HPC
SYSTEM(S) USED
NARWHAL NAUTILUS