CodeQ
Autonomous Code Debugging Agent
Self-improving code debugging agent using Monte Carlo Tree Search to explore fix strategies, dual-temperature self-critique to rank them, and Direct Preference Optimization to learn from its own exploration data. Built on Qwen2.5-Coder-7B across two H100 nodes.
fix rate on DebugBench
MCTS mode after DPO Round 2