Tekmono
  • News
  • Guides
  • Lists
  • Reviews
  • Deals
No Result
View All Result
Tekmono
No Result
View All Result
Home News
MIT Researchers Boost LLM Planning with New Framework

MIT Researchers Boost LLM Planning with New Framework

by Tekmono Editorial Team
22/09/2025
in News
Share on FacebookShare on Twitter

Researchers from MIT CSAIL have developed PDDL-INSTRUCT, a framework designed to improve the multi-step planning capabilities of large language models (LLMs) by combining logical reasoning with an external plan validator.

The PDDL-INSTRUCT framework trains models to recognize and explain why a candidate plan has failed, including identifying unsatisfied preconditions, incorrect effects, frame violations, or an unmet goal. This is achieved through logical chain-of-thought prompts that guide the LLM to perform step-by-step inference over state and action transitions, producing traceable sequences of state→action→state, written as ⟨sᵢ, aᵢ₊₁, sᵢ₊₁⟩.

For external validation, PDDL-INSTRUCT integrates the VAL plan validator, which checks each step of the generated plan and provides feedback that is either binary (valid/invalid) or detailed. The detailed feedback results in superior performance. The system uses a two-stage optimization process: the first stage penalizes errors in the reasoning chains, and the second stage optimizes for final planning accuracy.

Related Reads

OpenAI Launches Customizable Skills for Codex Coding Agent

Amazon’s Alexa+ to Integrate with Four New Services

EA Investigated for AI-Generated Content in Battlefield 6

Apple to Start iPhone 18 Production in January

The effectiveness of PDDL-INSTRUCT was evaluated using the PlanBench benchmark, which includes planning domains known to challenge LLMs, such as Blocksworld, Mystery Blocksworld, and Logistics. In the Blocksworld domain, a tuned Llama-3-8B model achieved a 94% rate of generating valid plans, significantly outperforming previous models. Notably, PDDL-INSTRUCT achieved up to a 64-fold improvement in the Mystery Blocksworld domain, where predicate names are obfuscated to prevent pattern matching.

Significant performance gains were also recorded in the Logistics domain. Across all test domains, the framework delivered up to a 66% absolute improvement compared to untuned baseline models. Researchers observed that performance improved with longer feedback budgets and more detailed output from the validator.

The current implementation of PDDL-INSTRUCT applies to classical PDDL domains and relies on the VAL validator as an external oracle. The results demonstrate a method for grounding LLM reasoning in formal semantics for use in agent systems that include a verifier during planning. Extending the framework to handle long-horizon, temporal, numeric, and cost-sensitive planning tasks remains an area for further work.

ShareTweet

You Might Be Interested

OpenAI Launches Customizable Skills for Codex Coding Agent
News

OpenAI Launches Customizable Skills for Codex Coding Agent

24/12/2025
Amazon’s Alexa+ to Integrate with Four New Services
News

Amazon’s Alexa+ to Integrate with Four New Services

24/12/2025
EA Investigated for AI-Generated Content in Battlefield 6
News

EA Investigated for AI-Generated Content in Battlefield 6

24/12/2025
Apple to Start iPhone 18 Production in January
News

Apple to Start iPhone 18 Production in January

24/12/2025
Please login to join discussion

Recent Posts

  • OpenAI Launches Customizable Skills for Codex Coding Agent
  • Amazon’s Alexa+ to Integrate with Four New Services
  • EA Investigated for AI-Generated Content in Battlefield 6
  • Apple to Start iPhone 18 Production in January
  • Connect Your Phone to Wi-Fi Easily

Recent Comments

No comments to show.
  • News
  • Guides
  • Lists
  • Reviews
  • Deals
Tekmono is a Linkmedya brand. © 2015.

No Result
View All Result
  • News
  • Guides
  • Lists
  • Reviews
  • Deals