Tekmono
  • News
  • Guides
  • Lists
  • Reviews
  • Deals
No Result
View All Result
Tekmono
No Result
View All Result
Home News
MIT Researchers Boost LLM Planning with New Framework

MIT Researchers Boost LLM Planning with New Framework

by Tekmono Editorial Team
22/09/2025
in News
Share on FacebookShare on Twitter

Researchers from MIT CSAIL have developed PDDL-INSTRUCT, a framework designed to improve the multi-step planning capabilities of large language models (LLMs) by combining logical reasoning with an external plan validator.

The PDDL-INSTRUCT framework trains models to recognize and explain why a candidate plan has failed, including identifying unsatisfied preconditions, incorrect effects, frame violations, or an unmet goal. This is achieved through logical chain-of-thought prompts that guide the LLM to perform step-by-step inference over state and action transitions, producing traceable sequences of state→action→state, written as ⟨sᵢ, aᵢ₊₁, sᵢ₊₁⟩.

For external validation, PDDL-INSTRUCT integrates the VAL plan validator, which checks each step of the generated plan and provides feedback that is either binary (valid/invalid) or detailed. The detailed feedback results in superior performance. The system uses a two-stage optimization process: the first stage penalizes errors in the reasoning chains, and the second stage optimizes for final planning accuracy.

Related Reads

Microsoft enhances Copilot with multimodal features, introduces new $99 tier

Apple celebrates 50th anniversary amid scrutiny over privacy practices

Huawei launches Converged Development Engine for HarmonyOS PCs

Salesforce unveils updated Slack with 30 new AI features

The effectiveness of PDDL-INSTRUCT was evaluated using the PlanBench benchmark, which includes planning domains known to challenge LLMs, such as Blocksworld, Mystery Blocksworld, and Logistics. In the Blocksworld domain, a tuned Llama-3-8B model achieved a 94% rate of generating valid plans, significantly outperforming previous models. Notably, PDDL-INSTRUCT achieved up to a 64-fold improvement in the Mystery Blocksworld domain, where predicate names are obfuscated to prevent pattern matching.

Significant performance gains were also recorded in the Logistics domain. Across all test domains, the framework delivered up to a 66% absolute improvement compared to untuned baseline models. Researchers observed that performance improved with longer feedback budgets and more detailed output from the validator.

The current implementation of PDDL-INSTRUCT applies to classical PDDL domains and relies on the VAL validator as an external oracle. The results demonstrate a method for grounding LLM reasoning in formal semantics for use in agent systems that include a verifier during planning. Extending the framework to handle long-horizon, temporal, numeric, and cost-sensitive planning tasks remains an area for further work.

ShareTweet

You Might Be Interested

Microsoft enhances Copilot with multimodal features, introduces new  tier
News

Microsoft enhances Copilot with multimodal features, introduces new $99 tier

02/04/2026
News

Apple celebrates 50th anniversary amid scrutiny over privacy practices

02/04/2026
News

Huawei launches Converged Development Engine for HarmonyOS PCs

02/04/2026
Salesforce unveils updated Slack with 30 new AI features
News

Salesforce unveils updated Slack with 30 new AI features

02/04/2026
Please login to join discussion

Recent Posts

  • Microsoft enhances Copilot with multimodal features, introduces new $99 tier
  • Apple celebrates 50th anniversary amid scrutiny over privacy practices
  • Huawei launches Converged Development Engine for HarmonyOS PCs
  • Salesforce unveils updated Slack with 30 new AI features
  • Meta announces release of second generation smart glasses starting April 14

Recent Comments

No comments to show.
  • News
  • Guides
  • Lists
  • Reviews
  • Deals
Tekmono is a Linkmedya brand. © 2015.

No Result
View All Result
  • News
  • Guides
  • Lists
  • Reviews
  • Deals