This article benchmarks the GPT-3.5 LLM on multi-hop reasoning datasets, finding that RECKONING's performance significantly surpasses both zero-shot and few-shot GPT-3.5 prompting.

The Strength of Dynamic Encoding: RECKONING Outperforms Zero-Shot GPT-3.5 in Distractor Robustness

2025/10/29 23:57

Abstract and 1. Introduction

  2. Background

  3. Method

  4. Experiments

    4.1 Multi-hop Reasoning Performance

    4.2 Reasoning with Distractors

    4.3 Generalization to Real-World Knowledge

    4.4 Run-time Analysis

    4.5 Memorizing Knowledge

  5. Related Work

  6. Conclusion, Acknowledgements, and References

A. Dataset

B. In-context Reasoning with Distractors

C. Implementation Details

D. Adaptive Learning Rate

E. Experiments with Large Language Models

E Experiments with Large Language Models

Recently, Large Language Models (LLMs) with large parameter counts, trained on human preferences, have shown remarkable performance in language understanding and generation. These LLMs are powerful zero-shot and few-shot reasoners. Recent work finds that LLMs learn to perform multi-step reasoning by first generating reasoning chains and then predicting the answers. In this experiment, we benchmark a popular recent LLM, GPT-3.5, on the two multi-hop reasoning datasets used in our paper. We first evaluate GPT-3.5’s zero-shot reasoning performance in predicting the correct answers. As Table 10 shows, zero-shot GPT-3.5 significantly underperforms RECKONING. GPT-3.5’s performance improves on ProofWriter without distractors but still falls behind RECKONING. When distractors are present in the context, RECKONING performs much better than both zero-shot and few-shot GPT-3.5 prompting. This highlights RECKONING’s strength in disentangling irrelevant information from useful knowledge, an ability that even powerful LLMs like GPT-3.5 lack.
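To make the evaluation setup concrete, the following is a minimal sketch of how zero-shot and few-shot prompts with distractor facts might be assembled and scored by exact match. It is not the authors' evaluation code: the prompt layout, the helper names (`format_item`, `build_prompt`, `exact_match_accuracy`), the toy facts, and the stand-in `always_true` model are all assumptions made for illustration, and no real LLM API is called.

```python
from typing import Callable, Optional, Sequence, Tuple

# Hypothetical item type: (facts, question, gold answer).
Item = Tuple[Sequence[str], str, str]


def format_item(facts: Sequence[str], question: str,
                answer: Optional[str] = None) -> str:
    """Render one item; the answer slot stays open for the test question."""
    lines = ["Facts:"] + [f"- {fact}" for fact in facts]
    lines.append(f"Question: {question}")
    lines.append("Answer:" if answer is None else f"Answer: {answer}")
    return "\n".join(lines)


def build_prompt(facts: Sequence[str], question: str,
                 demos: Sequence[Item] = ()) -> str:
    """Zero-shot when demos is empty; few-shot when solved demos are prepended."""
    blocks = [format_item(f, q, a) for f, q, a in demos]
    blocks.append(format_item(facts, question))
    return "\n\n".join(blocks)


def exact_match_accuracy(model: Callable[[str], str],
                         examples: Sequence[Item],
                         demos: Sequence[Item] = ()) -> float:
    """Fraction of examples whose prediction equals the gold label."""
    if not examples:
        raise ValueError("need at least one example")
    correct = sum(
        model(build_prompt(facts, question, demos)).strip().lower()
        == gold.strip().lower()
        for facts, question, gold in examples
    )
    return correct / len(examples)


# Toy, made-up facts in a rule-reasoning style (not taken from the datasets).
relevant = ["Anne is kind.", "If someone is kind then they are nice."]
distractors = ["Bob is green."]  # irrelevant to the question below
demo: Item = (
    ["Dave is big.", "If someone is big then they are strong."],
    "Dave is strong. True or False?",
    "True",
)

zero_shot = build_prompt(relevant + distractors, "Anne is nice. True or False?")
few_shot = build_prompt(relevant + distractors, "Anne is nice. True or False?",
                        demos=[demo])

examples = [
    (relevant + distractors, "Anne is nice. True or False?", "True"),
    (relevant + distractors, "Bob is nice. True or False?", "False"),
]


def always_true(prompt: str) -> str:
    """Stand-in for an LLM call; a real run would query the model here."""
    return "True"


accuracy = exact_match_accuracy(always_true, examples, demos=[demo])
```

In this sketch, the distractor setting differs from the clean setting only in the extra facts concatenated into the context, so the model must decide for itself which facts bear on the question.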


:::info Authors:

(1) Zeming Chen, EPFL (zeming.chen@epfl.ch);

(2) Gail Weiss, EPFL (antoine.bosselut@epfl.ch);

(3) Eric Mitchell, Stanford University (eric.mitchell@cs.stanford.edu);

(4) Asli Celikyilmaz, Meta AI Research (aslic@meta.com);

(5) Antoine Bosselut, EPFL (antoine.bosselut@epfl.ch).

:::


:::info This paper is available on arXiv under CC BY 4.0 DEED license.

:::
