This article provides a detailed description of two multi-hop logical reasoning datasets: ProofWriter and CLUTRR-SG.

Evaluating Systematic Generalization: The Use of ProofWriter and CLUTRR-SG in LLM Reasoning Research

Abstract and 1. Introduction

  2. Background

  3. Method

  4. Experiments

    4.1 Multi-hop Reasoning Performance

    4.2 Reasoning with Distractors

    4.3 Generalization to Real-World Knowledge

    4.4 Run-time Analysis

    4.5 Memorizing Knowledge

  5. Related Work

  6. Conclusion, Acknowledgements, and References

A. Dataset

B. In-context Reasoning with Distractors

C. Implementation Details

D. Adaptive Learning Rate

E. Experiments with Large Language Models

A Dataset

ProofWriter The ProofWriter [73] dataset contains 500k questions, answers, and proofs over natural-language rule bases. Each example contains a set of facts, a set of rules, a hypothesis, and a label indicating whether the hypothesis is true, false, or unknown. The dataset comprises five subsets named D0, D1, D2, D3, and D5, each with 100k examples, whose questions require reasoning up to depth D (D = 0, 1, 2, 3, 5) to determine their answers. In our experiments, we focus only on the subsets that require greater reasoning depth (D2, D3, D5). We show an example from the dataset in Table 7. In these subsets, each set of facts and rules is mapped to 18 questions, and each question can be answered from a subset of the facts and rules. Thus, some of the facts or rules are irrelevant to a given question; we call these distractors in Section 4.2. In the experiment on knowledge encoding with distractors, we encode all the facts in the model parameters and evaluate the model's ability to reproduce and reason over the correct facts. We show an example of distractor and relevant knowledge for a question in Table 9. For detailed statistics on the two datasets, please see Table 6.
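To make the example structure concrete, here is a minimal sketch (not the authors' code, and not the dataset's exact schema) of how a ProofWriter-style example with facts, rules, a hypothesis, a three-way label, and a reasoning depth might be represented, and how the harder subsets (D2, D3, D5) could be selected by depth. All field and function names are illustrative assumptions.

```python
from dataclasses import dataclass
from typing import List

@dataclass
class ProofWriterExample:
    facts: List[str]      # e.g. "The bear is big."
    rules: List[str]      # e.g. "If something is big then it is strong."
    hypothesis: str       # the statement to verify
    label: str            # "true", "false", or "unknown"
    depth: int            # reasoning depth needed to decide the hypothesis

def keep_deep_examples(examples: List[ProofWriterExample],
                       min_depth: int = 2) -> List[ProofWriterExample]:
    """Keep only examples from the harder subsets (e.g. D2, D3, D5)."""
    return [ex for ex in examples if ex.depth >= min_depth]

# A toy 1-hop example: one rule applied to one fact proves the hypothesis.
example = ProofWriterExample(
    facts=["The bear is big.", "The bear sees the cat."],
    rules=["If something is big then it is strong."],
    hypothesis="The bear is strong.",
    label="true",
    depth=1,
)
```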

CLUTRR-SG The CLUTRR-SG [28] dataset is an evaluation dataset for inductive reasoning over family relations, adapted from the [71] dataset for measuring systematic generalization. Each example contains (i) a set of facts representing a family graph G = (V, E), where the nodes (V) are entities and the edges (E) are relationships; (ii) a question asking for the relationship between two entities (v1, vn ∈ V); and (iii) a target relationship e* ∈ E as the answer to the question. The facts are expressed as a list of (vi, ej, vk) tuples. The two entities in the question are separated by more than one hop in the graph. There are 272 unique entities, 20 relationship types, and nearly 1.5M possible facts in the dataset. Following the authors, we define the difficulty of an example by the number of family-graph edges required to determine the relation (i.e., the number of reasoning hops), where k edges (k-hop) correspond to k facts. We show an example from the dataset in Table 8.
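The sketch below (hypothetical, not the released CLUTRR-SG code) illustrates how facts can be stored as (vi, ej, vk) tuples and how a k-hop question is answered by composing the k relations along the chain between the two query entities. The names, the two-entry composition table, and the reading of a tuple as "vi is the ej of vk" are all assumptions made for illustration.

```python
from typing import Dict, List, Tuple

Fact = Tuple[str, str, str]  # (v_i, e_j, v_k): "v_i is the e_j of v_k"

# A 2-hop chain between "Anna" and "Carla".
facts: List[Fact] = [
    ("Anna", "mother", "Ben"),    # Anna is Ben's mother
    ("Ben", "father", "Carla"),   # Ben is Carla's father
]

# Illustrative composition rules: (first relation, next relation) -> composed relation
COMPOSE: Dict[Tuple[str, str], str] = {
    ("mother", "father"): "grandmother",
    ("father", "father"): "grandfather",
}

def answer_k_hop(chain: List[Fact]) -> str:
    """Fold a chain of k facts into the single target relation e*."""
    relation = chain[0][1]
    for _, next_relation, _ in chain[1:]:
        relation = COMPOSE[(relation, next_relation)]
    return relation

# Question: what is the relationship between Anna and Carla?
print(answer_k_hop(facts))  # -> "grandmother"
```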


:::info Authors:

(1) Zeming Chen, EPFL (zeming.chen@epfl.ch);

(2) Gail Weiss, EPFL (antoine.bosselut@epfl.ch);

(3) Eric Mitchell, Stanford University (eric.mitchell@cs.stanford.edu);

(4) Asli Celikyilmaz, Meta AI Research (aslic@meta.com);

(5) Antoine Bosselut, EPFL (antoine.bosselut@epfl.ch).

:::


:::info This paper is available on arxiv under CC BY 4.0 DEED license.

:::

