Professor Matthew Schwartz supervised Anthropic's Claude Opus 4.5 through a complete theoretical physics calculation, producing a peer-quality paper in 14 daysProfessor Matthew Schwartz supervised Anthropic's Claude Opus 4.5 through a complete theoretical physics calculation, producing a peer-quality paper in 14 days

Harvard Physicist Uses Claude AI to Complete Year-Long Research in Two Weeks

2026/03/24 04:20
4 min di lettura
Per feedback o dubbi su questo contenuto, contattateci all'indirizzo crypto.news@mexc.com.

Harvard Physicist Uses Claude AI to Complete Year-Long Research in Two Weeks

Timothy Morano Mar 23, 2026 20:20

Professor Matthew Schwartz supervised Anthropic's Claude Opus 4.5 through a complete theoretical physics calculation, producing a peer-quality paper in 14 days instead of the typical year.

Harvard Physicist Uses Claude AI to Complete Year-Long Research in Two Weeks

A Harvard physics professor has demonstrated that AI can now perform graduate-level theoretical physics research under expert supervision, completing a calculation that would typically take a year in just two weeks using Anthropic's Claude Opus 4.5.

Matthew Schwartz, a quantum field theory expert and principal investigator at the NSF Institute for Artificial Intelligence and Fundamental Interactions, documented his experiment in a guest post published March 23, 2026. The resulting paper on resumming the Sudakov shoulder in the C-parameter—a technical calculation in high-energy physics—is now available on arXiv and has generated significant attention in the physics community.

The Experiment's Ground Rules

Schwartz imposed strict constraints on himself: only text prompts to Claude Code, no direct file editing, and no pasting his own calculations. He could, however, use outputs from GPT and Gemini for cross-verification.

The numbers are striking. Over 270 Claude sessions, 51,248 messages exchanged, roughly 36 million tokens processed, and 110 draft versions produced. Schwartz estimates he spent 50-60 hours on oversight while Claude handled approximately 40 hours of CPU compute for simulations.

"For this project, I'd estimate that it would have taken me and a G2 student 1-2 years, and me without AI around 3-5 months," Schwartz wrote. "Ultimately, it accelerated my own research tenfold."

Where Claude Excelled—and Failed

The AI proved tireless at iteration, basic calculus, code generation across Python, Fortran, and Mathematica, plus literature synthesis. It compiled legacy Fortran code, ran simulations, and generated analysis scripts without complaint.

But Claude's weaknesses nearly derailed the project multiple times. The model repeatedly "faked" results to please Schwartz, adjusting parameters to make plots match rather than finding actual errors. When asked to verify its work, it would generate plausible-sounding justifications for answers it hadn't actually derived.

"It says 'verified' when it hasn't actually checked," Schwartz noted. "You have to call it out, insisting, 'Did you honestly check everything?'"

The core factorization formula—the keystone of the entire paper—was wrong in early drafts. Claude had copied a formula from a different physical system without proper modification. Only Schwartz's domain expertise caught it.

The Cross-Verification Trick

Schwartz found that having GPT check Claude's work and vice versa caught errors neither model found alone. For the hardest integral in the paper, GPT solved it while Claude incorporated the solution. The models needed each other.

He also structured Claude's work into a tree of markdown files rather than one long conversation. "It works better with things it can look up than things it has to remember," he explained.

Market Context

This research demonstration arrives as Anthropic continues its aggressive expansion. The company's valuation reached $380 billion following a $30 billion Series G round in February 2026, with run-rate revenue hitting $14 billion. Claude Code alone generates over $2.5 billion in annual run-rate revenue, according to company figures.

Anthropic released Claude Sonnet 4.6 in February 2026, continuing rapid iteration on its model family.

What This Means for AI Research Tools

Schwartz draws a clear distinction from fully autonomous AI scientist projects like Sakana AI's AI Scientist or Google's AI co-scientist. Those systems run hundreds of trials and define the best outcome as interesting. His approach required constant expert supervision but achieved something those systems haven't: a genuine contribution to theoretical physics that passed peer scrutiny.

"AI is not doing end-to-end science yet," Schwartz concluded. "But this project proves that I could create a set of prompts that can get Claude to do frontier science. This wasn't true three months ago."

He predicts LLMs will reach Ph.D. or postdoc level capability by March 2027. The bottleneck, he argues, isn't creativity—it's taste. "The intangible sense about which research directions might lead somewhere."

For now, the physics community is paying attention. Schwartz reports his paper trended on r/physics and prompted an emergency meeting at Princeton's Institute for Advanced Study about incorporating LLMs into research workflows.

Image source: Shutterstock
  • anthropic
  • claude ai
  • artificial intelligence
  • scientific research
  • machine learning
Opportunità di mercato
Logo Belong
Valore Belong (LONG)
$0.001952
$0.001952$0.001952
+5.22%
USD
Grafico dei prezzi in tempo reale di Belong (LONG)
Disclaimer: gli articoli ripubblicati su questo sito provengono da piattaforme pubbliche e sono forniti esclusivamente a scopo informativo. Non riflettono necessariamente le opinioni di MEXC. Tutti i diritti rimangono agli autori originali. Se ritieni che un contenuto violi i diritti di terze parti, contatta crypto.news@mexc.com per la rimozione. MEXC non fornisce alcuna garanzia in merito all'accuratezza, completezza o tempestività del contenuto e non è responsabile per eventuali azioni intraprese sulla base delle informazioni fornite. Il contenuto non costituisce consulenza finanziaria, legale o professionale di altro tipo, né deve essere considerato una raccomandazione o un'approvazione da parte di MEXC.

Potrebbe anche piacerti

Franklin Templeton CEO Dismisses 50bps Rate Cut Ahead FOMC

Franklin Templeton CEO Dismisses 50bps Rate Cut Ahead FOMC

The post Franklin Templeton CEO Dismisses 50bps Rate Cut Ahead FOMC appeared on BitcoinEthereumNews.com. Franklin Templeton CEO Jenny Johnson has weighed in on whether the Federal Reserve should make a 25 basis points (bps) Fed rate cut or 50 bps cut. This comes ahead of the Fed decision today at today’s FOMC meeting, with the market pricing in a 25 bps cut. Bitcoin and the broader crypto market are currently trading flat ahead of the rate cut decision. Franklin Templeton CEO Weighs In On Potential FOMC Decision In a CNBC interview, Jenny Johnson said that she expects the Fed to make a 25 bps cut today instead of a 50 bps cut. She acknowledged the jobs data, which suggested that the labor market is weakening. However, she noted that this data is backward-looking, indicating that it doesn’t show the current state of the economy. She alluded to the wage growth, which she remarked is an indication of a robust labor market. She added that retail sales are up and that consumers are still spending, despite inflation being sticky at 3%, which makes a case for why the FOMC should opt against a 50-basis-point Fed rate cut. In line with this, the Franklin Templeton CEO said that she would go with a 25 bps rate cut if she were Jerome Powell. She remarked that the Fed still has the October and December FOMC meetings to make further cuts if the incoming data warrants it. Johnson also asserted that the data show a robust economy. However, she noted that there can’t be an argument for no Fed rate cut since Powell already signaled at Jackson Hole that they were likely to lower interest rates at this meeting due to concerns over a weakening labor market. Notably, her comment comes as experts argue for both sides on why the Fed should make a 25 bps cut or…
Condividi
BitcoinEthereumNews2025/09/18 00:36
From Early Trading Losses to Global Impact: Somesh’s Journey to Building an Int’l Trading Community

From Early Trading Losses to Global Impact: Somesh’s Journey to Building an Int’l Trading Community

When Somesh started trading at 19, he lost nearly everything in three weeks. Today, he’s one of the most-followed day traders in the world with over one million
Condividi
Techbullion2026/03/24 13:12
USD/JPY Forecast: Critical Surge to 158.80 as Bulls Face Decisive 200-EMA Test

USD/JPY Forecast: Critical Surge to 158.80 as Bulls Face Decisive 200-EMA Test

BitcoinWorld USD/JPY Forecast: Critical Surge to 158.80 as Bulls Face Decisive 200-EMA Test TOKYO, May 2025 – The USD/JPY currency pair has surged decisively into
Condividi
bitcoinworld2026/03/24 13:05