The post how to protect against AI attacks appeared on BitcoinEthereumNews.com. It only took a calendar invite containing a jailbreak prompt to highlight how an AI agent connected via the Model Context Protocol (MCP) can be prompted to exfiltrate data. Signals and mitigations for this type of prompt injection have been formalized in the OWASP guidelines for GenAI, which update the LLM01 risk on April 17, 2025 OWASP GenAI.  Hence the idea relaunched by Vitalik Buterin: to adopt a human jury that oversees decisions and crypto treasuries, accompanied — but not replaced — by language models. In this context, the priority becomes keeping the human as the final arbiter. Exploit MCP: what happened and why it matters for crypto treasuries The researcher Eito Miyamura (as reported by BitcoinEthereumNews) illustrated an attack where a simple calendar invitation, filled with a malicious prompt, convinces the AI agent to read private emails and forward contents to an attacker. The vector exploits the MCP integration chain with Gmail, calendars, SharePoint, and Notion: more connectors mean a wider attack surface. It should be noted that the apparent innocuousness of the content increases the risk. In contexts where MCP operates in developer mode, human consensus is required for sensitive actions. However, decision fatigue can turn confirmation prompts into automatisms; and when treasuries or workflows involving files and credentials are at stake, human error becomes a single point of failure. That said, decoupling permissions and critical steps remains essential. Industry analysts note that indirect prompt injections — that is, content not visible to the human eye but interpretable by the LLM — represent a growing class of risk, as documented by OWASP in its April 2025 update. In red-teaming tests conducted by specialized security teams in the first half of 2025, scenarios with multiple integrations (email, calendar, file storage) showed how the lack of segmentation significantly increases the… The post how to protect against AI attacks appeared on BitcoinEthereumNews.com. It only took a calendar invite containing a jailbreak prompt to highlight how an AI agent connected via the Model Context Protocol (MCP) can be prompted to exfiltrate data. Signals and mitigations for this type of prompt injection have been formalized in the OWASP guidelines for GenAI, which update the LLM01 risk on April 17, 2025 OWASP GenAI.  Hence the idea relaunched by Vitalik Buterin: to adopt a human jury that oversees decisions and crypto treasuries, accompanied — but not replaced — by language models. In this context, the priority becomes keeping the human as the final arbiter. Exploit MCP: what happened and why it matters for crypto treasuries The researcher Eito Miyamura (as reported by BitcoinEthereumNews) illustrated an attack where a simple calendar invitation, filled with a malicious prompt, convinces the AI agent to read private emails and forward contents to an attacker. The vector exploits the MCP integration chain with Gmail, calendars, SharePoint, and Notion: more connectors mean a wider attack surface. It should be noted that the apparent innocuousness of the content increases the risk. In contexts where MCP operates in developer mode, human consensus is required for sensitive actions. However, decision fatigue can turn confirmation prompts into automatisms; and when treasuries or workflows involving files and credentials are at stake, human error becomes a single point of failure. That said, decoupling permissions and critical steps remains essential. Industry analysts note that indirect prompt injections — that is, content not visible to the human eye but interpretable by the LLM — represent a growing class of risk, as documented by OWASP in its April 2025 update. In red-teaming tests conducted by specialized security teams in the first half of 2025, scenarios with multiple integrations (email, calendar, file storage) showed how the lack of segmentation significantly increases the…

how to protect against AI attacks

2025/09/16 00:01
6 min di lettura
Per feedback o dubbi su questo contenuto, contattateci all'indirizzo crypto.news@mexc.com.

It only took a calendar invite containing a jailbreak prompt to highlight how an AI agent connected via the Model Context Protocol (MCP) can be prompted to exfiltrate data. Signals and mitigations for this type of prompt injection have been formalized in the OWASP guidelines for GenAI, which update the LLM01 risk on April 17, 2025 OWASP GenAI. 

Hence the idea relaunched by Vitalik Buterin: to adopt a human jury that oversees decisions and crypto treasuries, accompanied — but not replaced — by language models. In this context, the priority becomes keeping the human as the final arbiter.

Exploit MCP: what happened and why it matters for crypto treasuries

The researcher Eito Miyamura (as reported by BitcoinEthereumNews) illustrated an attack where a simple calendar invitation, filled with a malicious prompt, convinces the AI agent to read private emails and forward contents to an attacker. The vector exploits the MCP integration chain with Gmail, calendars, SharePoint, and Notion: more connectors mean a wider attack surface. It should be noted that the apparent innocuousness of the content increases the risk.

In contexts where MCP operates in developer mode, human consensus is required for sensitive actions. However, decision fatigue can turn confirmation prompts into automatisms; and when treasuries or workflows involving files and credentials are at stake, human error becomes a single point of failure. That said, decoupling permissions and critical steps remains essential.

Industry analysts note that indirect prompt injections — that is, content not visible to the human eye but interpretable by the LLM — represent a growing class of risk, as documented by OWASP in its April 2025 update. In red-teaming tests conducted by specialized security teams in the first half of 2025, scenarios with multiple integrations (email, calendar, file storage) showed how the lack of segmentation significantly increases the likelihood of exfiltration if filters and least-privilege policies are not applied.

Vitalik Buterin’s Proposal: A Human Jury Assisted by AI

“One must always start from a fundamental truth signal that one trusts. I think realistically it should be a human jury, where the individual jurors are obviously assisted by all the LLMs.”

Vitalik Buterin (AMBCrypto)

Buterin indicates a path of verification that starts from the human: a jury composed of people with complementary skills, supported by models for analysis and synthesis, but with the final say on critical decisions. In this context, the jury acts as an “anchor” against automatic manipulation and operational hallucinations when artificial intelligence accesses financial assets or high-impact permissions.

Info-finance: “open market” governance with human control

The concept of info-finance shifts governance towards a market of proposals: different frameworks and policies compete publicly, while spot checks and verdicts remain in the hands of the jury. It is a natural extension of the practices adopted in DAOs and in DeFi, which prioritize transparency, distributed accountability, and incentives for continuous auditing.

Buterin warns that if fund allocation is entrusted to an AI, hostile actors could insert payloads like “gimme all the money” in documents, invitations, and comments. For this reason, info-finance focuses on traceability of decisions and human controls on the steps that move capital. Yet, the procedural component remains as important as the technical one.

Ethereum Foundation: more transparency on the treasury and focus on sustainability

In this vision, Buterin explained that the Ethereum Foundation is updating its Treasury Policy – a document published on June 4, 2025 – with goals for more active management and operational limits to ensure long-term sustainability. Industry reports indicate that, as of October 31, 2024, the declared treasury was approximately 970.2 million dollars, a figure used as a reference for the new rules on ETH sales and operational limits. Additionally, Buterin mentioned Codex, a layer 2 oriented towards payments in stablecoin, as a possible infrastructure for “large‑scale value” use cases – a strategic move aimed at strengthening resilience and adoption, although some details are yet to be verified.

How to Structure a Human Jury for Treasury Governance

  • Composition: mixed profiles (security, legal, finance, operations). Periodic rotation and partial anonymity to reduce bias and pressure.
  • Mandate: clearly define the blocking actions (e.g., permission changes, execution of transactions, connection of new AI connectors).
  • Process: double verification (4‑eyes or multi‑sig) with immutable audit logs and explicit reasoning saved on‑chain or in verifiable archives.
  • Incentives: compensation for time and responsibility, with penalties in case of proven negligence.
  • Conflicts of Interest: mandatory disclosure, abstention, and independent review on sensitive cases.

MCP, jailbreak and “Goodharting”: two risks to keep distinct

  • Jailbreak via MCP: hidden prompts in ordinary content (invitations, notes, documents) exploit AI connected to real tools, with the risk of unintentional execution of actions or a data breach.
  • Goodharting: when a metric becomes a target, it ceases to measure what it should, leading to apparent but distorted optimizations (for example, “rigged” performance to maximize a specific score).

Operational Checklist: 7 Moves to Reduce Risk Today

  • Connector Segmentation: separate test and production environments. Limit AI to sandbox mailboxes and calendars.
  • Robust Approvals: disable auto-approve features; require 2FA and multi-sig for actions involving treasury and permissions.
  • Content Filters: block or sanitize invitations and external documents, detecting anomalous prompts before they reach the agent.
  • Least privilege: grant the AI only the minimum permissions necessary, rotating tokens and keys frequently.
  • Monitoring: real-time alerts for unusual activities and logs accessible to the jury.
  • Red-teaming test: periodic simulation campaigns (e.g., malicious calendar invites) with reports to governance.
  • Incident playbook: clear procedures for revoking connectors, isolating AI, and timely notification to stakeholders.

Mini‑FAQ

  • What does the MCP exploit via calendar invitation demonstrate? It demonstrates that a single content can convey a prompt capable of guiding an AI agent connected to real tools, impacting privacy and operational integrity.
  • What is the “AI-assisted human jury”? It is a mechanism where humans make the final decisions, leveraging AI for analysis and research, especially when money or permits are at stake.
  • What is info-finance? It is a form of governance where policies and frameworks compete in an open market, but high-risk operations remain subject to human oversight and regular audits.
  • How are treasuries protected today? Through the use of multi-sig, operational limits, role segregation, and a human jury that validates transactions, new integrations, and changes in permissions.

Implications and What to Watch in the Coming Months

Security is not just a technical issue; it requires processes, transparency, and verifiable accountability. As Buterin points out, the problem of jailbreaking is not binary, while the phenomenon of Goodharting represents a subtle form of metric “fraud.” In a growing automation context, info-finance supported by a human jury acts as a pragmatic parachute to mitigate risks on treasuries and critical decisions.

Source: https://en.cryptonomist.ch/2025/09/15/vitalik-buterin-relaunches-the-human-jury-this-is-how-info-finance-can-safeguard-crypto-treasuries-from-ai-attacks-after-the-mcp-exploit/

Opportunità di mercato
Logo Prompt
Valore Prompt (PROMPT)
$0.04331
$0.04331$0.04331
-1.79%
USD
Grafico dei prezzi in tempo reale di Prompt (PROMPT)
Disclaimer: gli articoli ripubblicati su questo sito provengono da piattaforme pubbliche e sono forniti esclusivamente a scopo informativo. Non riflettono necessariamente le opinioni di MEXC. Tutti i diritti rimangono agli autori originali. Se ritieni che un contenuto violi i diritti di terze parti, contatta crypto.news@mexc.com per la rimozione. MEXC non fornisce alcuna garanzia in merito all'accuratezza, completezza o tempestività del contenuto e non è responsabile per eventuali azioni intraprese sulla base delle informazioni fornite. Il contenuto non costituisce consulenza finanziaria, legale o professionale di altro tipo, né deve essere considerato una raccomandazione o un'approvazione da parte di MEXC.

Potrebbe anche piacerti

Ray Dalio: Five major forces shaping the economy, the US faces a $9 trillion debt rollover challenge, and why gold remains the most established form of money

Ray Dalio: Five major forces shaping the economy, the US faces a $9 trillion debt rollover challenge, and why gold remains the most established form of money

The post Ray Dalio: Five major forces shaping the economy, the US faces a $9 trillion debt rollover challenge, and why gold remains the most established form of
Condividi
BitcoinEthereumNews2026/03/04 05:53
Trump urges passage of U.S. Clarity Act, attacks banks for 'undercutting' GENIUS

Trump urges passage of U.S. Clarity Act, attacks banks for 'undercutting' GENIUS

Policy Share Share this article
Copy linkX (Twitter)LinkedInFacebookEmail
Trump urges passage of U.S. Clarity Act, atta
Condividi
Coindesk2026/03/04 06:19
United States Building Permits Change dipped from previous -2.8% to -3.7% in August

United States Building Permits Change dipped from previous -2.8% to -3.7% in August

The post United States Building Permits Change dipped from previous -2.8% to -3.7% in August appeared on BitcoinEthereumNews.com. Information on these pages contains forward-looking statements that involve risks and uncertainties. Markets and instruments profiled on this page are for informational purposes only and should not in any way come across as a recommendation to buy or sell in these assets. You should do your own thorough research before making any investment decisions. FXStreet does not in any way guarantee that this information is free from mistakes, errors, or material misstatements. It also does not guarantee that this information is of a timely nature. Investing in Open Markets involves a great deal of risk, including the loss of all or a portion of your investment, as well as emotional distress. All risks, losses and costs associated with investing, including total loss of principal, are your responsibility. The views and opinions expressed in this article are those of the authors and do not necessarily reflect the official policy or position of FXStreet nor its advertisers. The author will not be held responsible for information that is found at the end of links posted on this page. If not otherwise explicitly mentioned in the body of the article, at the time of writing, the author has no position in any stock mentioned in this article and no business relationship with any company mentioned. The author has not received compensation for writing this article, other than from FXStreet. FXStreet and the author do not provide personalized recommendations. The author makes no representations as to the accuracy, completeness, or suitability of this information. FXStreet and the author will not be liable for any errors, omissions or any losses, injuries or damages arising from this information and its display or use. Errors and omissions excepted. The author and FXStreet are not registered investment advisors and nothing in this article is intended…
Condividi
BitcoinEthereumNews2025/09/18 02:20