OpenAI launches prompt-based safety policies and gpt-oss-safeguard model to help developers build age-appropriate AI protections for teenage users. (Read More)OpenAI launches prompt-based safety policies and gpt-oss-safeguard model to help developers build age-appropriate AI protections for teenage users. (Read More)

OpenAI Releases Open-Source Teen Safety Tools for AI Developers

2026/03/25 02:42
3 min di lettura
Per feedback o dubbi su questo contenuto, contattateci all'indirizzo crypto.news@mexc.com.

OpenAI Releases Open-Source Teen Safety Tools for AI Developers

Luisa Crawford Mar 24, 2026 18:42

OpenAI launches prompt-based safety policies and gpt-oss-safeguard model to help developers build age-appropriate AI protections for teenage users.

OpenAI Releases Open-Source Teen Safety Tools for AI Developers

OpenAI dropped a new toolkit on March 24 aimed squarely at one of AI's thorniest problems: keeping teenage users safe without neutering the technology's usefulness. The release includes prompt-based safety policies designed to work with gpt-oss-safeguard, the company's open-weight safety model available on Hugging Face.

The policies target six risk categories that disproportionately affect younger users: graphic violent and sexual content, harmful body ideals, dangerous challenges, romantic or violent roleplay, and age-restricted goods and services. Developers can plug these prompts directly into their content moderation systems for real-time filtering or batch analysis.

Why This Matters for the AI Ecosystem

Most developers building AI applications face a frustrating gap between knowing they need teen safety measures and actually implementing them. Translating "protect kids from harmful content" into operational code requires both child development expertise and deep technical knowledge—a combination few teams possess.

"One of the biggest gaps in AI safety for teens has been the lack of clear, operational policies that developers can build from," said Robbie Torney, Head of AI & Digital Assessments at Common Sense Media, who helped shape the policies. "Many times, developers are starting from scratch."

The timing feels relevant given recent Microsoft research from February showing that single benign-sounding prompts can systematically strip safety guardrails from major language models. That vulnerability makes robust, well-tested safety policies more valuable—developers can't just wing it.

What's Actually in the Release

OpenAI structured these policies as prompts rather than hard-coded rules, which means developers can adapt them to specific use cases and iterate over time. The company worked with Common Sense Media and everyone.ai to define edge cases and refine the policy language.

Dr. Mathilde Cerioli, Chief Scientist at everyone.ai, noted that content filtering is just the starting point. Her team has already built on this work to create behavioral policies addressing risks like "exclusivity and overreliance"—the tendency of AI systems to become too central to a teen's social or emotional life.

The policies are being released through the ROOST Model Community on GitHub, explicitly inviting the developer community to translate them into other languages and extend coverage to additional risk areas.

The Limitations

OpenAI is clear these policies represent a floor, not a ceiling. The company explicitly states they don't reflect the full extent of its internal safeguards and shouldn't be treated as comprehensive teen safety solutions.

"Each application has unique risks, audiences and contexts," the release notes. Developers still need to layer these policies with product design decisions, user controls, monitoring systems, and what OpenAI calls "teen-friendly transparency."

This release builds on OpenAI's broader push for youth protection, including the Model Spec's Under-18 principles, parental controls in ChatGPT, and the Teen Safety Blueprint the company has been promoting as an industry standard. Whether competitors adopt similar open-source approaches will determine if this becomes a genuine ecosystem improvement or just an OpenAI talking point.

Image source: Shutterstock
  • openai
  • ai safety
  • teen protection
  • open source
  • gpt-oss-safeguard
Opportunità di mercato
Logo Prompt
Valore Prompt (PROMPT)
$0.03543
$0.03543$0.03543
-2.47%
USD
Grafico dei prezzi in tempo reale di Prompt (PROMPT)
Disclaimer: gli articoli ripubblicati su questo sito provengono da piattaforme pubbliche e sono forniti esclusivamente a scopo informativo. Non riflettono necessariamente le opinioni di MEXC. Tutti i diritti rimangono agli autori originali. Se ritieni che un contenuto violi i diritti di terze parti, contatta crypto.news@mexc.com per la rimozione. MEXC non fornisce alcuna garanzia in merito all'accuratezza, completezza o tempestività del contenuto e non è responsabile per eventuali azioni intraprese sulla base delle informazioni fornite. Il contenuto non costituisce consulenza finanziaria, legale o professionale di altro tipo, né deve essere considerato una raccomandazione o un'approvazione da parte di MEXC.

Potrebbe anche piacerti

One Of Frank Sinatra’s Most Famous Albums Is Back In The Spotlight

One Of Frank Sinatra’s Most Famous Albums Is Back In The Spotlight

The post One Of Frank Sinatra’s Most Famous Albums Is Back In The Spotlight appeared on BitcoinEthereumNews.com. Frank Sinatra’s The World We Knew returns to the Jazz Albums and Traditional Jazz Albums charts, showing continued demand for his timeless music. Frank Sinatra performs on his TV special Frank Sinatra: A Man and his Music Bettmann Archive These days on the Billboard charts, Frank Sinatra’s music can always be found on the jazz-specific rankings. While the art he created when he was still working was pop at the time, and later classified as traditional pop, there is no such list for the latter format in America, and so his throwback projects and cuts appear on jazz lists instead. It’s on those charts where Sinatra rebounds this week, and one of his popular projects returns not to one, but two tallies at the same time, helping him increase the total amount of real estate he owns at the moment. Frank Sinatra’s The World We Knew Returns Sinatra’s The World We Knew is a top performer again, if only on the jazz lists. That set rebounds to No. 15 on the Traditional Jazz Albums chart and comes in at No. 20 on the all-encompassing Jazz Albums ranking after not appearing on either roster just last frame. The World We Knew’s All-Time Highs The World We Knew returns close to its all-time peak on both of those rosters. Sinatra’s classic has peaked at No. 11 on the Traditional Jazz Albums chart, just missing out on becoming another top 10 for the crooner. The set climbed all the way to No. 15 on the Jazz Albums tally and has now spent just under two months on the rosters. Frank Sinatra’s Album With Classic Hits Sinatra released The World We Knew in the summer of 1967. The title track, which on the album is actually known as “The World We Knew (Over and…
Condividi
BitcoinEthereumNews2025/09/18 00:02
Data focus shifts to payrolls – Societe Generale

Data focus shifts to payrolls – Societe Generale

The post Data focus shifts to payrolls – Societe Generale appeared on BitcoinEthereumNews.com. Societe Generale analysts note a quiet data calendar ahead of key
Condividi
BitcoinEthereumNews2026/04/02 17:52
MEXC Chain Observation Daily Day 1

MEXC Chain Observation Daily Day 1

On May 15, 2026, the US Senate Banking Committee passed the CLARITY bill, Winklevoss Twins invested 100 million USD in Gemini via Bitcoin, Coinbase became the official USDC treasury deployer on Hyperliquid, CME planned Nasdaq crypto index futures, and Tether froze over 450 million USD of illicit assets. Industry trends include Consensys delaying its IPO, Kraken switching to Chainlink CCIP, Strive launching a daily dividend security with 13.88 percent yield, and major funding rounds for Onramp, Turnkey, Fasset, and Stitch. MEXC platform data shows top gainers ENM, PEAQ, TROLLSOL and high volume in BTC, ETH, XRP. Upcoming token unlocks for PYTH, Humanity, TON, and MemeCore pose selling pressure. Users are warned against phishing scams and advised to use only official channels.
Condividi
MEXC NEWS2026/05/15 10:16

KAIO Global Debut

KAIO Global DebutKAIO Global Debut

Enjoy 0-fee KAIO trading and tap into the RWA boom