The post TorchForge RL Pipelines Now Operable on Together AI’s Cloud appeared on BitcoinEthereumNews.com. Jessie A Ellis Dec 04, 2025 17:54 Together AI introduces TorchForge RL pipelines on its cloud platform, enhancing distributed training and sandboxed environments with a BlackJack training demo. TorchForge reinforcement learning (RL) pipelines are now seamlessly operable on Together AI’s Instant Clusters, offering robust support for distributed training, tool execution, and sandboxed environments, as demonstrated by an open-source BlackJack training demo, according to together.ai. The AI Native Cloud: Foundation for Next-Gen RL In the rapidly evolving field of reinforcement learning, building flexible and scalable systems necessitates compatible and efficient compute frameworks and tooling. Modern RL pipelines have transcended basic training loops, now relying heavily on distributed rollouts, high-throughput inference, and a coordinated use of CPU and GPU resources. The comprehensive PyTorch stack, inclusive of TorchForge and Monarch, now operates with distributed training capabilities on Together Instant Clusters. These clusters provide: Low-latency GPU communication: Utilizing InfiniBand/NVLink topologies for efficient RDMA-based data transfers and distributed actor messaging. Consistent cluster bring-up: Preconfigured with drivers, NCCL, CUDA, and the GPU operator, enabling PyTorch distributed jobs to run without manual setup. Heterogeneous RL workload scheduling: Optimized GPU nodes for policy replicas and trainers, alongside CPU-optimized nodes for environment and tool execution. Together AI’s clusters are aptly suited for RL frameworks that require a blend of GPU-bound model computation and CPU-bound environment workloads. Advanced Tool Integration and Demonstration A significant portion of RL workloads involves executing tools, running code, or interacting with sandboxed environments. Together AI’s platform natively supports these requirements through: Together CodeSandbox: MicroVM environments tailored for tool-use, coding tasks, and simulations. Together Code Interpreter: Facilitates fast, isolated Python execution suitable for unit-test-based reward functions or code-evaluation tasks. Both CodeSandbox and Code Interpreter integrate with OpenEnv and TorchForge environment services, allowing rollout workers to utilize these tools… The post TorchForge RL Pipelines Now Operable on Together AI’s Cloud appeared on BitcoinEthereumNews.com. Jessie A Ellis Dec 04, 2025 17:54 Together AI introduces TorchForge RL pipelines on its cloud platform, enhancing distributed training and sandboxed environments with a BlackJack training demo. TorchForge reinforcement learning (RL) pipelines are now seamlessly operable on Together AI’s Instant Clusters, offering robust support for distributed training, tool execution, and sandboxed environments, as demonstrated by an open-source BlackJack training demo, according to together.ai. The AI Native Cloud: Foundation for Next-Gen RL In the rapidly evolving field of reinforcement learning, building flexible and scalable systems necessitates compatible and efficient compute frameworks and tooling. Modern RL pipelines have transcended basic training loops, now relying heavily on distributed rollouts, high-throughput inference, and a coordinated use of CPU and GPU resources. The comprehensive PyTorch stack, inclusive of TorchForge and Monarch, now operates with distributed training capabilities on Together Instant Clusters. These clusters provide: Low-latency GPU communication: Utilizing InfiniBand/NVLink topologies for efficient RDMA-based data transfers and distributed actor messaging. Consistent cluster bring-up: Preconfigured with drivers, NCCL, CUDA, and the GPU operator, enabling PyTorch distributed jobs to run without manual setup. Heterogeneous RL workload scheduling: Optimized GPU nodes for policy replicas and trainers, alongside CPU-optimized nodes for environment and tool execution. Together AI’s clusters are aptly suited for RL frameworks that require a blend of GPU-bound model computation and CPU-bound environment workloads. Advanced Tool Integration and Demonstration A significant portion of RL workloads involves executing tools, running code, or interacting with sandboxed environments. Together AI’s platform natively supports these requirements through: Together CodeSandbox: MicroVM environments tailored for tool-use, coding tasks, and simulations. Together Code Interpreter: Facilitates fast, isolated Python execution suitable for unit-test-based reward functions or code-evaluation tasks. Both CodeSandbox and Code Interpreter integrate with OpenEnv and TorchForge environment services, allowing rollout workers to utilize these tools…

TorchForge RL Pipelines Now Operable on Together AI’s Cloud

2025/12/06 15:05


Jessie A Ellis
Dec 04, 2025 17:54

Together AI introduces TorchForge RL pipelines on its cloud platform, enhancing distributed training and sandboxed environments with a BlackJack training demo.

TorchForge reinforcement learning (RL) pipelines are now seamlessly operable on Together AI’s Instant Clusters, offering robust support for distributed training, tool execution, and sandboxed environments, as demonstrated by an open-source BlackJack training demo, according to together.ai.

The AI Native Cloud: Foundation for Next-Gen RL

In the rapidly evolving field of reinforcement learning, building flexible and scalable systems necessitates compatible and efficient compute frameworks and tooling. Modern RL pipelines have transcended basic training loops, now relying heavily on distributed rollouts, high-throughput inference, and a coordinated use of CPU and GPU resources.

The comprehensive PyTorch stack, inclusive of TorchForge and Monarch, now operates with distributed training capabilities on Together Instant Clusters. These clusters provide:

  • Low-latency GPU communication: Utilizing InfiniBand/NVLink topologies for efficient RDMA-based data transfers and distributed actor messaging.
  • Consistent cluster bring-up: Preconfigured with drivers, NCCL, CUDA, and the GPU operator, enabling PyTorch distributed jobs to run without manual setup.
  • Heterogeneous RL workload scheduling: Optimized GPU nodes for policy replicas and trainers, alongside CPU-optimized nodes for environment and tool execution.

Together AI’s clusters are aptly suited for RL frameworks that require a blend of GPU-bound model computation and CPU-bound environment workloads.

Advanced Tool Integration and Demonstration

A significant portion of RL workloads involves executing tools, running code, or interacting with sandboxed environments. Together AI’s platform natively supports these requirements through:

  • Together CodeSandbox: MicroVM environments tailored for tool-use, coding tasks, and simulations.
  • Together Code Interpreter: Facilitates fast, isolated Python execution suitable for unit-test-based reward functions or code-evaluation tasks.

Both CodeSandbox and Code Interpreter integrate with OpenEnv and TorchForge environment services, allowing rollout workers to utilize these tools during training.

BlackJack Training Demo

Together AI has released a demonstration of a TorchForge RL pipeline running on its Instant Clusters, interacting with an OpenEnv environment hosted on Together CodeSandbox. This demo, adapted from a Meta reference implementation, trains a Qwen 1.5B model to play BlackJack using GRPO. The RL pipeline integrates a vLLM policy server, BlackJack environment, reference model, off-policy replay buffer, and a TorchTitan trainer—connected through Monarch’s actor mesh and using TorchStore for weight synchronization.

The OpenEnv GRPO BlackJack repository includes Kubernetes manifests and setup scripts. Deployment and training initiation are streamlined with simple kubectl commands, allowing experimentation with model configurations and GRPO hyperparameter adjustments.

Additionally, a standalone integration wraps Together’s Code Interpreter as an OpenEnv environment, enabling RL agents to interact with the Interpreter like any other environment. This integration allows RL pipelines to be applied to diverse tasks such as coding and mathematical reasoning.

The demonstrations highlight that sophisticated, multi-component RL training can be conducted on the Together AI Cloud with ease, setting the stage for a flexible, open RL framework in the PyTorch ecosystem, scalable on the Together AI Cloud.

Image source: Shutterstock

Source: https://blockchain.news/news/torchforge-rl-pipelines-operable-together-ai-cloud

Disclaimer: The articles reposted on this site are sourced from public platforms and are provided for informational purposes only. They do not necessarily reflect the views of MEXC. All rights remain with the original authors. If you believe any content infringes on third-party rights, please contact service@support.mexc.com for removal. MEXC makes no guarantees regarding the accuracy, completeness, or timeliness of the content and is not responsible for any actions taken based on the information provided. The content does not constitute financial, legal, or other professional advice, nor should it be considered a recommendation or endorsement by MEXC.

You May Also Like

OSL Hong Kong Lists XRP for Professional Investors Amid Signs of Sustained Market Interest

OSL Hong Kong Lists XRP for Professional Investors Amid Signs of Sustained Market Interest

The post OSL Hong Kong Lists XRP for Professional Investors Amid Signs of Sustained Market Interest appeared on BitcoinEthereumNews.com. OSL Hong Kong has listed XRP for professional investors, enabling deposits, withdrawals, and trading through pairs like XRP/HKD, XRP/USD, and XRP/USDT. This move supports Hong Kong’s regulated framework and reflects growing institutional interest in XRP amid ETF inflows exceeding $897 million. OSL Hong Kong launches XRP trading for professional investors under local licensing rules, expanding access to regulated digital asset services. XRP pairs including XRP/HKD, XRP/USD, and XRP/USDT are now available via Flash Trade, OTC channels, and the XRP Ledger. Market data from Santiment and SoSo indicates sustained accumulation by large holders, with $897.35 million in XRP ETF inflows despite a 32% market cap drop over two months. Discover how OSL Hong Kong’s XRP listing boosts professional trading options amid rising ETF interest. Explore key details, market insights, and implications for investors in this regulated expansion. What is the Significance of OSL Hong Kong Listing XRP? OSL Hong Kong’s listing of XRP marks a key expansion in regulated cryptocurrency trading for professional investors in the region. The exchange, licensed under Hong Kong’s Securities and Futures Commission, now supports XRP deposits, withdrawals, and trading through established pairs, enhancing accessibility via the XRP Ledger. This development aligns with broader institutional adoption trends, providing secure channels for cross-border transaction capabilities inherent to XRP. How Does OSL Hong Kong Facilitate XRP Trading? OSL Hong Kong enables XRP trading exclusively for professional investors, adhering to local regulatory standards that define eligibility based on financial expertise and net worth criteria. Trading pairs such as XRP/HKD, XRP/USD, and XRP/USDT became available this week, with operations routed through the platform’s Flash Trade for spot trading and OTC desk for larger transactions. Deposits and withdrawals integrate directly with the XRP Ledger, ensuring efficient settlement times of just a few seconds, as per blockchain specifications. The exchange’s official announcement emphasized…
Share
BitcoinEthereumNews2025/12/07 23:12
XRP Dips 6% Yet Spot ETFs Draw Steady Inflows Amid Potential Consolidation

XRP Dips 6% Yet Spot ETFs Draw Steady Inflows Amid Potential Consolidation

The post XRP Dips 6% Yet Spot ETFs Draw Steady Inflows Amid Potential Consolidation appeared on BitcoinEthereumNews.com. XRP experienced a 6% price slip last week, yet spot ETF inflows exceeded $10 million, signaling robust investor confidence. This resilience stems from steady open interest and positive funding rates, indicating long-term holders are undeterred by short-term volatility in the XRP market. XRP spot ETF inflows reached $10.23 million daily, pushing total net assets to $861.32 million despite price dips. XRP traded near $2.02, with consistent buying even on quieter market days. Momentum indicators like RSI and CMF show weak but stable demand, with capital flow remaining slightly positive at 0.04. Discover why XRP’s 6% dip didn’t deter investors, with strong ETF inflows and steady open interest. Explore the latest XRP price action and market signals for informed decisions. What Are the Latest XRP ETF Inflows and Their Impact? XRP ETF inflows demonstrated impressive resilience last week, totaling over $10.23 million in daily net additions despite the token’s 6% price decline. This surge, highlighted by a peak of more than $240 million earlier in the period, underscores sustained institutional interest in XRP. Total net assets under management climbed to $861.32 million, reflecting a broader trend of accumulation amid market fluctuations. How Has XRP’s Price Action Evolved Amid Recent Volatility? XRP’s price action has shown a pattern of consolidation around the $2.05 level, retreating from recent highs as resistance at $2.10 consistently capped upward moves. Technical indicators reveal a cooling but controlled environment: the Relative Strength Index (RSI) indicated subdued momentum without entering oversold territory, while the Chaikin Money Flow (CMF) hovered near 0.04, suggesting modest positive capital inflows. Data from TradingView illustrates this stability, with XRP positioned below the 20-day Exponential Moving Average (EMA) at $2.29, yet avoiding panic selling. According to market analysts at SoSoValue, such indicators point to a healthy pause rather than a bearish reversal. This phase…
Share
BitcoinEthereumNews2025/12/07 23:30
GBP trades firmly against US Dollar

GBP trades firmly against US Dollar

The post GBP trades firmly against US Dollar appeared on BitcoinEthereumNews.com. Pound Sterling trades firmly against US Dollar ahead of Fed’s policy outcome The Pound Sterling (GBP) clings to Tuesday’s gains near 1.3640 against the US Dollar (USD) during the European trading session on Wednesday. The GBP/USD pair holds onto gains as the US Dollar remains on the back foot amid firm expectations that the Federal Reserve (Fed) will cut interest rates in the monetary policy announcement at 18:00 GMT. At the time of writing, the US Dollar Index (DXY), which tracks the Greenback’s value against six major currencies, holds onto losses near a fresh two-month low of 96.60 posted on Tuesday. Read more… UK inflation unchanged at 3.8%, Pound shrugs The British pound is unchanged on Wednesday, trading at 1.3645 in the European session. Today’s inflation report was a dour reminder that UK inflation remains entrenched. CPI for August was unchanged at 3.8% y/y, matching the consensus and its highest level since January 2024. Airfares decreased but this was offset by food and petrol prices. Monthly, CPI rose 0.3%, up from 0.1% in July and matching the consensus. Core CPI, which excludes volatile items such as food and energy, eased to 3.6% from 3.8%. Monthly, core CPI ticked up to 0.3% from 0.2%. The inflation report comes just a day before the Bank of England announces its rate decision. Inflation is almost double the BoE’s target of 2% and today’s release likely means that the BoE will not reduce rates before 2026. Read more… Source: https://www.fxstreet.com/news/pound-sterling-price-news-and-forecast-gbp-trades-firmly-against-us-dollar-ahead-of-feds-policy-outcome-202509171209
Share
BitcoinEthereumNews2025/09/18 01:50