The post Revolutionizing Data Analytics: GPU-Native Velox and NVIDIA cuDF Integration appeared on BitcoinEthereumNews.com. Rongchai Wang Oct 06, 2025 06:01 NVIDIA and IBM collaborate to integrate GPU-native Velox with NVIDIA cuDF, enhancing data analytics performance on platforms like Presto and Apache Spark. As data-driven demands grow, NVIDIA and IBM have partnered to enhance data analytics capabilities by integrating GPU-native Velox with NVIDIA cuDF. This collaboration aims to deliver significant performance improvements over traditional CPU-based systems by leveraging the high memory bandwidth and thread count of GPUs, according to NVIDIA. These enhancements are particularly beneficial for compute-heavy workloads involving multiple joins, complex aggregations, and string processing. Velox and cuDF: A Powerful Combination The integration of NVIDIA cuDF into the Velox execution engine allows for GPU-native query execution on widely-used platforms like Presto and Apache Spark. This open project aims to address performance bottlenecks, enabling real-time insights from massive datasets. Velox acts as an intermediary, translating query plans from systems like Presto and Spark into executable GPU pipelines powered by cuDF. Accelerating Presto with GPU Power By moving the entire Presto query plan to GPU, the integration aims to boost execution speed significantly. Enhancements to GPU operators such as TableScan, HashJoin, and HashAggregations in Velox enable end-to-end GPU execution in Presto. Initial benchmarks show impressive runtime reductions, with Presto on NVIDIA GPUs achieving runtimes significantly lower than CPU counterparts. Multi-GPU Execution for Enhanced Performance The collaboration introduces a UCX-based Exchange operator, which supports the entire execution pipeline on GPUs, leveraging high bandwidth NVLink and RoCE or InfiniBand for connectivity. This setup allows for substantial performance gains, with Presto on GPU showcasing more than a sixfold speedup in data exchange processes. Hybrid Execution in Apache Spark In Apache Spark, the integration with Apache Gluten and cuDF focuses on offloading compute-intensive query stages to GPUs, optimizing resource use in hybrid… The post Revolutionizing Data Analytics: GPU-Native Velox and NVIDIA cuDF Integration appeared on BitcoinEthereumNews.com. Rongchai Wang Oct 06, 2025 06:01 NVIDIA and IBM collaborate to integrate GPU-native Velox with NVIDIA cuDF, enhancing data analytics performance on platforms like Presto and Apache Spark. As data-driven demands grow, NVIDIA and IBM have partnered to enhance data analytics capabilities by integrating GPU-native Velox with NVIDIA cuDF. This collaboration aims to deliver significant performance improvements over traditional CPU-based systems by leveraging the high memory bandwidth and thread count of GPUs, according to NVIDIA. These enhancements are particularly beneficial for compute-heavy workloads involving multiple joins, complex aggregations, and string processing. Velox and cuDF: A Powerful Combination The integration of NVIDIA cuDF into the Velox execution engine allows for GPU-native query execution on widely-used platforms like Presto and Apache Spark. This open project aims to address performance bottlenecks, enabling real-time insights from massive datasets. Velox acts as an intermediary, translating query plans from systems like Presto and Spark into executable GPU pipelines powered by cuDF. Accelerating Presto with GPU Power By moving the entire Presto query plan to GPU, the integration aims to boost execution speed significantly. Enhancements to GPU operators such as TableScan, HashJoin, and HashAggregations in Velox enable end-to-end GPU execution in Presto. Initial benchmarks show impressive runtime reductions, with Presto on NVIDIA GPUs achieving runtimes significantly lower than CPU counterparts. Multi-GPU Execution for Enhanced Performance The collaboration introduces a UCX-based Exchange operator, which supports the entire execution pipeline on GPUs, leveraging high bandwidth NVLink and RoCE or InfiniBand for connectivity. This setup allows for substantial performance gains, with Presto on GPU showcasing more than a sixfold speedup in data exchange processes. Hybrid Execution in Apache Spark In Apache Spark, the integration with Apache Gluten and cuDF focuses on offloading compute-intensive query stages to GPUs, optimizing resource use in hybrid…

Revolutionizing Data Analytics: GPU-Native Velox and NVIDIA cuDF Integration

2025/10/07 19:13


Rongchai Wang
Oct 06, 2025 06:01

NVIDIA and IBM collaborate to integrate GPU-native Velox with NVIDIA cuDF, enhancing data analytics performance on platforms like Presto and Apache Spark.





As data-driven demands grow, NVIDIA and IBM have partnered to enhance data analytics capabilities by integrating GPU-native Velox with NVIDIA cuDF. This collaboration aims to deliver significant performance improvements over traditional CPU-based systems by leveraging the high memory bandwidth and thread count of GPUs, according to NVIDIA. These enhancements are particularly beneficial for compute-heavy workloads involving multiple joins, complex aggregations, and string processing.

Velox and cuDF: A Powerful Combination

The integration of NVIDIA cuDF into the Velox execution engine allows for GPU-native query execution on widely-used platforms like Presto and Apache Spark. This open project aims to address performance bottlenecks, enabling real-time insights from massive datasets. Velox acts as an intermediary, translating query plans from systems like Presto and Spark into executable GPU pipelines powered by cuDF.

Accelerating Presto with GPU Power

By moving the entire Presto query plan to GPU, the integration aims to boost execution speed significantly. Enhancements to GPU operators such as TableScan, HashJoin, and HashAggregations in Velox enable end-to-end GPU execution in Presto. Initial benchmarks show impressive runtime reductions, with Presto on NVIDIA GPUs achieving runtimes significantly lower than CPU counterparts.

Multi-GPU Execution for Enhanced Performance

The collaboration introduces a UCX-based Exchange operator, which supports the entire execution pipeline on GPUs, leveraging high bandwidth NVLink and RoCE or InfiniBand for connectivity. This setup allows for substantial performance gains, with Presto on GPU showcasing more than a sixfold speedup in data exchange processes.

Hybrid Execution in Apache Spark

In Apache Spark, the integration with Apache Gluten and cuDF focuses on offloading compute-intensive query stages to GPUs, optimizing resource use in hybrid clusters. This strategy allows for efficient use of GPU resources while maintaining CPU availability for other tasks, resulting in significant performance improvements.

Community Involvement and Future Prospects

The open-source nature of this project encourages community involvement, aiming to drive further innovations across the data processing ecosystem. By implementing reusable GPU operators in Velox, the collaboration seeks to reduce duplication and simplify maintenance while accelerating various systems.

Image source: Shutterstock


Source: https://blockchain.news/news/revolutionizing-data-analytics-gpu-native-velox-nvidia-cudf-integration

Disclaimer: The articles reposted on this site are sourced from public platforms and are provided for informational purposes only. They do not necessarily reflect the views of MEXC. All rights remain with the original authors. If you believe any content infringes on third-party rights, please contact service@support.mexc.com for removal. MEXC makes no guarantees regarding the accuracy, completeness, or timeliness of the content and is not responsible for any actions taken based on the information provided. The content does not constitute financial, legal, or other professional advice, nor should it be considered a recommendation or endorsement by MEXC.

You May Also Like

Fed Acts on Economic Signals with Rate Cut

Fed Acts on Economic Signals with Rate Cut

In a significant pivot, the Federal Reserve reduced its benchmark interest rate following a prolonged ten-month hiatus. This decision, reflecting a strategic response to the current economic climate, has captured attention across financial sectors, with both market participants and policymakers keenly evaluating its potential impact.Continue Reading:Fed Acts on Economic Signals with Rate Cut
Share
Coinstats2025/09/18 02:28