Copied


AMD Unveils ROCm 6.2.3 Enhancing AI Performance on Radeon GPUs

Iris Coleman   Oct 13, 2024 02:37 2 Min Read


AMD has launched the latest iteration of its open compute software, AMD ROCm™ 6.2.3, specifically engineered to enhance the performance of Radeon GPUs on native Ubuntu® Linux® systems. This update is aimed at providing superior inference performance for AI models, notably the Llama 3 70BQ4, and enables developers to integrate Stable Diffusion (SD) 2.1 text-to-image capabilities into their AI projects, according to AMD.com.

Key Features of ROCm 6.2.3

The new ROCm 6.2.3 release brings several advanced features aimed at accelerating AI development:

  • Support for Llama 3 via vLLM: This feature provides exceptional inference performance on Radeon GPUs with the Llama 3 70BQ4 model.
  • Flash Attention 2 Integration: Designed to optimize memory usage and enhance inference speed, this feature supports forward enablement.
  • Stable Diffusion 2.1 Support: Developers can now incorporate SD text-to-image models into their AI applications.
  • Triton Framework Beta Support: This allows developers to write high-performance AI code with minimal expertise, utilizing AMD hardware efficiently.

Advancements in AI Development

Erik Hultgren, Software Product Manager at AMD, emphasized that ROCm 6.2.3 targets specific features to expedite generative AI development. The release includes professional-level performance enhancements for Large Language Model (LLM) inference via vLLM and Flash Attention 2. It also introduces beta support for the Triton framework, broadening the scope for AI development on AMD hardware.

Evolution of ROCm Support

AMD's ROCm support for Radeon GPUs has significantly evolved over the past year, starting with the 5.7 release. Version 6.0 expanded capabilities by incorporating the ONNX runtime and formally qualifying more Radeon GPUs, including the Radeon PRO W7800. The 6.1 update marked another milestone with multi-GPU configuration support and integration with the TensorFlow framework.

With the current release, ROCm 6.2.3 continues to focus on Linux® systems, with plans to introduce Windows® Subsystem for Linux® (WSL 2) support soon. This strategic approach aims to further enhance the ROCm solution stack for Radeon GPUs, positioning it as a robust option for AI and machine learning development.

For additional information and resources, visit AMD's official community page.


Read More
The Hong Kong Monetary Authority has issued a warning about a fraudulent website posing as OCBC Bank (Hong Kong) Limited, urging public vigilance.
BitMEX has changed the Mark Method for NILUSDTH25 and REDUSDTZ25 to Fair Price marking, effective March 25, 2025, enhancing price accuracy.
BitMEX introduces NILUSDT perpetual swaps, offering traders up to 50x leverage. This new listing enhances trading options on the platform.
Bitcoin remains vulnerable to downward pressure due to tight liquidity conditions and weak investor sentiment, with ETF outflows and cautious market behavior persisting.
Vodafone implements AI-driven solutions using LangChain and LangGraph to optimize data operations and improve performance metrics monitoring and information retrieval across its data centers.
BitMEX announces the introduction of NILUSDT perpetual swap listing, offering traders up to 50x leverage. The NIL token will be available for trading starting March 25, 2024.
Cronos (CRO) Labs has appointed Mirko Zhao as its new leader, succeeding Ken Timsit. Zhao aims to enhance the blockchain’s growth and community engagement.
Cronos (CRO) Labs announces Mirko Zhao as the new Head of Product and Engineering, succeeding Ken Timsit, to lead the blockchain ecosystem's innovative growth.