Copied


Brave Browser Integrates RTX-Accelerated AI with Leo AI and Ollama

Zach Anderson   Oct 02, 2024 19:53 0 Min Read


The Brave browser, known for its privacy focus, has introduced a powerful AI assistant, Leo AI, enhanced by RTX-accelerated local large language models (LLMs) through a collaboration with Ollama, according to the NVIDIA Blog. This integration aims to improve user experience by providing efficient, locally processed AI capabilities.

Enhanced AI Experience with RTX Acceleration

Brave's Leo AI, powered by NVIDIA's RTX technology, offers users the ability to summarize articles, extract insights, and answer questions directly within the browser. This is achieved through the use of NVIDIA's Tensor Cores, which are designed to handle AI applications by processing numerous calculations simultaneously. The collaboration with Ollama allows Brave to leverage the open-source llama.cpp library, which facilitates AI inference tasks specifically optimized for NVIDIA's RTX GPUs.

Advantages of Local AI Processing

Running AI models locally on a PC provides significant privacy benefits, as it eliminates the need to send data to external servers. This local processing approach ensures user data remains private and accessible without the necessity of cloud services. Additionally, it allows users to interact with various specialized models, such as bilingual or code generation models, without incurring cloud service fees.

Technical Integration and Performance

Brave's integration with Ollama and RTX technology offers a responsive AI experience, with the Llama 3 8B model achieving processing speeds of up to 149 tokens per second. This setup ensures quick responses to user queries and content requests, enhancing the overall browsing experience with Leo AI.

Getting Started with Leo AI and Ollama

Users interested in utilizing these advanced AI capabilities can easily install Ollama from its official website. Once installed, Brave's Leo AI can be configured to use local models through Ollama, offering flexibility to switch between cloud and local models as needed. Developers can explore more about using Ollama and llama.cpp through resources provided by NVIDIA.


Read More
Google's Gemini 2.0 Flash LLM now integrates with ElevenLabs AI voice technology, enabling developers to build advanced conversational AI agents with improved speed and functionality.
NVIDIA unveils RTX Remix, a platform leveraging AI and ray tracing to breathe new life into classic games, offering modders powerful tools for enhanced graphics.
The Hong Kong Monetary Authority has issued a warning about a fraudulent website posing as OCBC Bank (Hong Kong) Limited, urging public vigilance.
BitMEX has changed the Mark Method for NILUSDTH25 and REDUSDTZ25 to Fair Price marking, effective March 25, 2025, enhancing price accuracy.
BitMEX introduces NILUSDT perpetual swaps, offering traders up to 50x leverage. This new listing enhances trading options on the platform.
Bitcoin remains vulnerable to downward pressure due to tight liquidity conditions and weak investor sentiment, with ETF outflows and cautious market behavior persisting.
Vodafone implements AI-driven solutions using LangChain and LangGraph to optimize data operations and improve performance metrics monitoring and information retrieval across its data centers.
BitMEX announces the introduction of NILUSDT perpetual swap listing, offering traders up to 50x leverage. The NIL token will be available for trading starting March 25, 2024.