Modelserve: Golem Network's New AI Inference Service

Joerg Hiller, Jul 15, 2024 15:19

Golem Network has unveiled Modelserve, a new service for scalable and affordable AI model inference, according to a recent announcement by the Golem Project. The service lets users deploy AI models and run inference through scalable endpoints, with the goal of making AI applications more efficient and cost-effective.

What Is Modelserve?

Modelserve was developed by Golem Factory in collaboration with an external team and integrates into the Golem Network ecosystem. It aims to support the open-source AI community and to attract developers of AI applications, whose workloads create demand for the network's GPU providers. Models are deployed and queried through scalable endpoints, which is meant to keep AI applications efficient and cost-effective to operate.

Why Is Golem Network Introducing Modelserve?

Modelserve is intended to meet the growing demand for computing power in the AI industry. It relies on consumer-grade GPUs, which offer sufficient power and memory to run AI models such as diffusion models, automatic speech recognition, and small to medium language models. According to the announcement, this approach is more cost-effective than traditional methods. The decentralized architecture of the Golem Network serves as a marketplace that matches supply and demand for these resources, giving users access to computing power that is well suited to AI applications.
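To illustrate why small to medium models can fit on consumer-grade GPUs, the following minimal sketch estimates the memory needed for model weights alone as parameter count times bytes per parameter. The function names, the 24 GB card, and the 0.8 headroom factor are illustrative assumptions and are not taken from the Golem announcement; real deployments also need memory for activations and runtime overhead.

```python
# Rough weight-memory estimate. It ignores activations, caches, and
# framework overhead, so treat the result as a lower bound.
BYTES_PER_PARAM = {"fp32": 4, "fp16": 2, "int8": 1}


def weight_memory_gb(num_params: float, dtype: str = "fp16") -> float:
    """Return the memory needed for the weights alone, in GB (10**9 bytes)."""
    return num_params * BYTES_PER_PARAM[dtype] / 1e9


def fits_on_gpu(num_params: float, vram_gb: float, dtype: str = "fp16",
                headroom: float = 0.8) -> bool:
    """True if the weights fit within a fraction (headroom) of GPU memory.

    The headroom value is an assumption that reserves space for
    activations and runtime overhead.
    """
    return weight_memory_gb(num_params, dtype) <= vram_gb * headroom


if __name__ == "__main__":
    # Hypothetical example: a 7-billion-parameter model on a 24 GB card.
    print(weight_memory_gb(7e9, "fp16"))  # 14.0
    print(fits_on_gpu(7e9, 24))           # True
    print(fits_on_gpu(70e9, 24))          # False
```

Under these assumptions, a 7-billion-parameter model in 16-bit precision needs about 14 GB for its weights, while a 70-billion-parameter model would not fit on a single card of that size.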

The addition of Modelserve to the Golem ecosystem plays a key role in bringing AI use cases to the network, driving demand for providers and contributing to broader adoption of the Golem Network.

Target Audience

Modelserve is designed for a diverse range of users, including service and product developers, startups, and companies operating in both Web 2.0 and Web 3.0 environments. These users typically:

Utilize small and medium-sized open-source models or create their own models from scratch
Require scalable AI model inference capabilities
Seek an environment to test and experiment with AI models

Technical Implementation

Modelserve comprises three key components:

Website: Allows users to create and manage endpoints
Backend: Manages the GPU resources that handle inference requests and includes a load balancer and auto-scaling. It sources GPUs from Golem's open, decentralized marketplace and from other platforms offering GPU instances
API: Lets users run AI model inference and manage endpoints
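The announcement does not describe how the load balancer or auto-scaling work internally. As a rough illustration of the two ideas, the sketch below routes each request to the worker with the fewest requests in flight and computes a target worker count from the current queue length. All names (Worker, Endpoint, desired_workers) and all limits are hypothetical and do not reflect Modelserve's actual implementation.

```python
import math
from dataclasses import dataclass, field


@dataclass
class Worker:
    """A hypothetical GPU worker serving one endpoint."""
    name: str
    in_flight: int = 0  # requests currently being processed


@dataclass
class Endpoint:
    """A hypothetical endpoint backed by a pool of workers."""
    workers: list[Worker] = field(default_factory=list)

    def route(self) -> Worker:
        """Least-outstanding-requests balancing: pick the least busy worker."""
        worker = min(self.workers, key=lambda w: w.in_flight)
        worker.in_flight += 1
        return worker


def desired_workers(queued_requests: int, per_worker_capacity: int,
                    min_workers: int = 1, max_workers: int = 10) -> int:
    """Target worker count so no worker exceeds per_worker_capacity requests,
    clamped to the range [min_workers, max_workers]."""
    needed = math.ceil(queued_requests / per_worker_capacity)
    return max(min_workers, min(max_workers, needed))


if __name__ == "__main__":
    endpoint = Endpoint([Worker("gpu-a"), Worker("gpu-b")])
    print([endpoint.route().name for _ in range(3)])  # ['gpu-a', 'gpu-b', 'gpu-a']
    print(desired_workers(25, per_worker_capacity=4))  # 7
```

A production system would also decrement the in-flight count when a request finishes and would smooth scaling decisions over time; both are omitted here for brevity.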

Users pay for the service in USD, while settlements with Golem GPU providers are made in GLM, the native token of the Golem Network.

Benefits for Users

Maintenance-Free AI Infrastructure (AI IaaS): Users do not need to manage model deployment, inference, or GPU clusters as Modelserve handles these tasks
Affordable Autoscaling: The system automatically scales GPU resources to meet application demands without requiring user intervention
Cost-Effective Pricing: Users are charged based on the actual processing time of their requests, avoiding the costs associated with hourly GPU rentals or maintaining their own clusters
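The difference between paying for processing time and renting a GPU by the hour can be worked through with simple arithmetic. The sketch below is only an illustration: the prices, request counts, and function names are hypothetical assumptions and are not Modelserve's actual rates.

```python
def per_request_cost(num_requests: int, seconds_per_request: float,
                     price_per_gpu_second: float) -> float:
    """Cost when billed only for the processing time of each request."""
    return num_requests * seconds_per_request * price_per_gpu_second


def hourly_rental_cost(hours_rented: float, price_per_gpu_hour: float) -> float:
    """Cost when a GPU is rented by the hour, whether it is busy or idle."""
    return hours_rented * price_per_gpu_hour


def break_even_utilization(price_per_gpu_second: float,
                           price_per_gpu_hour: float) -> float:
    """Fraction of each hour a rented GPU must be busy before hourly
    rental becomes cheaper than paying per second of processing."""
    return price_per_gpu_hour / (3600 * price_per_gpu_second)


if __name__ == "__main__":
    # Hypothetical day: 1,000 requests of 2 seconds each.
    usage = per_request_cost(1000, 2.0, price_per_gpu_second=0.0005)
    rental = hourly_rental_cost(24, price_per_gpu_hour=0.50)
    print(f"pay per processing time: ${usage:.2f}")   # $1.00
    print(f"24-hour GPU rental:      ${rental:.2f}")  # $12.00
    print(f"break-even utilization:  {break_even_utilization(0.0005, 0.50):.0%}")  # 28%
```

With these assumed numbers the GPU is busy for only 2,000 of 86,400 seconds, so usage-based billing is far cheaper; a workload that keeps a GPU busy above the break-even fraction would favor hourly rental instead.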

Synergy with Other AI/GPU Projects

Modelserve integrates with GamerHash AI, a GPU and AI provider, which is currently in the proof-of-concept stage. Additionally, the first version of Golem-Workers was created as part of Modelserve; Golem-Workers will be developed as a separate project in the future.

Milestones and Next Steps

Beta tests have been conducted with several AI-based startups and companies
The Golem Community Tests are scheduled for July
Commercialization of the service is set to begin in August

For more detailed information, visit the Golem Project blog.
