Artificial IntelligenceCloud StorageDigitalizationPress Releases

OVHcloud Launches AI Endpoints to Simplify and Democratise Access to AI Models

A new serverless platform offering a library of open-source models designed to meet a wide range of business and industry use cases

OVHcloud today launches AI Endpoints, a serverless solution that enables developers to effortlessly add high-value AI functions to their apps. With over 40 advanced and powerful open-source LLM and generative AI models, addressing use cases such as chatbots, text-to-speech or coding assistance, AI Endpoints makes it easier to democratise AI whatever the size of the organisation. Without the need to manage the underlying infrastructure or requiring Machine Learning expertise, AI Endpoints enables easy access to open-source AI models hosted in a trusted Cloud environment.

AI ready to consume: easily level-up business applications

OVHcloud AI Endpoints enables developers to test AI features in a sandbox environment before deploying at scale in applications, internal tools, or business processes. Use cases include:

  • Integrating LLMs into applications: Easily add conversations through agents. With real-time, natural language interactions, AI Endpoints helps improve user engagement or automate customer service at scale.
  • Text Extraction: Advanced machine learning models automatically extract, interpret, and organise unstructured data, playing a key role in ETL (Extract, Transform, Load) pipelines to improve operational efficiency.
  • Bring speech to your app: Through APIs, the service converts spoken language into text and vice versa, enabling transcription and interactive voice-based transcriptions.
  • Coding assistance: With tools like Continue, developers can integrate private, real-time AI assistance directly into their IDEs. Code suggestions, error detection, task automation all improve both productivity and code quality.

A serverless platform designed to meet organisations’ need for strategic autonomy

OVHcloud’s sovereign cloud infrastructure gives peace of mind to developers, assuring them that data is hosted in Europe, and is protected against non-European regulations, providing both technical and strategic autonomy.

The cloud is central to AI and AI Endpoints runs on OVHcloud’s energy-efficient infrastructure, powered by water-cooled servers in ecofriendly data centres. This helps reduce the environmental impact of AI workloads, without compromising performance.

AI Endpoints promotes full model transparency through open weight AI models. This ensures that organisations can implement the same models on their infrastructure or deploy them on other Cloud services, whilst keeping control over data.

We are excited to launch AI Endpoints and are humbled by the incredible feedback we get from our amazing community. With support for the most diverse and sought after open source LLM models, AI Endpoints helps to democratise AI so developers can add to their apps the most cutting-edge models. Our solution enables them to do this easily in a trusted cloud environment with full confidence in OVHcloud’s sovereign infrastructure” said Yaniv Fdida, Chief Product and Technology Officer, OVHcloud.

Availability: AI in a pay-as-you-go model

After a preview phase, the service has been thoroughly developed and tested, incorporating customer feedback including a number of highly requested features such as support for stable open-source models, more choice and finer API key management. With more than 40 opensource cutting-edge AI models, AI Endpoints include:

  • LLMs: Llama 3.3 70B, Mixtral 8x7B, …
  • SLMs: Mistral Nemo, Llama 3.1 8B, …
  • Code: Qwen 2.5 Coder 32B, Codestral Mamba
  • Reasoning: DeepSeek-R1 (distilled Llama)
  • Multimodal: Qwen 2.5 VL 72B, …
  • Image generation: SDXL
  • Speech: ASR (speech-to-text), TTS (text-to-speech)

It is available now in APAC, Canada and Europe and deployed from the Gravelines data centre. With a pay-as-you-go model, OVHcloud AI Endpoints pricing varies on a per model basis using the number of tokens consumed per minute as a unit.

DSA Editorial

The region’s leading specialist IT news publication focused on Data Lifecycle, Storage Infrastructure and Data-Driven Transformation. DSA has nearly 17,000 e-news subscribers, over 6500 unique visitors per day, over 20,000 social media followers and a reputation for deep domain knowledge.

Related Articles

Leave a Reply

Your email address will not be published. Required fields are marked *