Hugging Face Unveils Low-Latency Inference Endpoints
Hugging Face launches new inference endpoints optimized for real-time AI apps, reducing latency by up to 50% for develop…
30 articles about 'hugging face'
Hugging Face launches new inference endpoints optimized for real-time AI apps, reducing latency by up to 50% for develop…
Hugging Face launches open-source tools to streamline dataset cleaning and preparation for AI researchers.
Hugging Face partners with AWS to offer dedicated inference clusters, simplifying large model deployment for enterprises…
Hugging Face launches a new optimized inference engine that significantly reduces latency for open-source models, boosti…
Hugging Face unveils Inference Endpoints V2, enabling global custom model deployments with enhanced scalability and redu…
Hugging Face launches a new platform for collaborative open-source AI model development, aiming to democratize access.
Hugging Face launches a new open-source model hub infrastructure to accelerate AI development and reduce latency for glo…
Hugging Face and Intel partner to optimize open-source models for local execution on Intel hardware.
Hugging Face releases a massive 1 trillion token multilingual training dataset under an open license, democratizing larg…
Hugging Face now hosts over 2 million open-source AI models, cementing its position as the world's largest AI model repo…
Hugging Face releases a 405B parameter open-source model that matches GPT-4 and Claude on key benchmarks, reshaping the …
Hugging Face releases SmolLM 3, a compact language model designed to run efficiently on smartphones, IoT devices, and la…