Hugging Face Unveils New Dataset Curation Tools
Hugging Face launches open-source tools to streamline dataset cleaning and preparation for AI researchers.
15 articles about 'dataset'
Datasette Agent 0.1a0 launches with MicroPython sandboxing, resisting GPT-5.5 jailbreaks.
Zhiyuan releases the Agibot World 2026 dataset, the first open-source collection focusing on rich physical interactions …
AI costs rise as Anthropic dominates May. Model releases disappoint while Datasette Agent launches for developers.
Simon Willison's datasette-llm plugin gets a key upgrade with configurable default options for specific LLM models.
Ai2 releases OLMo 2, a truly open-source large language model with full access to training data, code, weights, and logs…
Amazon QuickSight's Dataset Q&A feature uses generative AI to let business users ask natural language questions directly…
AWS expands natural language querying in QuickSight with Dataset Q&A, enabling users to query structured data conversati…
Researchers propose BioGraphletQA, a QA data generation framework anchored in small knowledge graph subgraphs (graphlets…
A new arXiv survey systematically reviews datasets, benchmarks, and data engines for Vision-Language-Action models in ro…
A research team has released BifDet, the first annotated dataset dedicated to 3D bifurcation detection in airway trees, …
A research team has released ParkingScenes, a structured dataset specifically designed for end-to-end autonomous parking…