New Data Pricing Paradigm: Token-Level Quality Assessment Reshapes LLM Training Data Valuation
A latest arXiv paper proposes a utility-based dynamic data valuation framework that starts from token-level information …
Latest articles in Research
A latest arXiv paper proposes a utility-based dynamic data valuation framework that starts from token-level information …
A latest arXiv paper proposes the E²-CRF method, leveraging two key structural properties — spectral localization and mi…
A research team has released a physics-informed, high-fidelity co-simulation system for aircraft main fuel pumps, provid…
A new study proposes a multi-fidelity surrogate model approach that leverages AI to fuse simulation data of varying accu…
A new arXiv study proposes a PM2.5 fusion system combining LightGBM, spatial cross-validation, and conformal prediction.…
A latest arXiv paper proposes "Stochastic KV Routing" technology, enabling adaptive KV cache sharing across the depth di…
A research team has proposed the AutoCompress method, discovering that Layer 0 in small Transformers carries over 60 tim…
New research challenges the widespread assumption that "parameter efficient means memory efficient," revealing that whil…
New neuroscience research has uncovered an entirely new neural plasticity mechanism that allows the brain to complete sy…
A new study systematically tracks the singular value spectrum evolution of weight matrices during Transformer pre-traini…
Researchers propose the KARL framework, a knowledge-boundary-aware reinforcement learning approach that enables large la…
Researchers propose the BiTA framework, which integrates Bidirectional GRU with a Transformer Aggregator into Temporal G…