New Research Proposes Test-Time Safety Alignment Method for Large Language Models
A latest arXiv paper explores using input word embeddings as control variables to achieve safety alignment of large lang…
Latest articles in Research
A latest arXiv paper explores using input word embeddings as control variables to achieve safety alignment of large lang…
A latest arXiv paper introduces the EvoSelect framework, which achieves efficient iterative evolution of large language …
Researchers propose the HIVE framework, which detects hallucinations in diffusion large language models by extracting co…
A new paper on arXiv proposes a novel approach based on Neural Cellular Automata (NCA) and discrete bottlenecks that ach…
A latest arXiv paper argues that while content generated by large language models often outperforms human work in evalua…
A latest arXiv paper proposes the CogRAG+ framework, which systematically diagnoses and repairs memory and reasoning def…
A new paper introduces the phenomenon of "Anchored Confabulation": providing large language models with partial intermed…
A new paper introduces the UniMatrix architecture family, which leverages structured recurrent states as a compact assoc…
A new study on arXiv proposes training Computer Use Agents (CUAs) to automatically evaluate graphical user interface usa…
A Spanish research team leveraged two general-purpose large language models — Gemini 1.5 Pro and Mistral-small — to extr…
Researchers propose BioGraphletQA, a QA data generation framework anchored in small knowledge graph subgraphs (graphlets…
A research team has released the MATH-PT dataset, a mathematical reasoning evaluation benchmark specifically designed fo…