Archive

All AI stories, newest first.

research · Apr 9, 2026

SELFDOUBT Uncertainty Quantification

SELFDOUBT is a new framework for uncertainty quantification in reasoning language models. It addresses the difficulty of deploying uncertainty estimation in practice, particularly for proprietary APIs.

via ArXiv cs.AI · #uncertainty #quantification #language-models

research · Apr 9, 2026

ReVEL: LLM-Guided Heuristic Evolution

ReVEL is a hybrid framework that uses large language models for iterative reasoning in combinatorial optimization. It embeds LLMs within an evolutionary algorithm to improve heuristic design.

via ArXiv cs.AI · #llm #heuristics #evolutionary-algorithm

research · Apr 9, 2026

Rethinking Generalization in Reasoning SFT

Researchers challenge the notion that supervised fine-tuning memorizes while reinforcement learning generalizes. They find that cross-domain generalization is conditional on optimization, data, and model capability, contrary to prevailing narratives in LLM post-training.

via ArXiv cs.AI · #llm #sft #generalization

research · Apr 9, 2026

Reasoning Fails in Large Models

Large reasoning models perform well on multi-step tasks, but their behavior is unstable. Step-Saliency analysis reveals information-flow failures.

via ArXiv cs.AI · #reasoning #large-models #analysis

industry · Apr 9, 2026

Quantum Computers Threaten Encryption

Quantum computers can break vital encryption with fewer resources than previously thought. This increases the threat to elliptic curve cryptosystems.

via Ars Technica AI · #quantum #encryption #security

research · Apr 9, 2026

ProofSketcher: Hybrid LLM for Reliable Math Logic

ProofSketcher combines LLMs with a lightweight proof checker for reliable math and logic reasoning. It aims to address the limitations of LLMs in producing persuasive but flawed arguments.

via ArXiv cs.AI · #llm #proof-checker #math-logic

research · Apr 9, 2026

Pramana: Fine-Tuning LLMs for Epistemic Reasoning

Pramana is a novel approach to fine-tuning large language models for epistemic reasoning. It aims to address the epistemic gap in AI, where models struggle with systematic reasoning and often produce unfounded claims.

via ArXiv cs.AI · #llms #epistemic-reasoning #navya-nyaya

research · Apr 9, 2026

Pramana Fine-Tunes LLMs

Pramana is a novel approach that teaches large language models explicit epistemological methods to improve their reasoning. It aims to address the epistemic gap in AI, where models struggle with systematic reasoning and often produce unfounded claims.

via ArXiv cs.AI · #llms #epistemology #navya-nyaya

research · Apr 9, 2026

Pramana: Enhancing LLMs with Epistemic Reasoning

Pramana is a novel approach to fine-tuning large language models for epistemic reasoning. It aims to address the epistemic gap in LLMs, enabling them to ground claims in traceable evidence.

via ArXiv cs.AI · #llms #epistemic-reasoning #pramana

research · Apr 9, 2026

Pramana: Enhancing LLMs for Epistemic Reasoning

Pramana is a novel approach to fine-tuning large language models for epistemic reasoning. It aims to address the epistemic gap in LLMs, enabling them to ground claims in traceable evidence.

via ArXiv cs.AI · #llms #epistemic-reasoning #navya-nyaya

research · Apr 9, 2026

PaperOrchestra Automates AI Research Paper Writing

PaperOrchestra is a multi-agent framework that automates AI research paper writing. It transforms unstructured materials into submission-ready manuscripts, including literature synthesis and generated visuals.

via ArXiv cs.AI · #ai #research #automation

research · Apr 9, 2026

PaperOrchestra: AI Research Paper Writing

PaperOrchestra is a multi-agent framework for automated AI research paper writing. It transforms pre-writing materials into submission-ready manuscripts, including literature synthesis and generated visuals.

via ArXiv cs.AI · #ai #research #writing

models · Apr 9, 2026

OpenAI Raises $122B

OpenAI secures $122 billion in funding to expand AI globally. The investment will boost next-gen compute and meet growing demand for ChatGPT and enterprise AI.

via OpenAI Blog · #funding #ai-development #compute

industry · Apr 9, 2026

Nvidia GPUs Vulnerable to Rowhammer Attacks

New attacks exploit GPU memory to hijack CPUs. GDDRHammer, GeForge, and GPUBreach give attackers complete control.

via Ars Technica AI · #gpu #security #vulnerability

research · Apr 9, 2026

Noncommutativity in Metacognitive Judgments

Researchers introduce a framework to study operational noncommutativity in sequential metacognitive judgments. This work explores how order effects impact cognitive processes.

via ArXiv cs.AI · #metacognition #noncommutativity #cognitive-processes

open-source · Apr 9, 2026

mRNA Models Trained Across 25 Species

Researchers trained mRNA language models across 25 species for $165. The work has implications for both bioinformatics and natural language processing.

via Hugging Face Blog · #mRNA #language-models #bioinformatics

research · Apr 9, 2026

MMORF Framework

MMORF is a framework for designing multi-objective retrosynthesis planning systems. It leverages language model-based multi-agent systems to balance quality, safety, and cost objectives.

via ArXiv cs.AI · #chemistry #multi-agent #retrosynthesis

research · Apr 9, 2026

MMORF Framework for Multi-objective Retrosynthesis

MMORF is a framework for designing multi-objective retrosynthesis planning systems. It leverages language model-based multi-agent systems to balance quality, safety, and cost objectives.

via ArXiv cs.AI · #chemistry #multi-agent #retrosynthesis

research · Apr 9, 2026

MMORF Framework Advances Multi-objective Retrosynthesis

MMORF is a framework for designing multi-objective retrosynthesis planning systems. It leverages language model-based multi-agent systems to balance quality, safety, and cost objectives.

via ArXiv cs.AI · #chemistry #retrosynthesis #multi-agent-systems

industry · Apr 9, 2026

Meta Launches Muse Spark AI Model

Meta is reentering the AI race with Muse Spark, its first new model since overhauling AI efforts. The model will power various Meta platforms.

via The Verge AI · #meta #ai-model #muse-spark
