
Chrome's New 'Skills' Turn AI Prompts Into One-Click Tools
Google Chrome introduces 'Skills', allowing users to save and remix AI workflows for instant reuse. This feature democratizes AI tool creation for non-developers.
All AI stories, newest first.

Google Chrome introduces 'Skills', allowing users to save and remix AI workflows for instant reuse. This feature democratizes AI tool creation for non-developers.

Major tech companies are accelerating their adoption of post-quantum cryptography, but progress remains uneven. This shift is crucial to protect against future quantum computing threats.

Anthropic's tightly controlled rollout of Claude Mythos has been compromised, with unauthorized users gaining access. This incident contradicts the company's claims about the model's security capabilities and potential dangers.

A new arXiv paper outlines the architecture for an AI-based automated Course of Action (CoA) planning system, essential for modern military operations. This system addresses the challenges of increasing maneuver speeds and expanding operational areas in warfare.

Researchers introduce a framework that dynamically allocates compute resources and adapts generation strategies during inference. The method outperforms static approaches by focusing computation on challenging queries.

A new wiki system allows AI agents to maintain and access knowledge in Markdown files stored in a Git repository. This approach avoids complex databases and enables portable, version-controlled knowledge sharing.

Researchers introduce ZeroFolio, a method for algorithm selection using pretrained text embeddings instead of hand-crafted features. This approach eliminates the need for domain knowledge or task-specific training.

Researchers introduce TRACES, a method to tag and analyze reasoning steps in Language Reasoning Models (LRMs). This approach aims to reduce inefficiencies and improve the accuracy of model outputs.

ThermoQA is a new benchmark for evaluating LLMs on engineering thermodynamics problems. It shows significant performance drops as problem complexity increases, with top models like Claude Opus 4.6 leading at 94.1%.

A comprehensive guide for running large language models on a 64GB RAM device has been released. It covers practical tips for optimizing performance in code and math applications.

Graeme (@gkisokay) shares a curated list of powerful local LLMs that run efficiently on 32GB RAM machines. This opens up flagship-class models to a wider range of users.

Alibaba has released Qwen3.6-27B, an open-source model with 27 billion parameters that excels in coding tasks, surpassing its larger predecessor. This model demonstrates significant advancements in agentic coding capabilities.

Hugging Face introduces QIMMA, a quality-focused leaderboard for Arabic LLMs. It aims to highlight models that excel in both performance and cultural relevance.

OpenCLAW-P2P v6.0 enhances decentralized AI peer review with multi-layer persistence and live reference verification. This update strengthens the platform's ability to handle production-scale evaluations without human intervention.

OpenAI has introduced GPT-5.5, designed to handle complex tasks and power AI agents. It represents a significant leap in AI capabilities for real-world applications.

OpenAI has released GPT-5.5 and GPT-5.5 Pro, offering improved performance and new features. The models are now available through the API, expanding access for developers.

Researchers have identified a pervasive phenomenon called 'tool overuse' in large language models, where they unnecessarily rely on external tools instead of internal knowledge. The study explores the underlying mechanisms behind this behavior, highlighting a 'knowledge epistemic illusion' where models misjudge their own capabilities.

Researchers propose hierarchical policy optimization to improve simultaneous speech translation (SST) efficiency. The method leverages LLM KV cache reuse, reducing computational overhead without requiring extensive dialogue annotations.

The US military's rapid targeting during the Iran assault highlights AI's transformative role. Project Maven's success has reshaped defense strategies and procurement.

GitHub Copilot now integrates GPT-5.5, enhancing code completion and debugging capabilities. This marks a significant leap in AI-assisted development tools.