Cainew - Curated AI news for developers

May 10, 2026 Weekly

TL;DR

Model Releases

Supertone/supertonic-3

Tools & Products

Research Papers

Tutorials

Using Claude Code: The unreasonable effectiveness of HTML

Industry News

Discussion

Model Releases

Supertone/supertonic-3

HuggingFace

Tools & Products

op7418/guizang-ppt-skill

A Claude Code Skill that turns prompts into horizontal-swipe magazine-style HTML decks — 10 layouts, 5 curated themes, WebGL hero backgrounds, single-file output.

GitHub

EKKOLearnAI/hermes-web-ui

Web dashboard for Hermes Agent — multi-platform AI chat, session management, scheduled jobs, usage analytics

GitHub

cosmicstack-labs/mercury-agent

Soul-driven AI agent with permission-hardened tools, token budgets, and multi-channel access. Runs 24/7 from CLI or Telegram.

GitHub

ciembor/agent-rules-books

AGENTS.md rules / skills for AI coding agents: Codex, Cursor & Claude Code. Inspired by Clean Code, Refactoring, DDD, Clean Architecture and DDIA programming books.

GitHub

eight-acres-lab/skillplus

A compilable skill package standard for reliable content-generation agents.

GitHub

sylearn/AIUsage

One dashboard for all your AI subscriptions — quotas, costs, accounts, and Claude Code proxy.

GitHub

Gemini API File Search is now multimodal

Google's Gemini API File Search feature is now multimodal, letting users search across both text and image content within files for more comprehensive retrieval results.

RSS

Adject 2.0: Create hyperrealistic product visuals with AI

Adject 2.0 is an agentic product studio where brands can create, edit, and iterate product visuals inside an infinite creative workflow. Instead of isolated generations, products, models, edits, videos, and assets stay connected inside projects and evolve continuously over time. Upload once, generate in context, iterate visually, and build complete campaigns without fragmented tools or repetitive prompting.

ProductHunt

Notion 3.4: New dashboards, connectors, sidebar & smarter AI agents

Notion now generates images, builds dashboards, and presents your docs without leaving the workspace. 10+ features shipped for operators, PMs, and teams scaling on AI.

ProductHunt

Keel: An AI assistant whose memory belongs to you

Keel is a local-first desktop app for macOS and Windows that captures what matters from your conversations into plain markdown files on your disk. Bring your own model - Claude, GPT, OpenRouter, or Ollama. Your context stays with you, not the vendor. Open source, MIT. https://keel-labs.github.io/

ProductHunt

oritera/Cairn

A general-purpose AI state-space search engine, validated first on autonomous penetration testing.

GitHub

reacher-z/ClawBench

Open-source benchmark for browser AI agents on 153 everyday online tasks across 144 live websites. 5-layer recording + DOM-match + LLM judge. Top score 33.3%.

GitHub

OpenRaiser/ProDa

📖 Data Engineering from Raw Corpora

GitHub

oliviazzzu/minimal-embodiment

A minimal hardware-software architecture giving large language models a closed-loop physical embodiment with self-perception loops.

GitHub

lthoangg/OpenAgentd

Self-hosted AI agent OS — streaming chat, tool use, persistent memory, and multi-agent teams. Runs entirely on your machine.

GitHub

Research Papers

StraTA: Incentivizing Agentic Reinforcement Learning with Strategic Trajectory Abstraction

Large language models (LLMs) are increasingly used as interactive agents, but optimizing them for long-horizon decision making remains difficult because current methods are largely reactive, which weakens both exploration and credit assignment over extended trajectories. In this work, we present Strategic Trajectory Abstraction (StraTA), a simple framework that introduces an explicit trajectory-level strategy into agentic reinforcement learning (RL). StraTA samples a compact strategy from...

HuggingFace

GeoStack: A Framework for Quasi-Abelian Knowledge Composition in VLMs

We address the challenge of knowledge composition in Vision-Language Models (VLMs), where accumulating expertise across multiple domains or tasks typically leads to catastrophic forgetting. We introduce GeoStack (Geometric Stacking), a modular framework that allows independently trained domain experts to be composed into a unified model. By imposing geometric and structural constraints on the adapter manifold, GeoStack ensures the foundational knowledge of the base model is preserved. Furthermor...

HuggingFace

Killswitch: Per-function short-circuit mitigation primitive

Killswitch is a per-function short-circuit mitigation primitive that gives developers granular control over function execution and error handling, so safety mechanisms can be applied at the level of individual functions.

RSS
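
As a rough illustration of the general idea only, the sketch below shows what a per-function short-circuit could look like in Python: a guarded function returns a fallback value instead of running once its name has been disabled. The `KillSwitch` class, its `guard` decorator, and the `parse_length` function are hypothetical names invented for this example; they are not taken from the linked post, whose actual mechanism may differ substantially.

```python
import functools
import threading


class KillSwitch:
    """Hypothetical registry of disabled function names (illustration only)."""

    def __init__(self):
        self._disabled = set()
        self._lock = threading.Lock()

    def disable(self, name):
        # Mark a function as short-circuited.
        with self._lock:
            self._disabled.add(name)

    def enable(self, name):
        # Restore normal execution for a function.
        with self._lock:
            self._disabled.discard(name)

    def is_disabled(self, name):
        with self._lock:
            return name in self._disabled

    def guard(self, fallback=None):
        """Decorator: return `fallback` instead of calling the function when disabled."""
        def decorator(func):
            @functools.wraps(func)
            def wrapper(*args, **kwargs):
                if self.is_disabled(func.__name__):
                    return fallback  # short-circuit: the function body never runs
                return func(*args, **kwargs)
            return wrapper
        return decorator


switch = KillSwitch()


@switch.guard(fallback=-1)
def parse_length(payload):
    # Stand-in for a function you might want to switch off at runtime.
    return len(payload)
```

Calling `switch.disable("parse_length")` makes later calls return the fallback until `switch.enable("parse_length")` is called. Keying on the function name keeps the sketch short; a real mitigation layer would likely need a more robust identifier.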

Tutorials

Using Claude Code: The unreasonable effectiveness of HTML

This article explores how HTML is surprisingly effective for code generation and manipulation when working with Claude, demonstrating the practical value of leveraging web standards in AI-assisted development.

Twitter

Industry News

Chrome's AI features may be hogging 4GB of your computer storage

Google Chrome's new AI-powered features may consume up to 4GB of storage space on users' computers, raising concerns about resource usage and storage management for those with limited disk space.

RSS

Google Cloud Fraud Defence is just WEI repackaged

Google Cloud's Fraud Defence tool is criticized as essentially a repackaged version of Web Environment Integrity (WEI), raising questions about the originality of the offering.

RSS

Apple, Intel have reached preliminary chip-making deal

Apple and Intel have announced a preliminary chip-making agreement, signaling a potentially deeper manufacturing partnership between the two tech giants.

RSS

Discussion

Task Paralysis and AI

Task paralysis refers to the difficulty some users experience when deciding how to best leverage AI systems for complex tasks. Understanding how to break down objectives and properly prompt AI models can help overcome this cognitive barrier.

RSS

LLMs corrupt your documents when you delegate

The article warns that delegating document processing tasks to large language models can result in data corruption and integrity issues, highlighting risks in relying on LLMs for sensitive document handling without proper safeguards.

ArXiv