Cainew - Curated AI news for developers

TL;DR

Model Releases

Rio de Janeiro's city government model Rio3.5 beats Qwen3.7 in recent benchmarks

Tools & Products

Research Papers

Industry News

Model Releases

Rio de Janeiro's city government model Rio3.5 beats Qwen3.7 in recent benchmarks

Rio de Janeiro's city government AI model Rio3.5 has achieved higher benchmark scores compared to Qwen3.7 in recent performance tests. The results highlight competitive performance in model evaluation metrics.

Twitter

Tools & Products

omnigent-ai/omnigent

A meta-harness for all your AI agents. Omnigent provides a common layer over Claude Code, Codex, Pi, and the agents you write yourself: swap or combine harnesses without rewriting, keep them in check with policies and sandboxing, and collaborate in real time on the same live session, from any device.

GitHub

duolahypercho/fusion-fable

Fuse two frontier models into one Fable-tier answer: Opus 4.8 drafts, a second model (Opus 4.8 or GPT-5.5 via codex) checks, Opus fuses. A Claude Code skill.

GitHub

sdeonvacation/opencode-x

Open-source Claude Code alternative. Provider-agnostic, MIT licensed. Native Claude Code hook/plugin compatibility.

GitHub

Dong90/oh-my-taiyiforge

AI workflow automation plugin for intelligent code generation with Claude/Codex

GitHub

WhitzardAgent/AgentGuard

AgentGuard：An Attribute-Based Access Control Framework for Tool-Use LLM-Based Agent

GitHub

Slashy: The AI assistant that does email for you

Slashy is an AI-native email client and assistant that drafts replies in your voice, triages what matters, and makes sure no follow-up slips, so you spend less time in your inbox and more time on what matters. It connects to your email, calendar, CRM, and meeting notes and learns how you work, so you can ask Slashy to prep you for your next meeting, draft a follow-up, clear your inbox to zero, track who still owes you a reply, or fire off an email from iMessage or Slack while you're on the go.

ProductHunt

PolyHelper/polyhelper

Self-evolving cognitive AI exoskeleton. 10+ frontier models, 245 consensus methods, governed autonomous agents. Automotive, medical, legal, accessibility. 9.3M LOC, 205K tests. Open-source multi-model orchestration platform.

GitHub

Taste Lab: Extract any website's design DNA

Point your AI agent at any website. Get back a complete design breakdown — colors, type, spacing, and the reasoning behind every decision — ready to use in your next build.

ProductHunt

Memoriq: Your private AI memory for ChatGPT, Claude, Gemini and Grok

Memoriq is your private AI memory for ChatGPT, Claude, Gemini and Grok. Save the conversations that matter in an end-to-end encrypted vault that only you can access. Open source, self-hostable, and built for people who don't want to lose valuable AI chats or trust another plaintext cloud service. Search, organize, and keep your AI knowledge under your control.

ProductHunt

Research Papers

Don't trust large context windows

Recent analysis suggests that large context windows in language models may not be as reliable as previously thought, with models potentially struggling to effectively utilize information across very long input sequences. Users should exercise caution when relying on models to process and accurately refer to information from extended contexts.

RSS

Making Claude a Chemist

Anthropic has enhanced Claude to improve its chemistry capabilities, enabling the AI assistant to better assist with chemical research, analysis, and molecular design tasks. The upgrade expands Claude's utility for scientific and chemical applications.

Anthropic

Industry News

Meta’s chaotic AI strategy

Meta's approach to AI strategy has been characterized as chaotic, with inconsistent priorities, shifting investments, and unclear direction across different AI initiatives. The company's AI strategy lacks coherence and long-term strategic vision.

RSS

Police officer investigated for using AI to 'create evidence' in multiple cases

A police officer is under investigation for using AI systems to fabricate or manipulate evidence in multiple criminal cases. The incident raises serious concerns about the misuse of AI technology in law enforcement.

RSS

KPMG pulls report on AI usage due to apparent hallucinations

KPMG has withdrawn a report on AI usage after discovering the document contained significant hallucinations and inaccuracies generated by AI. The incident underscores the importance of verifying AI-generated content for accuracy.

RSS

State Attorneys General Are Investigating OpenAI

Multiple State Attorneys General have launched investigations into OpenAI over various compliance and consumer protection concerns. The investigations examine the company's business practices and regulatory adherence.

RSS

EU Commission looking at practical consequences of Anthropic decision

The EU Commission is evaluating the practical consequences and regulatory implications of Anthropic's recent strategic decisions. The review focuses on how these changes may affect competition and compliance within the European market.

RSS