AI - SDINFO

Tencent Open-Sources TencentDB Agent Memory: A 4-Tier Local Memory Pipeline for AI Agents

May 23, 2026 by dardanvuc1996@gmail.com

Tencent has released TencentDB Agent Memory, an open source memory system for AI agents. The project is available under the MIT license. It addresses a common problem for anyone deploying long-horizon agents: context bloat and recall failure. It does so by combining symbolic short-term memory with layered long-term memory. It also integrates with OpenClaw as a plugin and with Hermes Agent via … Read more

Build a SuperClaude Framework Workflow with commands, agents, modes, and runtime memory

May 23, 2026 by dardanvuc1996@gmail.com

class SuperClaude:
    """
    Mimics what Claude Code does at session start:
      • reads Markdown behavior files for the active command/agent/modes,
      • concatenates them into one system prompt,
      • runs the conversation through the Anthropic API.
    """
    BASE_SYSTEM = textwrap.dedent("""
        You are operating inside the SuperClaude Framework — a structured development platform layered on top of … Read more

Perplexity Open-Sources Bumblebee: A Read-Only Supply-Chain Scanner for Developer Endpoints

May 23, 2026 by dardanvuc1996@gmail.com

Attackers are increasingly targeting packages, editor extensions, and AI tool configurations on developer machines, not just production systems. Perplexity has open-sourced an internal tool that it uses to address this problem. Perplexity has released Bumblebee on GitHub. The tool is a read-only inventory collector for macOS and Linux endpoints for … Read more

Alibaba's New Agent-First Coding LLM

May 22, 2026 by dardanvuc1996@gmail.com

Alibaba’s Qwen team has launched Qwen3.7-Max, a premium model built for the agent era. Unlike traditional chatbot-focused LLMs, it is designed as a foundation for autonomous AI agents that can code, debug, use tools, manage workflows, and carry out long-running business operations. Alibaba claims the model can operate autonomously for up to 35 hours without performance … Read more

Microsoft Releases Fara1.5: A Family of Browser Computer-Use Agents (4B/9B/27B) That Beats OpenAI Operator and Gemini 2.5 Computer Use on Online-Mind2Web

May 22, 2026 by dardanvuc1996@gmail.com

Microsoft Research’s AI Frontiers lab has released Fara1.5, a family of computer use agent (CUA) models for the browser. The release ships in three sizes: Fara1.5-4B, Fara1.5-9B, and Fara1.5-27B. The models are integrated with MagenticLite, Microsoft’s sandboxed browser interface for these agents. Computer use agents are pixel-to-action models that drive a real browser. They … Read more

Cohere Releases Command A+: 218B Sparse MoE Model for Agentic Workflow Running on Two H100 GPUs

May 21, 2026 by dardanvuc1996@gmail.com

Cohere recently released Command A+, an open source model that streamlines enterprise agent workflows. Available under the Apache 2.0 license, Command A+ is a mixture-of-experts (MoE) model designed for high-performance agent tasks with minimal compute overhead. The model is designed for reasoning, agent workflows, RAG, multilingual use, and heterogeneous document processing. It combines capabilities from … Read more

AI Agents Enter Google Search

May 21, 2026 by dardanvuc1996@gmail.com

Google Search recently went from being an encyclopedia to an assistant. That’s the gist of everything Google announced at its latest I/O 2026 conference. The buzzword is “AI agents”, now entering Google Search, its coding platforms, and even a new autonomous system. The idea is to move from AI that answers to AI that actually … Read more

Meet Turbovec: A Rust Vector Index with Python Bindings, Built on Google’s TurboQuant Algorithm

May 20, 2026 by dardanvuc1996@gmail.com

Vector search supports many retrieval-augmented generation (RAG) pipelines. At scale, it’s expensive. Storing 10 million document embeddings in float32 consumes 31 GB of RAM. For dev teams that run local or on-premises inference, that number creates real issues. A new open source library called turbovec addresses this directly. It’s a vector index written … Read more
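The 31 GB figure can be sanity-checked with simple arithmetic. The sketch below assumes 768-dimensional embeddings, which the excerpt does not state; the helper name `embedding_memory_gb` is hypothetical and is not part of turbovec.

```python
def embedding_memory_gb(num_vectors: int, dim: int, bytes_per_value: int = 4) -> float:
    """Raw storage for dense embeddings in decimal gigabytes (1 GB = 1e9 bytes).

    bytes_per_value defaults to 4 because a float32 occupies 4 bytes.
    Index overhead and metadata are not counted.
    """
    return num_vectors * dim * bytes_per_value / 1e9


# Assumption: 768 dimensions per embedding (not given in the excerpt).
size_gb = embedding_memory_gb(10_000_000, 768)
print(f"{size_gb:.2f} GB")  # prints "30.72 GB", which rounds to the quoted 31 GB
```

Under that assumption the raw vectors alone account for the quoted number, before any index structures are added.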

Alibaba Qwen Team Introduces Qwen3.5-LiveTranslate-Flash: Real-Time Multimodal Interpretation Across 60 Languages with 2.8-Second Latency

May 20, 2026 by dardanvuc1996@gmail.com

Simultaneous translation is one of the hardest problems in applied AI. You ask the model to translate speech before the speaker finishes the sentence. Every additional second of delay breaks the illusion of real-time communication. Alibaba’s Qwen team has been improving on this with each release. Their latest model, Qwen3.5-LiveTranslate-Flash, brings that latency down … Read more

Google’s Hands-on Review of High-Speed AI

May 20, 2026 by dardanvuc1996@gmail.com

The next generation of Google’s Gemini family is here: Gemini 3.5! Gemini 3.5 Flash combines cutting-edge intelligence with real-world action and supports high-speed agent workflows, coding, and multimodal reasoning while maintaining the low latency expected of the Flash series. With Gemini 3.5 Pro scheduled for release next month, let’s take a look at Flash … Read more