AI - SDINFO

Implementation of IWE Content Bridge as an AI-Powered Knowledge Graph with Agentic RAG, OpenAI Function Calling, and Graph Traversal

March 27, 2026 by dardanvuc1996@gmail.com

In this lesson, we use YOU: an open source, scalable personal information management system that manages put-down notes as a floating information graph. Since IWE is a CLI/LSP tool designed for local programmers. We build a virtual developer’s knowledge base from scratch, connect wiki links and markup links into a directed graph, and walk through … Read more

openJiuwen Community Releases ‘JiuwenClaw’: A Self Evolving AI Agent for Task Management

March 27, 2026March 27, 2026 by dardanvuc1996@gmail.com

Over the past year, AI agents have evolved from answering questions to trying to perform real tasks. However, an important barrier has emerged: while many agents may seem smart during a conversation, they often ‘drop the ball’ when it comes to performing real-world tasks. Whether it’s an office workflow that breaks when needs change, or … Read more

Build an AI Assembly Summary with Claude Code + MCP

March 27, 2026March 27, 2026 by dardanvuc1996@gmail.com

Teams in all companies lose meeting notes and action items after discussions. This guide builds a permanent fix: Summary of the AI Assembly and Task Scheduler using Claude Code and MCP. It processes the transcripts into organized summaries with tasks, decisions, and calendar reminders, connects to Google Calendar and Gmail, and stores everything in SQLite. … Read more

Meta Releases TRIBE v2: A Brain Coding Model That Predicts fMRI Responses to Across Video, Audio, and Text Stimuli

March 27, 2026March 27, 2026 by dardanvuc1996@gmail.com

Neuroscience has long been a divide-and-conquer field. Researchers typically map specific cognitive functions to isolated brain regions—such as movement in area V5 or the surface of the fusiform gyrus—using models designed to summarize experimental paradigms. Although this has provided deep insight, the resulting field is fragmented, lacking a unified framework to explain how the human … Read more

Google Releases Gemini 3.1 Flash Live: A Multimodal Real-Time Voice Model for Low-Latency Audio, Video, and Instrumentation for AI Agents

March 27, 2026March 27, 2026 by dardanvuc1996@gmail.com

Google has released Gemini 3.1 Flash Live in developer preview with the Gemini Live API in Google AI Studio. This model aims for lower latency, more natural, and reliable real-time voice interaction, serving as ‘Google’s highest quality audio and speech model to date.’ By processing multimodal streams in nature, the release provides a technical basis … Read more

Code Implementation of Qwen3.5 Refined Reasoning Models with Claude-Style Reasoning Using GGUF and 4-Bit Quantization

March 26, 2026 by dardanvuc1996@gmail.com

In this tutorial, we work directly with Qwen3.5 models populated with Claude logic and set up a Colab pipeline that allows us to switch between the 27B GGUF variant and the lightweight 2B 4-bit version with one flag. We start by verifying the availability of the GPU, then conditionally install llama.cpp or transformers with bizandbytes, … Read more

Cohere AI Releases Cohere Transcribe: SOTA Automatic Speech Recognition (ASR) Model Powering Enterprise Speech Intelligence

March 26, 2026March 26, 2026 by dardanvuc1996@gmail.com

In the case of enterprise AI, the bridge between unstructured audio and physical text is often a bottleneck of proprietary APIs and complex pipelines. Today, Cohere—a company generally known for its text generation and embedding models—has officially entered the Automatic Speech Recognition (ASR) market with the release of its latest model ‘.Cohere Transcribe‘. Architecture: Why … Read more

Tencent AI Open Sources Covo-Audio: 7B Speech Language Model and Suggestive Line for Real-Time Audio Conversations and Consultations

March 26, 2026March 26, 2026 by dardanvuc1996@gmail.com

Tencent AI Lab has been released Covo-Audioparameter 7B-end-to-end Large Audio Language Model (LALM). The model is designed to integrate speech processing and language intelligence by directly processing continuous audio input and generating audio output within a single architecture. System Architecture The Covo-Audio framework consists of four main components designed for seamless cross-modal interoperability: Audio encoder: … Read more

How to Build a Vision-Driven Web Agent with MolmoWeb-4B Using Multimodal Reasoning and Action Prediction

March 25, 2026March 25, 2026 by dardanvuc1996@gmail.com

def parse_click_coords(action_str): “”” Extract normalised (x, y) coordinates from a click action string. e.g., ‘click(0.45, 0.32)’ -> (0.45, 0.32) Returns None if the action is not a click. “”” match = re.search(r”click(s*([d.]+)s*,s*([d.]+)s*)”, action_str) if match: return float(match.group(1)), float(match.group(2)) return None def parse_action_details(action_str): “”” Parse a MolmoWeb action string into a structured dict. Returns: {“type”: “click”, … Read more

RIP OpenClaw? Meet Claude Dispatch

March 25, 2026March 25, 2026 by dardanvuc1996@gmail.com

My biggest complaint with AI solutions is that they are too dependent on my presence in any activity. Even with agent AI now in the mix, complete automation of any complex process still seems like a myth. Tools like n8n and make.com require a lot of setup time and are not as efficient as traditional … Read more