Poolside AI Introduces Laguna XS.2 and M.1: Agentic Code Models Up to 68.2% and 72.5% in SWE-bench Verified

Poolside AI Introduces Laguna XS.2 and M.1: Agentic Code Models Up to 68.2% and 72.5% in SWE-bench Verified

Poolside AI has released the first two models in its Laguna family: Laguna M.1 again Laguna XS.2. Along with this, the company releases lake — a lightweight terminal code agent and dual Agent Client (ACP) server — the same environment Poolside uses internally for RL agent training and testing, now available as a research preview. … Read more

OpenAI Releases Privacy Filter: A 1.5B-Parameter Open-Source PII Redaction Model with 50M Valid Parameters

OpenAI Releases Privacy Filter: A 1.5B-Parameter Open-Source PII Redaction Model with 50M Valid Parameters

OpenAI just quietly dropped something worth paying attention to. Released from Hugging Face under the Apache 2.0 license, Privacy Filter is an open, bi-directional tokenization model, designed to identify and reconstruct personally identifiable information (PII) in text. It’s small enough to run in a web browser or on a laptop and fast enough for high-end … Read more

How to Build a Visual-Language-Action-Inspired Composite Agent with Latent World Modeling and Predictive Control Modeling

How to Build a Visual-Language-Action-Inspired Composite Agent with Latent World Modeling and Predictive Control Modeling

import random, numpy as np, torch, torch.nn as nn, torch.nn.functional as F import matplotlib.pyplot as plt from dataclasses import dataclass from typing import Tuple, Dict, List from torch.utils.data import Dataset, DataLoader try: from tqdm.auto import tqdm except Exception: def tqdm(x, **kwargs): return x SEED = 7 random.seed(SEED); np.random.seed(SEED); torch.manual_seed(SEED) if device.type == “cuda”: torch.backends.cudnn.benchmark = … Read more

Meet the Talkie-1930: An Open-Weight 13B LLM in Pre-1931 English Literature for Historical Consulting and General Research

Meet the Talkie-1930: An Open-Weight 13B LLM in Pre-1931 English Literature for Historical Consulting and General Research

What if the language model has never heard of the Internet, smartphones, or World War II? That’s not a myth – that’s exactly what the research team led by Nick Levine, David Duvenaud, and Alec Radford. They called it speak upand it may be the largest disciplinary language model ever released to the public. Talkie … Read more

Build a Learning Powered Emphasis Agent that Learns to Retrieve Relevant Long-Term Memories for Accurate LLM Question Answering

Build a Learning Powered Emphasis Agent that Learns to Retrieve Relevant Long-Term Memories for Accurate LLM Question Answering

@dataclass class MemoryItem: memory_id: int topic: str entity: str slot: str value: str text: str def build_memory_bank() -> List[MemoryItem]: entities = [ { “entity”: “Astra”, “topic”: “robotics”, “facts”: { “battery”: “18 hours”, “sensor”: “LiDAR”, “country”: “Japan”, “release_year”: “2023”, “specialty”: “warehouse navigation”, }, }, { “entity”: “Orion”, “topic”: “astronomy”, “facts”: { “telescope”: “infrared array”, “country”: “Chile”, … Read more

Microsoft has relinquished exclusive control over OpenAI, and now the race for artificial intelligence seems to be wide open.

Microsoft has relinquished exclusive control over OpenAI, and now the race for artificial intelligence seems to be wide open.

Microsoft’s fortuitous entry into the OpenAI technology circle has been largely discounted. It is not a divorce or a complete separation. More like that, the most important artificial intelligence partnership is now getting stronger with other clouds. In the field of technology, that’s a big deal. For years, Microsoft’s initial investment in OpenAI helped make … Read more

Meta AI Releases Sapiens2: A High-Resolution Human Vision Model for Situation, Segmentation, Normalization, Point Mapping, and Albedo

Meta AI Releases Sapiens2: A High-Resolution Human Vision Model for Situation, Segmentation, Normalization, Point Mapping, and Albedo

If you’ve ever watched a motion capture system struggle with human fingers, or seen a segmentation model fail to distinguish between teeth and gums, you’ll understand why it’s hard to see human-centered computing. People aren’t just objects, they come with clear structure, fine details, and great variation in posture, clothing, lighting, and ethnicity. Finding a … Read more

RAG Without Vectors: How PageIndex Gets Through Consultation

RAG Without Vectors: How PageIndex Gets Through Consultation

Retrieval is where most RAG systems quietly break down. Native pipelines rely on vector matching—embedding queries and document fragments in the same location and retrieving the same “closest” match. But uniformity is a poor proxy for what we really need: compatibility based on reasoning. In longer, professional documents—such as financial reports, research papers, or legal … Read more

xAI Introduces grok-voice-think-fast-1.0: Bench-high τ-voice at 67.3%, High-Performance Gemini, Real-Time GPT, and More

xAI Introduces grok-voice-think-fast-1.0: Bench-high τ-voice at 67.3%, High-Performance Gemini, Real-Time GPT, and More

Building an AI agent for voice production is one of the most difficult engineering challenges in machine learning in use today. It’s not just about the accuracy of the transcription. You need a system that can capture the context of an entire five-minute conversation, request external APIs during the call without pausing, recover properly when … Read more