AI Radar

Granite Embedding Multilingual R2: Open Apache 2.0 Multilingual Embeddings with 32K Context — Best Sub-100M Retrieval Quality

9/10

Hugging Face blog local LLM · May 14 · huggingface.co

IBM has released Granite Embedding Multilingual R2, an Apache 2.0 licensed multilingual embedding model with a 32K context window. This model achieves state-of-the-art retrieval quality among models under 100 million parameters. It is designed for efficient and accurate information retrieval across multiple languages.

→ This sub-100M multilingual embedding model with a huge context window is a strong contender for RAG applications, especially for those looking for open-source, performant alternatives to larger models.

multilingual embeddings rag open-source model retrieval quality apache 2.0

Unlocking asynchronicity in continuous batching

9/10

Hugging Face blog local LLM · May 14 · huggingface.co

inaturalist-clumper 0.1

8/10

Simon Willison local LLM · May 15 · simonwillison.net

Simon Willison released inaturalist-clumper 0.1, a tool he uses to publish his iNaturalist sightings to his blog. This release follows several weeks of production use and iterations on its functionality. An example of the output is available as a JSON file.

→ This is a low-signal item, but it's a good example of how developers are building custom tools for data integration and publishing, which could inspire similar RAG-related projects.

data integration blogging tools open-source projects

inaturalist-clumper 0.1

8/10

Simon Willison local LLM · May 15 · simonwillison.net

Simon Willison released inaturalist-clumper 0.1, a tool he uses to publish his iNaturalist sightings to his blog. This release follows several weeks of production use and iterations on its functionality. An example of the output is available as a JSON file.

→ This is a low-signal item, but it's a good example of how developers are building custom tools for data integration and publishing, which could inspire similar RAG-related projects.

data integration blogging tools open-source projects

datasette-llm-limits 0.1a0

8/10

Simon Willison local LLM · May 15 · simonwillison.net

Simon Willison released datasette-llm-limits 0.1a0, a new plugin for Datasette. This plugin integrates with datasette-llm and datasette-llm-accountant to enable per-user or global spending limits for LLM usage within Datasette. It allows configuration of limits such as a rolling 24-hour window with a specified USD amount.

→ This is a practical tool for managing LLM costs in Datasette, especially useful for controlling spending in multi-user environments or when experimenting with various local LLMs.

datasette llm limits cost management plugin

datasette-ip-rate-limit 0.1a0

8/10

Simon Willison local LLM · May 14 · simonwillison.net

Simon Willison released datasette-ip-rate-limit 0.1a0, a new plugin for Datasette. This plugin was developed with the help of Codex (GPT-5.5 xhigh) to address issues with poorly-behaved crawlers. It allows for configurable rate limiting based on IP addresses, with options for exempt paths and specific rules for different site areas.

→ This is a great example of using an LLM (even a non-local one) for practical, targeted development to solve real-world infrastructure problems, which is highly relevant for tooling and RAG improvements.

datasette plugin rate limiting llm development web infrastructure tooling

Granite Embedding Multilingual R2: Open Apache 2.0 Multilingual Embeddings with 32K Context — Best Sub-100M Retrieval Quality

9/10

Hugging Face blog local LLM · May 14 · huggingface.co

IBM has released Granite Embedding Multilingual R2, an Apache 2.0 licensed multilingual embedding model with a 32K context window. This model achieves state-of-the-art retrieval quality among models under 100 million parameters. It is designed for efficient and accurate information retrieval across multiple languages.

→ This sub-100M multilingual embedding model with a huge context window is a strong contender for RAG applications, especially for those looking for open-source, performant alternatives to larger models.

multilingual embeddings rag open-source model retrieval quality apache 2.0

Granite Embedding Multilingual R2: Open Apache 2.0 Multilingual Embeddings with 32K Context — Best Sub-100M Retrieval Quality

9/10

Hugging Face blog local LLM · May 14 · huggingface.co

IBM has released Granite Embedding Multilingual R2, an Apache 2.0 licensed multilingual embedding model with a 32K context window. This model achieves state-of-the-art retrieval quality among models under 100 million parameters. It is designed for efficient and accurate information retrieval across multiple languages.

→ This sub-100M multilingual embedding model with a huge context window is a strong contender for RAG applications, especially for those looking for open-source, performant alternatives to larger models.

multilingual embeddings rag open-source model retrieval quality apache 2.0

Unlocking asynchronicity in continuous batching

9/10

Hugging Face blog local LLM · May 14 · huggingface.co

inaturalist-clumper 0.1

8/10

Simon Willison local LLM · May 15 · simonwillison.net

Simon Willison released inaturalist-clumper 0.1, a tool he uses to publish his iNaturalist sightings to his blog. This release follows several weeks of production use and iterations on its functionality. An example of the output is available as a JSON file.

→ This is a low-signal item, but it's a good example of how developers are building custom tools for data integration and publishing, which could inspire similar RAG-related projects.

data integration blogging tools open-source projects

datasette-llm-limits 0.1a0

8/10

Simon Willison local LLM · May 15 · simonwillison.net

Simon Willison released datasette-llm-limits 0.1a0, a new plugin for Datasette. This plugin integrates with datasette-llm and datasette-llm-accountant to enable per-user or global spending limits for LLM usage within Datasette. It allows configuration of limits such as a rolling 24-hour window with a specified USD amount.

→ This is a practical tool for managing LLM costs in Datasette, especially useful for controlling spending in multi-user environments or when experimenting with various local LLMs.

datasette llm limits cost management plugin

datasette-ip-rate-limit 0.1a0

8/10

Simon Willison local LLM · May 14 · simonwillison.net

Simon Willison released datasette-ip-rate-limit 0.1a0, a new plugin for Datasette. This plugin was developed with the help of Codex (GPT-5.5 xhigh) to address issues with poorly-behaved crawlers. It allows for configurable rate limiting based on IP addresses, with options for exempt paths and specific rules for different site areas.

→ This is a great example of using an LLM (even a non-local one) for practical, targeted development to solve real-world infrastructure problems, which is highly relevant for tooling and RAG improvements.

datasette plugin rate limiting llm development web infrastructure tooling

Helping ChatGPT better recognize context in sensitive conversations

8/10

OpenAI news frontier · May 14 · openai.com

Learn how new ChatGPT safety updates improve context awareness in sensitive conversations, helping detect risk over time and respond more safely.

QR code generator

7/10

Simon Willison local LLM · May 15 · simonwillison.net

Tool: QR code generator Claude helped me build this tool for creating QR codes, for both text/URLs and for connecting to WiFi networks. Tags: vibe-coding , tools , generative-ai , ai , llms

Not so locked in any more

7/10

Simon Willison local LLM · May 14 · simonwillison.net

This Mitchell Hashimoto quote about Bun migrating from Zig to Rust reminded me of a similar conversation I had at a conference last week. I was talking to someone who worked for a medium sized technology company with a pair of legacy/ legendary iPhone and Android apps. They told

OpenAI and Malta partner to bring ChatGPT Plus to all citizens

7/10

OpenAI news frontier · May 16 · openai.com

OpenAI and Malta partner to expand AI access, offering ChatGPT Plus and training to help citizens build practical AI skills and use AI responsibly.

Work with Codex from anywhere

7/10

OpenAI news frontier · May 14 · openai.com

Use Codex anywhere with the ChatGPT mobile app. Monitor, steer, and approve coding tasks in real time across devices and remote environments.

[AINews] Cerebras' $60B IPO: Slowly, then All at Once

7/10

Latent Space commentary · May 16 · www.latent.space

We normally focus on technical stories, but occasional large fundraisings are noteworthy in themselves, and the Cerebras IPO (after one pulled S-1 and a fantastic 750MW partnership and $10-$20B stake/deal with OpenAI) this week, certainly qualifies as a growing theme supporting t

AI-Native Healthcare: 100M Doctor Visits, 10–20 Hours Saved, Prior Auth in Minutes — Janie Lee & Chai Asawa, Abridge

7/10

Latent Space commentary · May 14 · www.latent.space

Special discounts up for AIE Melbourne ( LS discount ) and AIE World’s Fair (group discounts up to 25% - CFPs still open for Autoresearch and Vertical AI ) Cya there! Abridge did not start as an “GPT wrapper”. It was founded in 2018, years before the Cambrian explosion of AI appl

New AI Model SHOCKED Me

7/10

Matt Wolfe YouTube commentary · May 16 · www.youtube.com

This might be the first AI demo in a LONG time that felt like there was actually a big step up in intelligence. Thinking Machines Labs just showed off their new “Interaction Models” and some of these demos genuinely feel like the next evolution of AI. They showed off real-time tr

Quoting Mitchell Hashimoto

6/10

Simon Willison local LLM · May 14 · simonwillison.net

[...] On the interesting side is how fungible programming languages are nowadays. Programming languages used to be LOCK IN, and they're increasingly not so. You think the Bun rewrite in Rust is good for Rust? Bun has shown they can be in probably any language they want in roughly

A new personal finance experience in ChatGPT

6/10

OpenAI news frontier · May 15 · openai.com

Preview a new personal finance experience in ChatGPT for Pro users in the U.S. Securely connect your financial accounts and get AI-powered insights and guidance grounded in your financial context, goals, and priorities.

How data science teams use Codex

6/10

OpenAI news frontier · May 15 · openai.com

See how data science teams can use Codex to build root-cause briefs, impact readouts, KPI memos, scoped analyses, and dashboard specs from real work inputs.

Databricks brings GPT-5.5 to enterprise agent workflows

6/10

OpenAI news frontier · May 15 · openai.com

Databricks uses GPT-5.5 for enterprise agent workflows after the model set a new state of the art on the OfficeQA Pro benchmark.

Sea's View on the Future of Agentic Software Development with Codex

6/10

OpenAI news frontier · May 14 · openai.com

Sea Limited's CPO explains why the company is deploying Codex across engineering teams to accelerate AI-native software development in Asia.

AI News: OpenAI Finally Released What We Asked For

6/10

Matt Wolfe YouTube commentary · May 15 · www.youtube.com

Here's the AI News you probably missed this week. Stop choosing between performance and budget and start building today with Crusoe Managed Inference at www.crusoe.ai/cloud/managed-inference?utm_source=mattwolfe&utm_medium=influencer&utm_campaign=bring_your_own_model_launch Disco

How sales teams use Codex

5/10

OpenAI news frontier · May 15 · openai.com

See how sales teams can use Codex to create pipeline briefs, meeting prep packets, forecast reviews, account plans, and stalled-deal diagnoses from real work inputs.

My Favorite AI Model Right Now

5/10

Matt Wolfe YouTube commentary · May 14 · www.youtube.com

I'm always being asked to pick my favorite AI model, but the truth is my favorite changes all the time based on what's out there. At the moment, GPT-5.5 is probably my go-to for almost everything like coding, brainstorming, and general questions. But a month ago I probably would’

Pipeline stats

244 items pulled across 19 sources
31 new items written to tp3_memories_local
31 locally scored by Ollama (gemma3:4b)
4 summarized by Gemini Flash · spend $0.0021 this run / $0.0021 today / $1.00 cap
21 rendered above (score ≥ 5)

Top 3 today

Granite Embedding Multilingual R2: Open Apache 2.0 Multilingual Embeddings with 32K Context — Best Sub-100M Retrieval Quality

Unlocking asynchronicity in continuous batching

inaturalist-clumper 0.1

By topic

other (4)

inaturalist-clumper 0.1

datasette-llm-limits 0.1a0

datasette-ip-rate-limit 0.1a0

Granite Embedding Multilingual R2: Open Apache 2.0 Multilingual Embeddings with 32K Context — Best Sub-100M Retrieval Quality

Full ranked list

Granite Embedding Multilingual R2: Open Apache 2.0 Multilingual Embeddings with 32K Context — Best Sub-100M Retrieval Quality

Unlocking asynchronicity in continuous batching

inaturalist-clumper 0.1

datasette-llm-limits 0.1a0

datasette-ip-rate-limit 0.1a0

Helping ChatGPT better recognize context in sensitive conversations

QR code generator

Not so locked in any more

OpenAI and Malta partner to bring ChatGPT Plus to all citizens

Work with Codex from anywhere

[AINews] Cerebras' $60B IPO: Slowly, then All at Once

AI-Native Healthcare: 100M Doctor Visits, 10–20 Hours Saved, Prior Auth in Minutes — Janie Lee & Chai Asawa, Abridge

New AI Model SHOCKED Me

Quoting Mitchell Hashimoto

A new personal finance experience in ChatGPT

How data science teams use Codex

Databricks brings GPT-5.5 to enterprise agent workflows

Sea's View on the Future of Agentic Software Development with Codex

AI News: OpenAI Finally Released What We Asked For

How sales teams use Codex

My Favorite AI Model Right Now

Pipeline status

Pipeline stats

Dead sources today