
Beans API & MCP
v0.1 LiveBeans is a news and blog aggregator. It collects data from 7 000+ sources/publishers every day.
Key features
- Vector semantic search — Natural language queries with configurable accuracy thresholds
- Comprehensive filtering — By tags (categories, entities, regions), sources, and time ranges
- Entity & sentiment enrichment — Automatic extraction of sentiment, named entities, and metadata optimized for analytics and AI pipelines
- Trend scoring — Articles ranked by social engagement metrics for relevance and timeliness
- Cross-publisher linking — Related articles mapped across the entire feed
Authentication
Get API KeyCode
Base URL
All Beans endpoints live under the /beans path prefix.
Core endpoints
Articles
| Endpoint | Description |
|---|---|
GET /beans/articles/top-headlines | Top trending headlines from past 24 hours, ranked by trend score |
GET /beans/articles/latest | Most recently published articles, sorted by publish date (newest first) |
GET /beans/articles/trending | Trending articles ranked by trend score (based on social engagement) |
GET /beans/articles/search | Semantic or tag-based search across all articles in the database |
Sources
| Endpoint | Description |
|---|---|
GET /beans/sources | Retrieves detailed metadata for sources (site name, description, favicon) |
Tags / Metadata
| Endpoint | Description |
|---|---|
GET /beans/tags/categories | Paginated list of unique article categories/topics |
GET /beans/tags/entities | Paginated list of named entities (persons, orgs, products, places) |
GET /beans/tags/regions | Paginated list of geographic regions mentioned in articles |
Pagination
All metadata endpoints support offset (default 0) and limit (default 16, max 128).
Query Parameters
Articles
| Parameter | Type | Description |
|---|---|---|
q | string (3–512) | Optional semantic search query (natural language, triggers vector embedding) |
acc | number (0–1) | Embedding accuracy/similarity threshold — higher = stricter match (default 0.75) |
content_type | string | Content type filter: news, blog, post, comment, etc. |
tags | string[] | Case/whitespace-insensitive filter across categories, regions, entities combined (recommended). E.g., AI, ai, #ai are equivalent. AND combination. |
categories | string[] | Precise category topic filters — inclusive OR, case/whitespace-sensitive |
regions | string[] | Precise geographic region filters — inclusive OR, case/whitespace-sensitive |
entities | string[] | Precise named entity filters — inclusive OR, case/whitespace-sensitive |
sources | string[] | Publisher/source ID filters — inclusive OR |
from | date (YYYY-MM-DD) | Latest/Trending only: Articles published/trending since this date (defaults to 7 days ago) |
full_content | boolean | Include full article text (default false) — large payload |
limit | integer (1–128) | Results per page (default 16) |
offset | integer | Pagination offset — number of items to skip (default 0) |
Response: 200 OK → array of article objects.
Sources
| Parameter | Type | Description |
|---|---|---|
sources | string[] | Required. Source IDs to fetch metadata for (CSV, case-sensitive) |
limit | integer (1–128) | Items per page (default 16) |
offset | integer | Pagination offset (default 0) |
Response: 200 OK → array of Publisher objects.
Tags / Metadata
| Parameter | Type | Description |
|---|---|---|
limit | integer (1–128) | Items per page (default 16) |
offset | integer | Pagination offset (default 0) |
Response: 200 OK → array of strings.
Checkout API reference for more details.
MCP Server
Server URL: https://api.cafecito.tech/beans/mcp
Beans endpoints are exposed as hosted MCP tools for AI agent integration. See the MCP Integration guide for more details.
Examples
Below are real-world examples. Replace YOUR-API-KEY with the key you generated in the developer portal.
1. Health check — verify service is operational
2. Top headlines from the last 24 hours
Get trending headlines ranked by trend score, perfect for building breaking news dashboards.
3. Latest news on market performance & economy
4. Trending news on Robotics in Saudi Arabia
Use the trending endpoint to surface content ranked by social engagement signals and trend score.
5. Semantic search on archive — find articles about AI safety concerns
Search across all articles using natural language and retrieve full content for RAG/summarization.
6. Get sources metadata
Retrieve site names, descriptions, and favicon URLs for a set of publishers.
7. List article categories with pagination
Retrieve all unique categories/topics in the database.
Best practices
- Start with
acc=0.75, then tweak up for precision or down for recall. - Use
fromparameter to keep feeds fresh — specify YYYY-MM-DD dates (e.g.,from=2026-03-10). - Use
full_content=truesparingly — requests will be slower and payload larger; great for RAG/summarization pipelines. - Paginate with
offset+limitfor stable ingestion pipelines and monitoring. - Use
tagsparameter (recommended) for flexible filtering across categories, regions, and entities in one parameter. - Use precise filters (
categories,regions,entities) only when you need exact matches (case-sensitive). - Combine search (
q) with filters — semantic search + tag filters = powerful precision.
Use cases that aren't boring
- AI assistants and RAG workflows that need fresh context
- Finance dashboards that actually stay current
- Media trend detection — who's talking about what and when
- Analyst workflows that want enrichment-ready JSON, not raw HTML

