The Problem
Most organizations sit on a fragmented content estate: a CMS for published material, a wiki for internal notes, feeds and crawls for outside coverage, archives in cloud storage, and structured records behind internal APIs. There’s no unified way to query across all of it, and no clean way to push the consolidated view back out to the tools and systems that need it.
Honcho closes that gap. Ingest documents, feeds, API responses, and bulk archives into a single full-text index, then expose the consolidated content back out through GraphQL, REST, MCP, and replication endpoints. AI assistants are one of many consumers of the same data.
- Ingest from anywhere
- Query across everything
- Emit anywhere
The Short Version
Content flows in from many directions: bulk importers for CMS migrations and document archives, pull replication from WordPress and other CMS platforms, feed crawling for ongoing sources, an extract API with declarative rules for structured data, pluggable Java connectors for custom integrations, and AI assistants that save articles and notes on your behalf. Tens of thousands of articles, hundreds of feeds, decades of archive: everything lands in the same full-text index.
Once indexed, the consolidated corpus is queryable through whatever protocol fits your stack. GraphQL for flexible structured queries, REST for JSON, feed, and sitemap output, and MCP for AI clients. Boolean search, faceted filters, time-range constraints, structured fragment lookups, and configurable relevance. The full power of Lucene is exposed through clean APIs. Push the same content back out through replication to other Honcho instances or downstream systems.
The same data is also reachable through human and AI surfaces: a web Console for operations (sources, users, tokens, metrics, Editions config), a built-in Copilot chat interface in the browser, and an MCP server that lets Claude, Claude Code, and other AI clients query the index directly. Use whichever fits the task.
On top of that foundation, you can ask “how have expert views on China’s economy evolved over the past decade?” and get an answer synthesized across thousands of indexed documents, threading together analysis, tracking how positions shifted, surfacing connections that no keyword search would find. Combine results from your library with live web content, search connectors, and internal APIs to compare, verify, and fill gaps.
It supports multiple users and groups, so a team can share a common pool of content and collaborate through their AI assistants. An analyst can summarize a report and share it to the team’s feed. A writer can research across the entire corpus and send findings to colleagues. A shared knowledge base builds organically without anyone copying and pasting links around.
Honcho is also a persistence layer for AI workflows. Agent apps that summarize articles, monitor topics, or produce analysis can write their output back to Honcho, where it becomes searchable alongside everything else. The content you curate and the content your tools produce all live in one place.
Most importantly, Honcho gets smarter as you use it. Every entry you save, every note you add, every article you mark as important becomes a curated relevance signal that shapes future answers. Rather than stuffing the AI’s context window with whatever matches the keywords, Honcho prioritizes your own research and annotations, so the AI answers in the context of what you already know and think, not just what’s in the library.
Cross-topic synthesis across years of financial commentary. Briefings assembled from dozens of sources in minutes. Expert opinion tracked over time. Pattern recognition across industries.
See real use cases →What It Does
Most content platforms are assembled from separate services: a CMS for storage, Elasticsearch for search, a feed reader for ingestion, S3 for assets, a custom API layer to glue it all together. Honcho replaces that stack with a single integrated system deployed on your infrastructure.
- Aggregate
- Ingest
- Index
- Serve
- Replicate
| Concern | Typical Stack | Honcho |
|---|---|---|
| Storage | PostgreSQL / DynamoDB | Built-in |
| Search | Algolia / Elasticsearch | Built-in |
| Crawling | Scrapy / custom crawlers | Built-in |
| Ingestion | Custom importers / ETL | Built-in (MCP, save URL, extract API, bulk import) |
| Assets | S3 / cloud storage | Built-in |
| API | Custom REST / GraphQL | Built-in (GraphQL + REST + MCP) |
| Sync | Custom ETL / webhooks | Built-in |
What Makes It Different
- Search is built in, not bolted on
- Structured content fragments
- Declarative content extraction
- One API surface, three protocols
- One system, not six
- AI surfaces share the same foundation
- Curated context, not brute-force retrieval
Architecture
Built on standard enterprise infrastructure that any Java team can deploy and maintain. No exotic dependencies, no cloud-specific lock-in, no operational surprises.
- Runtime
- Search engine
- API layer
- Data model
- Instrumentation
Access Surfaces
Honcho exposes the same content, the same connectors, and the same write paths through four complementary interfaces. Pick the one that fits the task at hand. Most teams use several.
- GraphQL & REST APIs
- Console
- Copilot
- MCP server
MCP Server
Honcho includes a built-in Model Context Protocol server,
so AI assistants like Claude can search, retrieve, create content, query external data
sources, and fetch web content directly. Point any MCP client at the /mcp
endpoint and the entire index becomes conversational. Your client’s LLM does
the reasoning. See practical use cases →
| Tool | What It Does |
|---|---|
search_content |
Full-text search with host, author, date, group, collection, source tag, tag, type, and sort filters. Scope to your library, your group, or all public content. |
get_entry |
Retrieve a single entry by UID or most recent |
get_entries |
Retrieve multiple entries by UID in a single call |
find_similar_entries |
Find related content by UID. Uses More Like This to surface entries you might have missed |
create_entry |
Create a new entry with title, content, tags, topics, type, author, and metadata |
update_entry |
Update fields on an existing entry. Replace or append tags/topics, merge metadata |
delete_entry |
Soft-delete an entry from the database and search index |
tag_entries |
Add or remove tags from entries matching a search query |
get_digest |
Read a named digest. Admin-curated reading lists and briefings |
search_connector |
Query external data sources directly (SEC filings, economic data, etc.) |
fetch_url |
Fetch a URL and extract its content for analysis or comparison |
web_search |
Search the web to cross-reference, fill gaps, or find content not yet in your library |
get_status |
Account overview: sources, entries, groups, digests, and connectors |
save_memory |
Save a note that can be recalled in future conversations |
recall_memory |
Search or list saved memories |
send_feedback |
Report bugs, request features, or provide feedback |
Source management, favorites, group messaging, and other operational features stay in the admin UI and Copilot. MCP is intentionally focused on querying and writing content, not managing the system.
- Your client’s LLM does the work
- Read and write
- External data connectors
- Web + URL fetch
- Cross-session memory
- Enterprise-ready auth
Built-in Copilot
For users who don’t already have an MCP client, or teams that want a single shared chat surface without per-user setup, Honcho ships Copilot, a built-in AI chat interface that opens in a browser tab and works out of the box.
- Zero setup
- Bring any LLM provider
- Token-efficient orchestration
- Dynamic planner
- Document upload
- Chat history & forks
- Group collaboration
- Style guide
Editions
AI-curated briefings that synthesize your indexed content into newspaper-style summary pages. Configure which sources to draw from, set a schedule, and Honcho generates a branded publication automatically.
- Daily & weekly
- Scheduled generation
- Custom branding
- Public URLs & sharing
- API access
Use Cases
- Research & analysis
- Team knowledge base
- Content aggregation and monitoring
- CMS integration & replication
- Headless search API
Status
Honcho is actively developed and available for licensing, collaboration, or investment.