About knot

We make AI chatbots feel obvious.

knot is a small team building the AI infrastructure we wish existed: fast, multi-tenant, transparent, and copy-pasteable. No setup theatre. No vendor lock-in. Just answers.

Mission

Make grounded AI boringly easy.

The AI landscape is split into two extremes. On one side is raw API access: powerful, but you build the retrieval, the streaming, the tenancy, and the analytics yourself. On the other are closed platforms that lock you into one model and one UI.

knot is the missing middle. The full RAG + multi-tenant stack, with the model layer still open. Bring your own key. Ship in an afternoon. Scale to a million conversations without rearchitecting.

Our north star

"The visitor asks a question. The right answer streams back in under 300ms. Every other concern hides behind a sensible default."

Jacob Reyes

Founder, knot.ai

What we believe

Four opinions we'll never compromise on.

Ship the obvious thing.

Most AI products bury the magic under setup. We chase the shortest possible path from intent → answer.

Speed is a feature.

Sub-second streams aren't a luxury — they're the difference between a tool and a toy.

Tenancy is sacred.

Two layers of isolation, audited by default. Your data never bleeds into someone else's namespace.

Operator-grade UX.

We build for the engineer on Friday afternoon. Sane defaults, copy-pasteable code, no surprises.

Story so far

From weekend prototype to production AI.

2025 Q3

Project knot started

Born out of frustration with bloated chatbot platforms. First prototype shipped in a weekend.

2025 Q4

RAG pipeline goes live

Embeddings, vector retrieval, and a streaming LLM, all stitched together. The grounded-answer loop closes in under 300ms end to end.

2026 Q1

Multi-tenant launch

Per-workspace isolation everywhere — Postgres, vectors, analytics. First paying customers go live.

2026 Q2

BYO-key + Embed v2

The Pro plan adds bring-your-own provider keys. The widget is rewritten in vanilla JS, ~12 kB gzipped.

Now

You're here

Public beta. Free tier forever. Self-serve onboarding. We're listening — talk to us anytime.

The stack

Built on battle-tested primitives.

We pick boring infrastructure so we can build interesting product.

Streaming LLM core

Token-by-token responses

sub-second time-to-first-token

Multi-provider routing

Pick the model that fits

swap providers without rewriting prompts

Per-workspace vectors

Retrieval layer

your data, isolated from every other tenant

Managed Postgres + Auth

Identity + storage

row-level isolation on every write

Embedding pipeline

Semantic search

incremental, re-runnable on every edit

Serverless platform

Web + API

no infra to babysit, scales to traffic

Team

Small, opinionated, and shipping.

No middle management. Every line of code, every email, every onboarding call comes from someone on this list.

Jacob Reyes

Founder · Engineering

Ex-platform. Ships at the speed of the model.

Sara Kapoor

Design · Brand

Believes great UI is invisible.

Wei Tanaka

ML · Retrieval

Lives in embedding space.

Maya Lindqvist

Customer Success

Onboards every Pro customer personally.

Come build with us.

We're a small team. Every customer matters. Every bug gets a human.