- What data sources can you integrate with?
- SharePoint, OneDrive, Notion, Google Drive, Confluence, GitHub wikis, Zendesk and HubSpot help centers, Intercom articles, S3 or Azure Blob document stores, Dropbox, Box, and most file systems with an API. PDFs, Word docs, Markdown, HTML, and slide decks are all handled. For sources without an API we build a sync adapter.
- Where do the vectors live?
- In a vector database we deploy with you — Pinecone, Weaviate, Qdrant, or pgvector running on your existing Postgres. Choice depends on scale, cost, and whether you want a managed service or to run it yourself. For Canadian data residency, we use Canadian regions on the managed services or self-host in AWS ca-central-1 or Azure Canada Central.
- How often does the index get re-indexed?
- Incrementally, in near real time. When a document changes in the source system, the ingestion pipeline detects it, re-chunks and re-embeds the affected chunks, and updates the vector store. Full reindex is only needed for things like an embedding-model change.
- How predictable is the cost?
- Three cost drivers: vector storage (typically the smallest and the steadiest), embeddings (one-time per document plus deltas), and inference (per query, the largest at scale). We share a forecast in the architecture phase and monitor against it monthly. Most engagements come in within 15% of the original forecast in steady state.
- How do you prevent hallucinated answers?
- Three layers. Retrieval confidence thresholds — below a cutoff the system refuses to answer rather than guess. Answer-faithfulness checks — the answer is verified against the retrieved chunks before it's returned. Citation on every claim — the user sees the source and can verify. For compliance-sensitive answers, we offer an exact-quote mode that returns retrieved passages verbatim.
- How accurate are the citations?
- Citations link back to the specific document chunks used to generate the answer. We measure citation accuracy as part of the eval — typically 90%+ for the cited-source-was-actually-used measure. The eval surfaces the cases where the system cited a source it didn't actually use and we tune retrieval to close those gaps.
- What does it cost to build?
- Pricing is custom per engagement and depends on the number of source systems, document volume, language and domain complexity, and the surface where the chat or search lives. We share pricing on the first discovery call. Inference and vector-database costs are passed through transparently — no markup.
- Who does the work?
- Two to three engineers from our Toronto-based team, led by Vatsal. The people who scope the engagement are the people who write the code. The full team is named on our team page — you can see and talk to them before we start.