Getting Started

Introduction

PureRecall is an AI-driven meeting transcription and retrieval system designed to handle:

Real-time Transcriptions: Using AWS Transcribe (audio → text)
Data Storage & Search: Chunks stored in Supabase, queried through a Vector DB approach (pgvector)
RAG Summaries: Summaries generated via GPT
Intuitive UI: React + Tailwind + Vite for frontend, with robust integration via Supabase Edge Functions on the backend

React (TypeScript) for the frontend
Supabase (PostgreSQL + pgvector) for chunk storage & indexing, plus Edge Functions for serverless logic
AWS for real-time transcription using Amazon Transcribe Streaming
OpenAI (GPT) for summarization, chunk analysis, keyword extraction

Token Cost Management: Instead of sending entire transcripts, we chunk + embed, retrieving only relevant pieces for GPT
Speed & Scalability: We use parallel approaches for chunk ingestion (Supabase Edge Functions can be triggered in parallel)
Ease of Development: A single codebase with local + production parity (via Supabase CLI)

git clone [repository-url]
cd [repository-folder]

npm install

Install Supabase CLI:

npm install -g supabase
supabase login

For local development:

supabase start
supabase functions serve

Create a .env in the root folder with the following variables:

npm run dev

This serves the React app on a local port (e.g., localhost:5173 or similar).

When ready to push to production:

supabase functions deploy