InsightFlow is a web application that transcribes, summarizes, and enables Q&A over audio, video, documents, and links using advanced AI models. It leverages Retrieval-Augmented Generation (RAG) and vector search for accurate, context-aware answers.
- Transcribe audio and video files to text (AssemblyAI)
- Summarize meeting content using LLMs (Cohere, AI21)
- Q&A over meeting content using RAG
- Supports documents and web links
- Video-to-audio extraction
- Semantic search with FAISS vector database
- User authentication and secure file upload
- Automated email delivery of summaries
- AES-256 encryption for sensitive data
- Scalable deployment (Render, Vercel)
- Frontend: React.js
- Backend: FastAPI
- Database: MongoDB
- Vector Search: FAISS
- AI/LLM APIs: AssemblyAI, Cohere, AI21
- Deployment: Render (backend), Vercel (frontend)
- Python 3.8+
- Node.js 16+
- MongoDB instance
- AssemblyAI, Cohere, and AI21 API keys
- Clone the repository:
git clone https://github.com/Kprateek283/Insightflow.git cd Insightflow/backend - Create a virtual environment and activate it:
python -m venv venv source venv/bin/activate - Install dependencies:
pip install -r requirements.txt
- Set up your
.envfile with the required API keys and MongoDB URI. - Run the backend server:
uvicorn main:app --reload
- Open a new terminal and navigate to the frontend directory:
cd ../frontend - Install dependencies:
npm install
- Set up your environment variables (e.g., API base URL).
- Run the frontend:
npm start
- Register or log in to your account.
- Upload audio, video, document, or link.
- Wait for transcription and summary generation.
- Ask questions about your content and receive context-aware answers.
- Download or receive summaries via email.
- All sensitive data is encrypted using AES-256.
- JWT-based authentication for secure access.
Contributions are welcome! Please open issues or submit pull requests for improvements or bug fixes.
This project is licensed under the MIT License.
For questions or support, contact kprateek283@gmail.com.