LLM Projects


Scalable RAG with GKE and LlamaIndex
Implemented a Q&A Pipeline using the LlamaIndex framework, Qdrant as a vector database, and deployment on Google Kubernetes Engine using a FastAPI app and Dockerfile. Python files from GitHub repositories are loaded into the vector database, and the FastAPI app processes requests within an interactive Streamlit app


AWS RAG Qdrant
Developed a Serverless Application with AWS Lambda and Qdrant for Semantic Search. Utilized AWS API Gateway for API development, Docker for containerization, and created additionally an interactive Streamlit app


CrewAI RAG LangChain Qdrant
Implemented a Retrieval-Augmented Generation (RAG) project for Question-Answering (QA) retrieval, using LangChain and Qdrant as vector store, together with a Research and Writer agent from CrewAI to analyze and summarize research papers


Fine-Tuning Gemma 2B
Fine tuning of Gemma 2B model using quantization and LoRA Adapters (QLoRA) and hosting the model/adapters in a private repo in Hugging Face


Agentic RAG LangChain Pinecone
Implemented a Retrieval-Augmented Generation (RAG) project for Question-Answering (QA) retrieval, using LangChain and Pinecone as vector store. Explored the concept of multi-tenancy and multi-agents workflows including memory


RAG Llama Index
Implemented a Retrieval-Augmented Generation (RAG) project for Question-Answering (QA) retrieval, utilizing Llama Index and Deep Lake vector database


RAG LangChain Ragas
Implemented a RAG Q&A Pipeline project, using LangChain as Framework, FAISS vector database and evaluating the performance of the RAG model using Ragas metrics such as Faithfulness, Answer Relevancy, Context Precision, Context Recall, and Answer Correctness


Q&A and Summarization
Developed a LLM project for Question-Answering and Summarization, employing Whisper ASR (Automatic Speech Recognition) system and Langchain. Implemented an app using Streamlit for local deployment, facilitating the extraction of audio and text information


RAG App with AWS CDK, Qdrant and LlamaIndex
Implemented a RAG Pipeline using AWS CDK as IaC, LlamaIndex framework, Qdrant as a vector database, and deployment on AWS using a FastAPI app and Dockerfile


Agentic RAG Using Claude, LlamaIndex, and Milvus
Implemented an agentic RAG pipeline for Question-Answering retrieval, using LlamaIndex and Milvus as vector store, together with a critic reflection agent


Azure RAG Qdrant
Developed a Serverless Application with LangChain framework, Qdrant as vector database and Azure Function with HttpTrigger


AWS RAG with OpenSearch, Bedrock and Langchain
Implemented an application using Terraform IaC, LangChain as orchestration framework, Bedrock LLM and embedding model and OpenSearch as vector database and endpoint


RAG with Milvus, LlamaIndex and PII Modules
Developed a RAG pipeline using LlamaIndex, Milvus as vector store, and different PII (Personally Identifiable Information) modules from LlamaIndex and Presidio


Multimodal Bill Scan System with AWS Services
Developed a multimodal application using AWS CDK as IaC, Claude 3 Sonnet as multimodal model, DynamoDB as storage. and SQS/SNS for messaging and notifications.