LLM Projects

a diagram of a RAG application deployed on GKE
Scalable RAG with GKE and LlamaIndex

Implemented a Q&A pipeline using the LlamaIndex framework and Qdrant as the vector database, deployed on Google Kubernetes Engine as a FastAPI app packaged with a Dockerfile. Python files from GitHub repositories are loaded into the vector database, and the FastAPI app serves requests from an interactive Streamlit front end.
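
A minimal sketch of how the FastAPI endpoint might wire LlamaIndex to Qdrant (assuming llama-index >= 0.10-style imports and a pre-populated collection; the Qdrant URL, collection name, and route are placeholders):

```python
from fastapi import FastAPI
from pydantic import BaseModel
from qdrant_client import QdrantClient
from llama_index.core import VectorStoreIndex
from llama_index.vector_stores.qdrant import QdrantVectorStore

app = FastAPI()

# Connect to the Qdrant instance running in the cluster (URL is a placeholder).
client = QdrantClient(url="http://qdrant:6333")
vector_store = QdrantVectorStore(client=client, collection_name="code-chunks")

# Build an index view over the existing collection and expose it as a query engine.
index = VectorStoreIndex.from_vector_store(vector_store)
query_engine = index.as_query_engine()

class Question(BaseModel):
    text: str

@app.post("/query")
def query(question: Question):
    # The Streamlit front end posts questions here and renders the answer.
    response = query_engine.query(question.text)
    return {"answer": str(response)}
```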

a diagram of a RAG application deployed on AWS
AWS RAG Qdrant

Developed a serverless application with AWS Lambda and Qdrant for semantic search. Used AWS API Gateway for the API layer and Docker for containerization, and additionally built an interactive Streamlit app.
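
A hedged sketch of what the Lambda handler behind API Gateway might look like (the collection name, embedding model, and environment variables are assumptions):

```python
import json
import os

from openai import OpenAI
from qdrant_client import QdrantClient

openai_client = OpenAI()  # reads OPENAI_API_KEY from the environment
qdrant = QdrantClient(url=os.environ["QDRANT_URL"], api_key=os.environ.get("QDRANT_API_KEY"))

def lambda_handler(event, context):
    # API Gateway proxy integration delivers the request body as a JSON string.
    body = json.loads(event["body"])
    query = body["query"]

    # Embed the query, then run a semantic search against the collection.
    embedding = openai_client.embeddings.create(
        model="text-embedding-3-small", input=query
    ).data[0].embedding
    hits = qdrant.search(collection_name="documents", query_vector=embedding, limit=5)

    results = [{"score": h.score, "payload": h.payload} for h in hits]
    return {"statusCode": 200, "body": json.dumps({"results": results})}
```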

a diagram of a RAG application using crewAI agents
CrewAI RAG LangChain Qdrant

Implemented a Retrieval-Augmented Generation (RAG) project for question-answering (QA) retrieval, using LangChain with Qdrant as the vector store, together with Researcher and Writer agents from CrewAI to analyze and summarize research papers.
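
A minimal sketch of the two-agent CrewAI setup (roles and task wording are illustrative; the retrieval step over Qdrant is omitted here):

```python
from crewai import Agent, Task, Crew

researcher = Agent(
    role="Researcher",
    goal="Analyze the retrieved passages of a research paper",
    backstory="An analyst who extracts the key findings from technical papers.",
)
writer = Agent(
    role="Writer",
    goal="Summarize the researcher's findings for a general audience",
    backstory="A technical writer who turns analysis into a concise summary.",
)

research_task = Task(
    description="Analyze the paper excerpts and list the main contributions.",
    expected_output="A bullet list of the paper's main contributions.",
    agent=researcher,
)
writing_task = Task(
    description="Write a short summary based on the researcher's notes.",
    expected_output="A one-paragraph summary.",
    agent=writer,
)

crew = Crew(agents=[researcher, writer], tasks=[research_task, writing_task])
result = crew.kickoff()
print(result)
```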

a diagram of an LLM model fine-tuning
Fine-Tuning Gemma 2B

Fine-tuned the Gemma 2B model using quantization with LoRA adapters (QLoRA), hosting the model and adapters in a private Hugging Face repository.
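
A condensed sketch of the QLoRA setup (4-bit quantization plus LoRA adapters) and the push to a private Hugging Face repo; the target modules, LoRA rank, and repo name are assumptions:

```python
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer, BitsAndBytesConfig
from peft import LoraConfig, get_peft_model

# Load Gemma 2B in 4-bit (NF4) to keep memory low during fine-tuning.
bnb_config = BitsAndBytesConfig(
    load_in_4bit=True,
    bnb_4bit_quant_type="nf4",
    bnb_4bit_compute_dtype=torch.bfloat16,
)
model = AutoModelForCausalLM.from_pretrained("google/gemma-2b", quantization_config=bnb_config)
tokenizer = AutoTokenizer.from_pretrained("google/gemma-2b")

# Attach small trainable LoRA adapters on the attention projections.
lora_config = LoraConfig(
    r=8, lora_alpha=16, lora_dropout=0.05,
    target_modules=["q_proj", "v_proj"],
    task_type="CAUSAL_LM",
)
model = get_peft_model(model, lora_config)
model.print_trainable_parameters()

# ...training loop / Trainer goes here...

# Push the adapters to a private repo (repo name is a placeholder).
model.push_to_hub("my-user/gemma-2b-qlora", private=True)
```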

a photo of a robot
Agentic RAG LangChain Pinecone

Implemented a Retrieval-Augmented Generation (RAG) project for question-answering (QA) retrieval, using LangChain with Pinecone as the vector store. Explored multi-tenancy and multi-agent workflows, including memory.
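
A short sketch of how multi-tenancy might be handled with Pinecone namespaces in LangChain (index name, namespace, and embedding model are assumptions):

```python
from langchain_openai import OpenAIEmbeddings
from langchain_pinecone import PineconeVectorStore

embeddings = OpenAIEmbeddings(model="text-embedding-3-small")

# One Pinecone index, one namespace per tenant: queries only see that tenant's data.
def tenant_store(tenant_id: str) -> PineconeVectorStore:
    return PineconeVectorStore.from_existing_index(
        index_name="agentic-rag",
        embedding=embeddings,
        namespace=tenant_id,
    )

docs = tenant_store("tenant-a").similarity_search("What does the report conclude?", k=4)
for doc in docs:
    print(doc.page_content[:120])
```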

a photo of a llama
RAG Llama Index

Implemented a Retrieval-Augmented Generation (RAG) project for question-answering (QA) retrieval, utilizing LlamaIndex and the Deep Lake vector database.
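
A hedged sketch of the ingestion and query flow with LlamaIndex and Deep Lake (the data directory and dataset path are placeholders):

```python
from llama_index.core import SimpleDirectoryReader, StorageContext, VectorStoreIndex
from llama_index.vector_stores.deeplake import DeepLakeVectorStore

# Load local documents and index them into a Deep Lake dataset.
documents = SimpleDirectoryReader("./data").load_data()
vector_store = DeepLakeVectorStore(dataset_path="./deeplake_store")
storage_context = StorageContext.from_defaults(vector_store=vector_store)
index = VectorStoreIndex.from_documents(documents, storage_context=storage_context)

# Ask a question against the indexed documents.
query_engine = index.as_query_engine()
print(query_engine.query("What is the main topic of these documents?"))
```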

a photo of hallucinations
RAG LangChain Ragas

Implemented a RAG Q&A pipeline using LangChain as the framework and FAISS as the vector database, evaluating the pipeline with Ragas metrics such as Faithfulness, Answer Relevancy, Context Precision, Context Recall, and Answer Correctness.
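
A minimal sketch of the Ragas evaluation step (assuming the metric objects exposed by earlier ragas releases; the sample data is illustrative):

```python
from datasets import Dataset
from ragas import evaluate
from ragas.metrics import (
    faithfulness,
    answer_relevancy,
    context_precision,
    context_recall,
    answer_correctness,
)

# Each row pairs a question with the RAG pipeline's answer, the retrieved
# contexts, and a reference ("ground truth") answer.
data = {
    "question": ["What vector database does the pipeline use?"],
    "answer": ["The pipeline uses FAISS as its vector database."],
    "contexts": [["The Q&A pipeline stores embeddings in a FAISS index."]],
    "ground_truth": ["FAISS"],
}

results = evaluate(
    Dataset.from_dict(data),
    metrics=[faithfulness, answer_relevancy, context_precision, context_recall, answer_correctness],
)
print(results)
```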

a photo of a detective
Q&A and Summarization

Developed an LLM project for question-answering and summarization, employing the Whisper ASR (Automatic Speech Recognition) system and LangChain. Implemented a Streamlit app for local deployment, facilitating the extraction of information from audio and text.
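
A sketch of the transcription-plus-summarization step (assumes the open-source whisper package and a LangChain summarize chain; the model sizes and file name are placeholders):

```python
import whisper
from langchain.chains.summarize import load_summarize_chain
from langchain_core.documents import Document
from langchain_openai import ChatOpenAI

# Transcribe the audio file with Whisper.
asr = whisper.load_model("base")
transcript = asr.transcribe("meeting.mp3")["text"]

# Summarize the transcript with a simple "stuff" chain.
llm = ChatOpenAI(model="gpt-4o-mini", temperature=0)
chain = load_summarize_chain(llm, chain_type="stuff")
summary = chain.invoke({"input_documents": [Document(page_content=transcript)]})
print(summary["output_text"])
```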

RAG App with AWS CDK, Qdrant and LlamaIndex

Implemented a RAG pipeline using AWS CDK for infrastructure as code (IaC), the LlamaIndex framework, and Qdrant as the vector database, deployed on AWS as a FastAPI app packaged with a Dockerfile.
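
A hedged sketch of a CDK stack (Python) that packages the FastAPI app's Dockerfile as a container-image Lambda with a public URL; the construct IDs, Docker context path, and exposure choice are assumptions:

```python
from aws_cdk import Duration, Stack
from aws_cdk import aws_lambda as _lambda
from constructs import Construct

class RagStack(Stack):
    def __init__(self, scope: Construct, construct_id: str, **kwargs) -> None:
        super().__init__(scope, construct_id, **kwargs)

        # Build the FastAPI app's Dockerfile into a Lambda container image.
        rag_fn = _lambda.DockerImageFunction(
            self,
            "RagFunction",
            code=_lambda.DockerImageCode.from_image_asset("./app"),
            memory_size=1024,
            timeout=Duration.seconds(60),
        )

        # Expose the function via a public URL (an API Gateway would also work).
        rag_fn.add_function_url(auth_type=_lambda.FunctionUrlAuthType.NONE)
```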

Agentic RAG Using Claude, LlamaIndex, and Milvus

Implemented an agentic RAG pipeline for question-answering retrieval, using LlamaIndex with Milvus as the vector store, together with a critic/reflection agent.
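
A short sketch of wiring Claude and Milvus into LlamaIndex (the URI, collection name, embedding dimension, and model name are assumptions; the reflection loop itself is omitted):

```python
from llama_index.core import Settings, VectorStoreIndex
from llama_index.llms.anthropic import Anthropic
from llama_index.vector_stores.milvus import MilvusVectorStore

# Use Claude as the LLM behind the query engine.
Settings.llm = Anthropic(model="claude-3-5-sonnet-20240620")

# Point the index at an existing Milvus collection.
vector_store = MilvusVectorStore(uri="http://localhost:19530", collection_name="papers", dim=1536)
index = VectorStoreIndex.from_vector_store(vector_store)

query_engine = index.as_query_engine()
print(query_engine.query("Summarize the main findings of the indexed papers."))
```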

Azure RAG Qdrant

Developed a serverless application with the LangChain framework, Qdrant as the vector database, and an Azure Function with an HTTP trigger.
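
A minimal sketch of the HTTP-triggered Azure Function (Python v2 programming model); the route and the retrieval call are placeholders:

```python
import json

import azure.functions as func

app = func.FunctionApp()

@app.route(route="query", auth_level=func.AuthLevel.ANONYMOUS)
def query(req: func.HttpRequest) -> func.HttpResponse:
    # Read the question from the request body.
    body = req.get_json()
    question = body.get("query", "")

    # Here the LangChain retriever backed by Qdrant would answer the question;
    # this placeholder just echoes it back.
    answer = f"Received question: {question}"

    return func.HttpResponse(json.dumps({"answer": answer}), mimetype="application/json")
```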