
Projects

End-to-end ML systems from research to production

🤖

Domain-Specific AI Agent

Master's Thesis Project (15 ECTS)

IN PROGRESS

Building a cost-effective, secure AI automation agent for domain-specific tasks. Focused on fine-tuning compact LLMs with retrieval-augmented generation, tool mapping, and compliance considerations. Exploring efficient ways to build specialized agents that understand domain context and reasoning patterns.

Python LLMs RAG LoRA Fine-tuning
⚙️

Konf.dev

Scalable Agentic AI Platform

Co-architected production-grade infrastructure for building reliable agentic systems. Contributed to memory systems (working, episodic, semantic), evaluation pipelines, and deployment reliability features. Focused on making AI agents production-ready through proper testing and monitoring.

Python FastAPI LangGraph Redis PostgreSQL
→ github.com/konf-dev
🔍

Konfsutra

AI-Powered Shortcut Assistant

DEMO

Built an intelligent assistant covering 500+ Linux shortcuts using a RAG pipeline. Combined vector embeddings with generative AI to answer natural-language queries from man pages with high accuracy. Demonstrates a practical application of retrieval-augmented generation for documentation search.

RAG LangChain OpenAI FAISS Vector DB
→ github.com/VJShah643/Konfsutra
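The retrieval step of a RAG pipeline like the one described above can be sketched in a few lines. This is an illustrative sketch only, not code from the Konfsutra repository: the `embed`, `cosine`, and `retrieve` names and the `DOCS` snippets are hypothetical, and a toy bag-of-words vector stands in for a learned embedding model and a FAISS index.

```python
import math
from collections import Counter

def embed(text):
    """Toy bag-of-words vector; an assumed stand-in for a learned embedding."""
    return Counter(text.lower().split())

def cosine(a, b):
    """Cosine similarity between two sparse term-count vectors."""
    dot = sum(a[t] * b[t] for t in a if t in b)
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0

def retrieve(query, docs, k=2):
    """Return the k documents most similar to the query."""
    q = embed(query)
    ranked = sorted(docs, key=lambda d: cosine(q, embed(d)), reverse=True)
    return ranked[:k]

# Hypothetical shortcut descriptions, for illustration only.
DOCS = [
    "ctrl+c interrupt the running process in the terminal",
    "ctrl+l clear the terminal screen",
    "ctrl+r search backward through command history",
]

if __name__ == "__main__":
    for hit in retrieve("clear the screen", DOCS, k=1):
        print(hit)
```

In a full pipeline the retrieved snippets would be placed in the prompt of a generative model; a vector index replaces the linear scan once the corpus grows beyond a few thousand entries.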
🧠

Custom Transformer

From Scratch Implementation

RESEARCH

Implemented a fully functional Transformer architecture from scratch in PyTorch. Built multi-head self-attention, positional encoding, and encoder-decoder components. Benchmarked on sequence tasks and evaluated the impact of hyperparameter tuning on model performance.

PyTorch Transformers Deep Learning NLP
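Two of the components named above, positional encoding and the attention operation inside each head, can be sketched compactly. This is a minimal illustration under stated assumptions, not the project's code: it uses plain Python lists instead of PyTorch tensors, assumes the standard sinusoidal encoding, and shows a single attention head without masking or learned projections. All function names are hypothetical.

```python
import math

def positional_encoding(seq_len, d_model):
    """Sinusoidal encoding: sine on even dimensions, cosine on odd ones."""
    pe = [[0.0] * d_model for _ in range(seq_len)]
    for pos in range(seq_len):
        for i in range(0, d_model, 2):
            angle = pos / (10000 ** (i / d_model))
            pe[pos][i] = math.sin(angle)
            if i + 1 < d_model:
                pe[pos][i + 1] = math.cos(angle)
    return pe

def softmax(xs):
    """Numerically stable softmax over a list of floats."""
    m = max(xs)
    exps = [math.exp(x - m) for x in xs]
    total = sum(exps)
    return [e / total for e in exps]

def scaled_dot_product_attention(q, k, v):
    """Single-head attention: softmax(q k^T / sqrt(d_k)) v.

    q is [n_q][d_k], k is [n_k][d_k], v is [n_k][d_v]; returns [n_q][d_v].
    """
    d_k = len(k[0])
    out = []
    for q_row in q:
        scores = [
            sum(a * b for a, b in zip(q_row, k_row)) / math.sqrt(d_k)
            for k_row in k
        ]
        weights = softmax(scores)
        out.append([
            sum(w * v_row[c] for w, v_row in zip(weights, v))
            for c in range(len(v[0]))
        ])
    return out

if __name__ == "__main__":
    x = positional_encoding(seq_len=3, d_model=4)
    print(scaled_dot_product_attention(x, x, x))
```

Multi-head attention runs several such heads in parallel on learned linear projections of the inputs and concatenates their outputs.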
🚀

Distributed Sentiment Analysis

Scalable Big Data Pipeline

BIG DATA

Built a distributed ML pipeline for sentiment analysis on 14M+ song lyrics, achieving 92% accuracy and 40% faster processing. Leveraged the Hadoop ecosystem with Spark, PySpark, MapReduce, and HDFS for scalable data processing. Demonstrates expertise in big data engineering and distributed computing.

Spark Hadoop PySpark MapReduce HDFS
🏢

ApartMint

Conversational Housing Search

Built a user-friendly apartment search application with a natural-language query interface. Includes a preference-parsing system, intelligent ranking algorithms, and application guidance features to simplify the housing search experience.

Python FastAPI LLMs RAG Gemini