Projects

Research and Development

🤖

Domain-Specific AI Agent

Master's Course Project

IN PROGRESS

This project is in colaboration with Logpilot, Sweden. Logpilot AI leverages proprietary predictive models and fine-tuned LLMs (Mistral AI) to anticipate and resolve issues before they disrupt your operations. We aim to develop a domain-specific AI automation agent that balances cost-effectiveness, scalability, and security. Students will fine-tune a compact open-weight language model (e.g., Mistral-7B or Llama 3-8B) using parameter-efficient techniques like LoRA or QLoRA. The agent will be integrated into an autonomous framework with memory and tool-use capabilities, inspired by architectures such as Reflexion and AutoGen. Emphasis will be placed on deploying the agent in a secure environment, ensuring compliance with data protection standards.

Python LLMs RAG LoRA Fine-tuning

🏢

ApartMint

Cource Project for LLMs and Societal Consequences of AI

User-friendly apartment search with natural language queries. Built preference parsing system, intelligent ranking algorithms, and application guidance features to simplify the housing search experience.

Python FastAPI Google Gemini Pydantic FAISS

→ github.com/VJShah643/ApartMint

⚙️

Konf.dev

Scalable Agentic AI Platform

Co-architected and developed a scalable agentic AI platform designed for production environments. Includes declarative agent specifications with YAML-based configuration, multi-tier memory systems (working, episodic, semantic), integration with vector stores and relational databases, comprehensive evaluation pipelines for agent testing.

Python FastAPI LangGraph LangFuse PostgreSQL

→ github.com/konf-dev

🔍

Konfsutra

AI-Powered Shortcut Assistant

Built an intelligent assistant for 500+ Linux shortcuts using RAG pipeline. Engineered vector embeddings and generative AI to answer natural language queries from man pages with high accuracy. Demonstrates practical application of retrieval-augmented generation for documentation search.

RAG LangChain OpenAI FAISS Vector DB

→ github.com/VJShah643/Konfsutra

🧠

Custom Transformer

From Scratch Implementation

Implemented a fully functional Transformer architecture from scratch in PyTorch. Built multi-head self-attention, positional encoding, and encoder-decoder components. Benchmarked on sequence tasks and evaluated the impact of hyperparameter tuning on model performance.

PyTorch Transformers Deep Learning NLP

🚀

Distributed Sentiment Analysis 14M+ song lyrics

Scalable Big Data Pipeline

BIG DATA

Built distributed ML pipeline for sentiment analysis on 14M+ song lyrics, achieving 92% accuracy and 40% faster processing. Leveraged Hadoop ecosystem with Spark, PySpark, MapReduce, and HDFS for scalable data processing. Demonstrates expertise in big data engineering and distributed computing.

Spark Hadoop PySpark MapReduce HDFS

→ github.com/eagning1/DE1-G29