Sarthak Singh

Skilled in building scalable and distributed systems.
I build LLM-based RAG pipelines and tools for automation.

I am a Master’s in Software Engineering student at the University of Maryland, graduating in December 2025. I previously worked as a Software Engineer at IBM India Labs and as a full-stack developer for Influenced (US startup). I specialize in Python, Golang, Kubernetes, AWS, FastAPI, and high-performance backend systems. I enjoy sports, outdoor activities, and working on AI automation projects.

Profile Photo

Projects

RAG Expert Assistant

A full Retrieval-Augmented Generation (RAG) system that allows users to upload documents, generate embeddings, store them in a vector database, and chat with the content using an LLM-powered assistant. Demonstrates end-to-end document ingestion, intelligent chunking, and real-time context retrieval.

LangChain RAG ChromaDB Vector Search Gradio Python 3.10+

Indian Kanoon LLM

An AI-powered Indian Law Assistant built using Retrieval-Augmented Generation (RAG). Grounds answers in Indian legal statutes and real case law to help users understand their rights through a conversational Streamlit interface.

IndianKanoon API LangChain RAG Streamlit GPT-4 Python 3.10+

RAG Voice Customer Support

An end-to-end voice-enabled customer support assistant that combines Retrieval-Augmented Generation with real-time speech interaction.

LangChain OpenAI ChromaDB Gradio Web Scraping

GPT-OSS 120B CritiqueLoop

An automated critique → improve → repeat pipeline that iteratively refines answers. Uses Groq for ultra-fast inference and an OpenAI-style OSS model, wrapped in a simple Gradio UI.

Groq API GPT-OSS-120B Critique Loop Gradio Python 3.10+

Work Experience

Software Development Engineer II – IBM

July 2021 – Jan 2024 | Kochi, India

Technologies: Python, AWS, Kubernetes, Docker, Jenkins, PostgreSQL, Golang

  • Led the migration of data analytics pipelines from ElasticSearch to OpenSearch, reducing operational costs by up to $3M annually and improving search performance.
  • Streamlined backend services for API management on IBM Cloud and AWS, optimizing traffic routing, security, and data processing.
  • Spearheaded a security optimization project for managing sensitive data, earning an IBM Quarterly Cash Award.

Full Stack Developer – Influenced

July 2024 – June 2025 | Remote, USA

Technologies: Next.js, React, TypeScript, Tailwind CSS, AWS Amplify, Dynamo DB

  • Developed an AI‑powered marketplace connecting brands with micro‑influencers, enabling authentic social proof and massive reach.
  • Implemented role‑based access and AI‑driven workflows for product listings, interest expressions, and collaboration management.

Software Engineer Intern – Informatica

Jan 2021 – July 2021 | Bangalore, India

Technologies: Python, ETL, Oracle DB, Tableau, Golang, AWS S3

  • Resolved over 100 customer support tickets for the Informatica ETL tool, providing technical guidance and troubleshooting.
  • Integrated the ETL tool with databases such as PostgreSQL, MySQL and AWS S3.
  • Led an intern showcase demonstrating how the ETL tool integrates with Tableau to transform and visualise a COVID dataset.

Skills

Programming

Python, Golang, Flask, REST APIs, SQL (PostgreSQL, MySQL), NoSQL, Redis, Kafka, Django

DevOps & Cloud

AWS, Docker, Kubernetes (K8s), ArgoCD, Jenkins, CI/CD pipelines, Terraform

Other

Git/GitHub, Windows, Linux, Data Structures & Algorithms

AI & ML

Cursor, NLP (transformer models), LLM (Claude, GPT), Deep Learning & ML (PyTorch, SGD, scikit‑learn), Gradio, LangChain, RAG