Tech Stack

Experience

My professional journey.

Undergraduate Researcher

2024 - Present
  • Collaborated with e6data to design and prototype a pipeline that ports parts of their Java query engine to C++, using FlatBuffers for zero-copy serialization, Java's Foreign Function & Memory (FFM) API for native interop, and JMH for rigorous JVM-vs-native micro-benchmarks.
  • Currently building 'SLMs on Edge', a system-level profiler for Small Language Models running on edge devices that captures per-phase metrics (prefill, decode, KV-cache usage, etc.) across CPU, GPU and memory.
  • Working alongside PhD researchers on systems for ML, with a focus on reproducible benchmarking and low-level performance analysis.
C++ · Java · Python · Linux · Docker · Git
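
The per-phase measurement idea behind 'SLMs on Edge' can be pictured roughly as follows. This is only a minimal sketch, not the project's code: `PhaseProfiler`, the phase names, and the `time.sleep` calls standing in for model work are all hypothetical, and it records wall-clock time only rather than CPU, GPU or memory counters.

```python
import time
from collections import defaultdict
from contextlib import contextmanager


class PhaseProfiler:
    """Hypothetical helper: accumulates wall-clock time per named phase."""

    def __init__(self):
        self.totals = defaultdict(float)  # phase name -> total seconds
        self.counts = defaultdict(int)    # phase name -> number of calls

    @contextmanager
    def phase(self, name):
        start = time.perf_counter()
        try:
            yield
        finally:
            # Record elapsed time even if the wrapped code raises.
            self.totals[name] += time.perf_counter() - start
            self.counts[name] += 1

    def report(self):
        # Mean seconds per call for each recorded phase.
        return {name: self.totals[name] / self.counts[name] for name in self.totals}


profiler = PhaseProfiler()

# Stand-in workload (assumption): sleeps replace real prefill/decode steps.
with profiler.phase("prefill"):
    time.sleep(0.01)
for _ in range(3):
    with profiler.phase("decode"):
        time.sleep(0.002)

for name, mean_s in profiler.report().items():
    print(f"{name}: {mean_s * 1000:.2f} ms per call")
```

Keeping the timing inside a context manager means each phase is measured the same way and the call sites stay short, which helps when comparing runs across devices.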

Full Stack Developer

2024
  • Designed and shipped the company's official marketing website end-to-end — from information architecture and visual identity to a responsive, SEO-friendly production build.
  • Built dedicated product and category sections, an enquiry/contact flow wired to the client's preferred channel, and a content layout the team can update without touching code.
  • Tuned performance, accessibility and Core Web Vitals so the site loads fast and ranks well, establishing the company's first real digital presence.
Next.js · React · TypeScript · Tailwind · Node.js · Vercel

Chief Coordinator

2024 - Present
  • Lead the technical wing of the society — owning end-to-end delivery of official fest websites, fest mobile apps, and external client projects taken on by the team.
  • Drive architecture decisions, sprint planning, code reviews and release management across multiple simultaneous web/mobile projects.
  • Mentor juniors on the full stack — from Next.js/React frontends to Node backends, deployments and on-call basics — and recruit/onboard new members into the technical team.
Next.js · React · TypeScript · Node.js · Express · MongoDB · PostgreSQL · Tailwind · AWS

Projects

A few things I've shipped recently — clean descriptions, no fluff. Click through to the code on GitHub.

Client Project · Marketing Site

Dhanlakshmi Biochem — Company Website

Designed and built the official marketing website for Dhanlakshmi Biochem end-to-end — a clean, responsive React + Vite site that establishes the company's online presence and showcases their product range to B2B customers.

  • Modern, responsive UI built with React + Vite and modular components
  • Dedicated product / category sections with a content layout the team can update without touching code
  • Enquiry / contact flow wired to the client's preferred channel
  • Optimised for performance, accessibility and SEO so the site loads fast and ranks well

AI · Full-Stack · Hackathon

Devfest 2026 — AI Networking Platform

AI-powered networking platform that finds the best people to collaborate with on GitHub and LinkedIn, given your resume, technical skills, and networking goals. Upload a resume, answer five quick questions, and the system returns ranked, personalised matches with reasons.

  • Resume OCR with EasyOCR + PyMuPDF to extract skills and experience from PDF / image resumes
  • Keyword extraction and match scoring using Ollama locally and Llama 3.1 70B hosted on Groq in the cloud
  • GitHub matcher analyses repos, tech stack and activity; LinkedIn search powered by a Playwright-based scraper
  • Six-service architecture (Next.js frontend, FastAPI services, Ollama) deployed as a single Docker Compose stack

ML · Speech · Hackathon

Arabic Native vs Non-Native Speech Classifier

End-to-end deep-learning pipeline that classifies Arabic speech as native vs non-native using Wav2Vec2 prosody embeddings. Built for Hackenza 2026 and deployed live on Hugging Face Spaces.

  • Uses facebook/wav2vec2-lv-60-espeak-cv-ft, combining temporal statistics from mid layers (4 / 6 / 8 / 10) with pitch features into prosody-focused 2051-d embeddings
  • Three model heads — MLP baseline, MLP with SE-Attention, and a LightGBM gradient-boosted ensemble
  • Validation-based threshold sweep to handle class imbalance (70 / 30 split); best ensemble hits ~85% accuracy / 0.82 F1
  • Automated pipeline: download → chunk → embed → train → evaluate, with clean script-by-script stages
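
The validation-based threshold sweep mentioned above can be sketched as follows. This is an illustration under assumptions, not the project's code: the function names, the toy labels and probabilities, and the choice of F1 as the metric being maximised are all hypothetical.

```python
def f1_score(y_true, y_pred):
    """F1 for the positive class (label 1), computed from raw counts."""
    tp = sum(1 for t, p in zip(y_true, y_pred) if t == 1 and p == 1)
    fp = sum(1 for t, p in zip(y_true, y_pred) if t == 0 and p == 1)
    fn = sum(1 for t, p in zip(y_true, y_pred) if t == 1 and p == 0)
    if tp == 0:
        return 0.0
    precision = tp / (tp + fp)
    recall = tp / (tp + fn)
    return 2 * precision * recall / (precision + recall)


def sweep_threshold(y_true, probs, steps=99):
    """Try evenly spaced thresholds in (0, 1) and keep the best F1."""
    best_t, best_f1 = 0.5, -1.0
    for i in range(1, steps + 1):
        t = i / (steps + 1)
        preds = [1 if p >= t else 0 for p in probs]
        score = f1_score(y_true, preds)
        if score > best_f1:  # strict: keep the lowest threshold on ties
            best_t, best_f1 = t, score
    return best_t, best_f1


# Toy validation set (made up): label 1 is the minority class.
val_labels = [0, 0, 0, 0, 0, 0, 0, 1, 1, 1]
val_probs = [0.05, 0.10, 0.15, 0.20, 0.22, 0.30, 0.41, 0.35, 0.45, 0.60]

threshold, f1 = sweep_threshold(val_labels, val_probs)
print(f"best threshold = {threshold:.2f}, validation F1 = {f1:.3f}")
```

The threshold is picked on validation data only and then held fixed for the test set, so the reported test metrics are not tuned on the data they describe.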

Browser Extension · Python Backend

Content Summariser — NotebookLM PDF Extension

A Chrome (MV3) extension + FastAPI backend that turns any PDF you have open in a browser tab into a Report, Presentation, Podcast, Quiz, or interactive chat — all powered by NotebookLM, with per-user authentication that requires zero server-side credential management.

  • Per-user auth model: the extension reads each user's Google cookies via chrome.cookies and forwards them as an X-NLM-Auth header — no shared session token on the server
  • Backend creates a per-user NotebookLM client cached for one hour, so the developer deploys once and never has to refresh session tokens
  • Generates a Report (PDF), Presentation (PPTX), Podcast (MP4) and Quiz (PDF), and supports chat with the PDF plus Google Docs export
  • FastAPI service containerised with Docker; deploy guide for Render included in the repo
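
The one-hour, per-user client cache described above might look roughly like this. It is a sketch under assumptions rather than the extension's backend: `ClientCache` and `FakeClient` are hypothetical names, `FakeClient` stands in for whatever NotebookLM client the backend really builds, and keying the cache by a hash of the forwarded header is an illustrative choice.

```python
import hashlib
import time

CLIENT_TTL_SECONDS = 3600  # one hour, matching the description above


class FakeClient:
    """Placeholder for a per-user client; not a real NotebookLM API."""

    def __init__(self, auth_header):
        self.auth_header = auth_header


class ClientCache:
    """Hypothetical cache: one client per distinct auth header, with a TTL."""

    def __init__(self, factory, ttl=CLIENT_TTL_SECONDS, clock=time.monotonic):
        self._factory = factory
        self._ttl = ttl
        self._clock = clock
        self._entries = {}  # hashed header -> (expires_at, client)

    def get(self, auth_header):
        # Hash the header so raw cookie values are not used as dictionary keys.
        key = hashlib.sha256(auth_header.encode("utf-8")).hexdigest()
        now = self._clock()
        entry = self._entries.get(key)
        if entry is not None and entry[0] > now:
            return entry[1]  # still fresh: reuse the cached client
        client = self._factory(auth_header)
        self._entries[key] = (now + self._ttl, client)
        return client


cache = ClientCache(FakeClient)
first = cache.get("cookies-for-user-a")
second = cache.get("cookies-for-user-a")
other = cache.get("cookies-for-user-b")
print(first is second, first is other)
```

Passing the clock in as a parameter keeps the expiry logic testable without waiting an hour; a real service would probably also evict expired entries periodically so the dictionary does not grow without bound.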

Contact Form

Please contact me directly at utkarsh22sharma1(at)gmail.com or drop your info here.

I'll never share your data with anyone else. Pinky promise!