Hello, I'm Sooraj
I like building things.
I'm a generalist software developer with an avid interest in backend technologies. Opportunities where I get to crack previously unknown problems and design solutions for them expecially excite me. I have high ownership of my tasks and have a proven track record of delivering all tasks I take up. I am also a strong advocate for open-source software. Currently, I am looking for full time opportunites where I can explore some new technologies and contribute meaningfully.
About Me
I am currently a final year student at NIT Kurukshetra. Presently, I am working in the GovTech domain for over an year with Samagra Governance, building Digital Public Goods (DPGs) and population scale apps for the citizens of India. I have had my projects deployed in Uttar Pradesh, Odisha, Rajasthan. Along with that, my work is being used in the Kumbh Mela 2025
- ➔ TypeScript / JavaScript
- ➔ Python
- ➔ C/C++
- ➔ NestJS
- ➔ Django Rest Framework
- ➔ FastAPI
- ➔ ReactJS
- ➔ Docker
Experience
Samagra Governance
Software Development Intern
Dec. 2023 – Present (Remote)
- Set up a pipeline to handle structured and unstructured data storage.
- Optimized data retrieval for serving approximately 5k requests per second in a RAG pipeline.
- Owned three complete services, ensuring timely delivery of new features and bug fixes for the Bharat Sah’Al’yak instances in Odisha, Rajasthan, and Uttar Pradesh.
Avalon Information Systems
Web Development Intern
June 2023 – July 2023 (Remote)
- Understood the working of web servers, authentication, and REST APIs.
- Set up and configured basic CakePHP projects for new microservices.
Projects
Autotune - Dataset Builder and NLP Model Fine-Tuner
Mar 2024 - Present
- Synthetic data generation at blazing fast speeds of 60s for a 1000 record dataset on a single-core CPU.
- Fine-tuning and training of classifier models for NLP tasks.
- Used in the Nipun program of Uttar Pradesh to generate fresh questions for student assessment.
- Tools Used: Django Rest Framework, Celery, Gevent, Python, PostgreSQL.
Stencil-CLI - Command Line Interface for Custom File Generation
Oct 2023 - Present
- Developed a CLI tool to generate files for an opinionated NestJS framework.
- Used Abstract Syntax Trees (ASTs) to analyze and manipulate code programmatically for accurate file generation.
- Created custom schematics for data flow and creation.
- Tools Used: ASTs, Compilers, TypeScript.
Dataset Service - Dynamic PostgreSQL Data Store
- Created a dynamic datastore supporting all PostgreSQL data types.
- Implemented vector search (dense and sparse) with performance at ~20ms per search.
- Enabled fuzzy search using Trigram and Levenshtein distances for all columns.
- Simplified data transfer between instances using a JSON-based configuration.
- Tools Used: NestJS, PostgreSQL, MinIO.
PDF Parser and Document Service
- Extracted semantic information like headings and subheadings from PDFs.
- Parsed text without requiring the language to be specified.
- Achieved ~30s processing time on a 2.5GB VRAM GPU for a 50-page document.
- Implemented a scalable interface to store and query documents using vector and fuzzy searches.
- Tools Used: NestJS, PostgreSQL, BullMQ, MinIO.
Scholar Link - Identification and Verification System
SIH 2023
- Full Stack Developer in a 6-member team project for Smart India Hackathon 2023.
- Implemented a system for creating and validating verifiable credentials.
- Associated each certificate with a unique decentralized identifier (DID).
- Tools Used: TypeScript, NextJS, NestJS, PostgreSQL, Docker.