Devansh Bansal
Software Development Engineer
01. Education
International Institute of Information Technology, Hyderabad
2024 — 2029B.Tech Computer Science & Engineering + MS by Research | CGPA: 8.19
Relevant: Data Structures & Algorithms • Operating Systems • Computer Networks • DBMS • Distributed Systems • Linear Algebra • Theory of Computation
02. Technical Skills
Languages
Systems
Backend & Data
ML / NLP
03. Projects
Python • OpenAI Whisper • GPT-4 • NetworkX • yt-dlp • Pyvis
- Architected modular 3-phase pipeline: audio ingestion (yt-dlp + ffmpeg normalization to 16kHz mono WAV), multilingual speech-to-text (Whisper large-v3 with CUDA/CPU auto-detection), and LLM-based structured concept extraction
- Built robust LLM output parser handling code-mixed Hindi-English transcripts with concept aliasing, ID sanitization (regex normalization + collision detection), and flexible JSON extraction from varied response formats
- Implemented DAG validation engine: confidence-ranked greedy edge insertion maintaining acyclicity, topological sort for pedagogical flow ordering, and interactive Pyvis graph visualization with physics simulation
- Designed CLI with configurable Whisper models, mock-LLM dry-run mode, fail-fast debugging, per-video error tracking with UTC timestamps, and JSON Schema validation for all structured outputs
xv6 OS Kernel Enhancements
C • RISC-V • Kernel Development
- Extended the xv6 kernel (RISC-V) with custom CPU scheduling policies and additional system calls; developed kernel-level tests to evaluate scheduling fairness and context switch overhead
Distributed File System
C • Sockets • Pthreads
- Built multi-threaded networked file system enabling concurrent multi-user access with strict reader-writer locking mechanisms, socket-based IPC, and fault-tolerant connection handling
C-Shell Implementation
C • Unix Syscalls • Process Control
- Implemented a custom Unix shell supporting piping, I/O redirection, background process execution, job control, and robust signal handling (SIGINT, SIGTSTP, SIGCHLD) using low-level system calls
PokéNetDB — Distributed Relational Database
SQL • Python • DBMS
- Designed relational schema modeling complex datasets with constraints, triggers, stored procedures, and optimized analytical queries for multi-table joins and aggregation workloads
Max Flow Algorithm Analysis
C++ • Graph Theory • Algorithms
- Implemented and benchmarked Ford–Fulkerson, Dinic's, and Push–Relabel algorithms; profiled performance on dense, sparse, and adversarial graph inputs to validate theoretical complexity bounds
Python • LightGBM • CatBoost • scikit-learn
- Built quantitative trading pipeline with 22-feature cross-sectional model and rolling Ridge + LightGBM + CatBoost ensemble achieving 410% cumulative return vs. 320% benchmark over 100 equities
- Implemented point-in-time backtesting with transaction cost modeling, inverse-volatility position sizing, and grid-search Sharpe ratio optimization