Devansh Bansal

Software Development Engineer

01. Education

International Institute of Information Technology, Hyderabad

2024 — 2029

B.Tech Computer Science & Engineering + MS by Research  |  CGPA: 8.19

Relevant: Data Structures & Algorithms • Operating Systems • Computer Networks • DBMS • Distributed Systems • Linear Algebra • Theory of Computation

02. Technical Skills

Languages

C C++ Python SQL JavaScript x86-64 Assembly

Systems

Linux / Bash RISC-V CUDA Pthreads Sockets

Backend & Data

Flask Node.js MongoDB PostgreSQL Docker Git

ML / NLP

scikit-learn OpenAI Whisper GPT-4 API NetworkX pandas NumPy

03. Projects

NLP Pipeline for Educational Content Analysis

Python • OpenAI Whisper • GPT-4 • NetworkX • yt-dlp • Pyvis

  • Architected modular 3-phase pipeline: audio ingestion (yt-dlp + ffmpeg normalization to 16kHz mono WAV), multilingual speech-to-text (Whisper large-v3 with CUDA/CPU auto-detection), and LLM-based structured concept extraction
  • Built robust LLM output parser handling code-mixed Hindi-English transcripts with concept aliasing, ID sanitization (regex normalization + collision detection), and flexible JSON extraction from varied response formats
  • Implemented DAG validation engine: confidence-ranked greedy edge insertion maintaining acyclicity, topological sort for pedagogical flow ordering, and interactive Pyvis graph visualization with physics simulation
  • Designed CLI with configurable Whisper models, mock-LLM dry-run mode, fail-fast debugging, per-video error tracking with UTC timestamps, and JSON Schema validation for all structured outputs

xv6 OS Kernel Enhancements

C • RISC-V • Kernel Development

  • Extended the xv6 kernel (RISC-V) with custom CPU scheduling policies and additional system calls; developed kernel-level tests to evaluate scheduling fairness and context switch overhead

Distributed File System

C • Sockets • Pthreads

  • Built multi-threaded networked file system enabling concurrent multi-user access with strict reader-writer locking mechanisms, socket-based IPC, and fault-tolerant connection handling

C-Shell Implementation

C • Unix Syscalls • Process Control

  • Implemented a custom Unix shell supporting piping, I/O redirection, background process execution, job control, and robust signal handling (SIGINT, SIGTSTP, SIGCHLD) using low-level system calls

PokéNetDB — Distributed Relational Database

SQL • Python • DBMS

  • Designed relational schema modeling complex datasets with constraints, triggers, stored procedures, and optimized analytical queries for multi-table joins and aggregation workloads

Max Flow Algorithm Analysis

C++ • Graph Theory • Algorithms

  • Implemented and benchmarked Ford–Fulkerson, Dinic's, and Push–Relabel algorithms; profiled performance on dense, sparse, and adversarial graph inputs to validate theoretical complexity bounds

ML-Driven Equity Trading Strategy

Python • LightGBM • CatBoost • scikit-learn

  • Built quantitative trading pipeline with 22-feature cross-sectional model and rolling Ridge + LightGBM + CatBoost ensemble achieving 410% cumulative return vs. 320% benchmark over 100 equities
  • Implemented point-in-time backtesting with transaction cost modeling, inverse-volatility position sizing, and grid-search Sharpe ratio optimization