AI Engineer @ Armada AI

Bridging Computer Vision & LLMs in Production Systems

I've trained CNNs to see and transformers to reason. Now I build systems where both work together - from diffusion pipelines shipping photorealistic product imagery at Avataar AI, to agentic AI pipelines at Armada AI. IISc Bangalore alumnus, GATE AIR 221.

GATE AIR 221 · 2+ Years ML · IISc M.Tech AI
01 About
Rohit Kumar

From Signals to Neural Networks

My path to AI started in electrical engineering - clearing GATE (AIR 221) and BARC, then choosing research over a government job. That decision led me to IISc, where I published on continual learning (WACV 2025) and discovered my calling: building systems where vision and language work together.

Since then I have shipped production AI: diffusion pipelines at Avataar AI and, now, agentic systems at Armada AI. The boundary between what machines see and what they understand is blurring. I build at that edge.

02 Work Experience

AI Engineer

Armada AI
Jun 2025 - Present
Trivandrum, India
  • Architecting end-to-end agentic systems using LangGraph for multi-step reasoning
  • Building RAG pipelines with Qdrant vector database for semantic retrieval
  • Developing production APIs with FastAPI and Chainlit interfaces
  • Containerizing ML workflows with Docker and PostgreSQL backends
LangGraph · FastAPI · Chainlit · Qdrant · Docker · PostgreSQL · RAG

Research Engineer

Avataar AI
Jul 2024 - Apr 2025
Bangalore, India
  • Built an end-to-end lifestyle image generation pipeline using the FLUX model and ControlNets
  • Modified diffusion sampling for improved object reconstruction with intrinsic decomposition
  • Developed classification systems using CLIP, BLIP-2, and Qwen2.5 for low-data scenarios
  • Enhanced segmentation accuracy with BiRefNet and SAM + YOLO-World integration
Flux · ControlNet · Diffusion · CLIP · SAM · YOLO

Teaching Assistant

IISc Bangalore - Signal Processing
Jan 2024 - Apr 2024
Bangalore, India
  • Integrated continual learning frameworks (L2P, DualPrompt) to mitigate catastrophic forgetting
  • Built self-supervised models using MoCo and SimCLR for visual representation learning
  • Developed adaptive prompt-based learning with dynamic token expansion
L2P · DualPrompt · MoCo · SimCLR · PyTorch

Teaching Assistant

IISc Bangalore - Digital Image Processing
Aug 2023 - Dec 2023
Bangalore, India
  • Developed DFT-based frequency domain filtering for image denoising and enhancement
  • Implemented SIFT and Normalized Cut for feature detection and segmentation
  • Optimized deep learning models using EfficientNet-B0 with custom classifiers
DFT · SIFT · EfficientNet · OpenCV · NumPy
03 Projects

Virtual Try-On

Three-stage deep learning virtual try-on pipeline combining Florence2 and IDM-VTON models for automated garment transfer.

Florence2 · IDM-VTON · FLUX · Diffusion

Cricket-Shot Predictor

LSTM-based video classification on CLIP embeddings, plus a fine-tuned VideoMAE model that reached 66% accuracy on cricket shot classification.

LSTM · CLIP · VideoMAE · HuggingFace
04 Achievements
AIR 221
GATE EE 2022
Score: 803 | Marks: 73/100
AIR 227
GATE IN 2022
Score: 670 | Marks: 67.33/100
AIR 1683
GATE EE 2021
Score: 634 | Marks: 55.33/100
Rank 1
BCECE LE 2018
State Lateral Entry Exam
05 Certifications
Agents Course
HuggingFace
AI Agents Development
GenAI Hackathon
Participant
Generative AI
miniCON AI Infra
Marktechpost
AI Infrastructure
OpenCV Bootcamp
OpenCV University
Computer Vision
06 Publications

TACLE: Task and Class-aware Exemplar-free Semi-supervised Class Incremental Learning

Jayateja Kalla*, Rohit Kumar*, Soma Biswas

WACV 2025

07 Education

M.Tech in Artificial Intelligence

Indian Institute of Science (IISc), Bangalore

2022 - 2024 · CGPA: 8.0/10.0

B.Tech in Electrical Engineering

Bhagalpur College of Engineering, Bhagalpur

2018 - 2021 · CGPA: 8.75/10.0

Diploma in Electrical Engineering

Government Polytechnic Muzaffarpur, Muzaffarpur

2015 - 2018 · 77.73%

Secondary School (10th)

Bihar School Examination Board · Utkramit M S Parmanandpur

2015 · 60%

08 Coursework

ML Foundations

  • Linear Algebra
  • Stochastic Models and Applications
  • Pattern Recognition and Neural Networks
  • Computational Methods of Optimization
  • Game Theory

Computer Vision

  • Digital Image Processing
  • Advanced Image Processing
  • Computer Vision
  • Digital Video Perception and Algorithms

Language & LLMs

  • Introduction to NLP
  • Deep Learning for NLP
  • LLMs for Practical NLP
09 Skills

AI/ML

Diffusion Models · LLMs · RAG · Agentic AI · Computer Vision · NLP

Frameworks

PyTorch · LangGraph · LangChain · HuggingFace · FastAPI

Infrastructure

Docker · AWS · Azure · PostgreSQL · Qdrant · SLURM

Languages

Python · SQL
10 Volunteering
Organizing Committee
EE Summer School 2023
IISc Bangalore · July 2023
11 Contact

Let's Build Something Together

I'm open to collaboration on AI/ML projects, research opportunities, or just a chat about generative models and agentic systems.

sahil15rohit88@gmail.com