Machine Learning Researcher

Building stable & interpretable AI systems for real-world applications

Specializing in computer vision, NLP, and generative AI with focus on accessibility and healthcare.

Manikanta Srinivasula - Professional Headshot

About Me

I'm a Machine Learning Researcher focused on building stable and interpretable AI systems that perform reliably with noisy real-world data. My approach emphasizes multi-stage pipelines that separate detection, alignment, refinement, and decoding instead of relying on black-box end-to-end models.

My research spans computer vision, natural language processing, and generative AI, with particular interest in accessibility technology and healthcare applications. I've contributed to multiple research projects resulting in manuscripts currently under review in computer vision and NLP domains.

With hands-on experience across the full ML pipeline from data collection and annotation to model training, evaluation, and deployment I bring both technical depth and practical execution capabilities to research projects.

Coding workspace aesthetic

Research Focus

  • Multi-stage Vision Pipelines
  • Weak Supervision
  • Evolutionary & Adversarial Optimization
  • Medical & Accessibility AI
  • Multilingual NLP

Technical Skills

Machine Learning

  • Multi-stage Learning Pipelines
  • Weak Supervision
  • Evolutionary Optimization
  • Generative Adversarial Networks (GANs)
  • Transformer-based Models
  • Object Detection & Segmentation

Frameworks & Tools

  • PyTorch
  • TensorFlow
  • HuggingFace
  • OpenCV
  • Weights & Biases
  • YOLO (v5-v11)

Research Areas

  • Computer Vision
  • Natural Language Processing
  • Generative AI
  • Healthcare AI
  • Accessibility Technology
  • ML Systems

Featured Projects

Explainable Optical Braille Recognition

Multi-stage pipeline for Braille recognition using YOLOv11, PCA deskewing, and attention-based CNN with transformer correction. Achieved 9.7% CER and eliminated silent decoding failures.

Python PyTorch YOLOv11
Manuscript Under Review View Details

Structure-Preserving Handwriting Synthesis

Hybrid evolutionary-adversarial model combining YOLO detection, PSO/CMA-ES refinement, and dual-GAN architecture for realistic handwriting generation.

Python TensorFlow GANs
Manuscript Under Review View Details

IIoT Network Traffic Classification

Advanced Intrusion Detection System for Industrial IoT using LSTM, TCN-Attention, and CNN-LSTM models with flow-based splitting. Achieved 99.99% accuracy.

Python TensorFlow LSTM
Completed View Details
View All Projects