Skip to Content

Hi, my name is

Sejal Barshikar.

I build Intelligent systems with Deep Learning.

I'm a Master's student in Computer Science at Northeastern University, specializing in AI/ML with a focus on Natural Language Processing and Computer Vision. I build end-to-end deep learning systems and have published research in code summarization and serverless computing.

About Me

Hello! I'm Sejal, and I'm passionate about building intelligent systems that solve real-world problems. My journey into AI began during my undergraduate studies in India, where I developed a strong foundation in machine learning algorithms and deep learning architectures.

I'm currently pursuing my Master's in Computer Science at Northeastern University, specializing in AI and Machine Learning. I've worked on several projects such as Research Paper Classification using Fine Tuning BERT, Video Content Analyzer and Neural Machine Translation System. I also gained hands-on industry experience as a Data Science Intern at AICTE, where I built ML pipelines processing millions of transaction records. My main focus these days is Deep Learning for Natural Language Processing and Computer Vision.

Tools and Technologies

  • Python
  • PyTorch
  • TensorFlow
  • Keras
  • Hugging Face
  • OpenCV
  • Scikit-Learn
  • C++
  • JavaScript
  • SQL
  • Git
  • MongoDB
Headshot

Education

Master of Science, Computer Science
Aug 2025 - Dec 2027
Northeastern University
GPA: 3.83/4.0
Relevant Courses: Algorithms (CS 5800), Database Management Systems (CS 5200) , Deep Learning(CS 7150), Programming Design Paradigms(CS 5010)
Bachelor of Engineering, Artificial Intelligence and Data Science
Aug 2021 - May 2025
Savitribai Phule Pune University
GPA: 3.46/4
Relevant Courses: Machine Learning, Pattern Recognition, Computer Vision, Data Structures, Discrete Mathematics, Statistics, Probability, Linear Algebra

Experience

Data Science Intern @ AICTE

December 2024 - May 2025

  • Built an end-to-end ETL pipeline processing 5M+ retail transaction records using optimized Pandas workflows reducing data preprocessing time by 60%
  • Engineered RFM behavioral features (recency, frequency, monetary) and applied K-Means clustering evaluated across 10+ initializations via Silhouette analysis, identifying 5 distinct customer segments
  • Collaborated with a team of 10+ analysts and engineers to define segmentation criteria, validate cluster profiles, and align model outputs with targeted marketing objectives
  • Deployed segmentation inference via Flask REST API achieving sub-100ms latency for real-time usage
  • Drove 20% increase in conversion rates through segment specific behavioral profiling and personalized recommendations

Projects

Other Projects

view the archive
  • Folder

    Neural Machine Translation System

    Reimplemented ”Sequence to Sequence Learning with Neural Networks” paper using TensorFlow on English-French WMT14 dataset with 40,000 sentence pairs

    Designed 4-layer LSTM encoder-decoder architecture using Bahdanau attention mechanism to handle variable-length se- quences up to 50 tokens and 512-dimensional hidden state representations

    Optimized training pipeline implementing gradient clipping, teacher forcing with decay schedule, and adaptive learning rate to reduce the convergence time by 30% and prevent gradient explosion

    • PyTorch
    • LSTM
    • Sequence to Sequence
    • NLP
  • Folder

    LifeLink - Organ Transplant Management System

    Comprehensive database-driven organ transplant management system handling the complete donation lifecycle. Designed 18 normalized (3NF) tables with automated matching algorithms using weighted scoring (Blood: 30%, HLA: 25%, Wait Time: 20%). Implemented 6 automated triggers, 4 stored procedures, and 4 custom functions for real-time priority-based waitlist management and organ allocation with viability tracking.

    • MySQL
    • Database Design
    • SQL
    • Python
    • Algorithms
  • Folder

    CIFAR-10 Image Classification

    Custom CNN architecture with 1.15M parameters achieving 64.62% test accuracy. Implemented batch normalization, dropout regularization, and optimized training pipeline with data augmentation techniques.

    • PyTorch
    • CNNs
    • Python
    • Data Augmentation
  • Folder

    Character-Level Text Generation with LSTM

    Stacked LSTM model with 512 hidden units per layer for Shakespeare-style text generation. Trained on 1.1M character sequences with temperature sampling for controllable generation.

    • PyTorch
    • LSTM
    • NLP
    • Python
  • Folder

    Plant Disease Classification using CNN

    Built a CNN model to classify plant leaf images into disease categories using the PlantVillage dataset (54,000+ images). Achieved 97% training accuracy and 87% validation accuracy with a custom architecture featuring data augmentation and transfer learning techniques. Developed a predictive system for real-time disease detection to support agricultural productivity.

    • TensorFlow
    • Keras
    • CNNs
    • Computer Vision
    • Python
  • Folder

    Number to Words Translation using Seq2Seq

    Implemented a sequence-to-sequence model with stacked LSTM encoder-decoder architecture to translate numbers into their word equivalents. Built custom synthetic dataset with text preprocessing and tokenization pipeline. Trained with teacher forcing for 150 epochs, demonstrating how encoder-decoder models learn structured sequence translations without attention mechanisms.

    • TensorFlow
    • Keras
    • LSTM
    • Seq2Seq
    • NLP

Research & Publications

Advanced Retrieval-Based Code Summarization using Meta Learning
Mukt Shabd Journal
Apr 2025
Applied MAML-based meta-learning to retrieval-augmented code summarization across 108K Python code-summary pairs, demonstrating cross-domain generalization with two stage coarse-to-fine retrieval. Applied transfer learning and data augmentation techniques on CodeSearchNet dataset achieving 12% improvement in BLEU-4 scores over baseline Seq2Seq models
Serverless Computing and Its Impact on Application Development
International Journal of Technology Engineering Arts Mathematics Science
Jul 2024
Analyzed serverless architectures (AWS, Lambda and Google Cloud Functions) across scalability, cost efficiency, and developer productivity demonstrating 40% cost reduction and 3x faster deployment cycles. Explored the role of serverless architectures in modern cloud application development, covering theoretical and practical aspects including methodology, advantages, challenges, application design patterns, existing models, and security considerations.

What’s Next?

Get In Touch

I'm currently seeking Summer 2026 ML Engineering internships where I can apply my skills in deep learning, NLP, and computer vision to build impactful AI systems. Whether you have an opportunity, a question, or just want to connect, feel free to reach out!