Shubhashis Roy Dipta

PhD Researcher
Multimodal (Text + Vision) Generation, Retrieval
NLP




Scale.AI
Machine Learning Research Intern
Summer 2024
Manager: Dr. Adrian Lam
Mentor: Vijay Kalmath
University of Maryland, Baltimore County
Ph.D. in CS
Spring 2021 - Present
Advisor: Dr. Frank Ferraro
Grade: 4.00/4.00
Publications: See Here (From 2023)
University of Maryland, Baltimore County
M.Sc. in CS
Spring 2021 - Spring 2023
Awards: Phi Kappa Phi
Grade: 4.00/4.00
Morgan State University
Volunteer Researcher (Remote)
2017 - 2019
Advisor: Dr. Iman Dehzangi
Publications: 4 Journals (Genes, Elsevier, IEEE Access, Springer)
2017 - 2021
Military Institute of Science & Technology
B.Sc. in CS
Spring 2013 - Fall 2017
Advisor: Dr. Wali Mohammad Abdullah
Grade: 3.51/4.00

Professional Services

Reviewer of BMC Bioinformatics (July 2024)
Reviewer of Scientific Reports (July 2024)
Reviewer of SRW at NAACL 2024 (2 Papers)
Reviewer of SemEval-2024 (4 papers)
Reviewer of Scientific Reports, Nature (Jan 2024)
Reviewer of Plant Methods (Jan 2024)
Reviewer of W-NUT workshop at EACL 2024
Reviewer of Plant Methods (Dec 2023)
Reviewer of Computational and Structural Biotechnology Journal (Mar 2023)
Secondary Reviewer of *SEM 2023

Shubhashis is a Computer Science PhD Researcher under Dr. Frank Ferraro at the University of Maryland, Baltimore County (UMBC). His research combines Natural Language Processing (NLP) and Computer Vision (CV).

His current research focuses on outcome- and intention-based video-text retrieval. This research has applications in video/image retrieval systems, especially where no text is available for the video (e.g., most videos on the internet, social media videos, and surveillance videos).

He is also interested in multimodal event reasoning, understanding, and generation. His previous work, a hierarchical variational autoencoder for event representation learning, has applications in text summarization, question answering, and counterfactual reasoning (published at *SEM 2023, ACL).

In Summer 2024, he interned at Scale.AI as a Machine Learning Researcher. He explored how RLHF can improve text2SQL generation (currently under ARR review). He also worked on Many-Shot text2SQL and text2SQL AutoEval using SLM. He was mentored by Vijay Kalmath and managed by Dr. Adrian Lam.

I am actively looking for research collaborations in Natural Language Processing or multimodal (language + vision) work. Please contact me if you would like to collaborate.

Shubhashis has a strong background in ML programming, including PyTorch, 🤗 HuggingFace, NLTK, spaCy, Matplotlib, and Seaborn. He has excelled in machine learning competitions (Kaggle top-70 🥉) and coding competitions (ACM ICPC, 8th out of 300+ teams). He was the founder of UniShopr (2018-2022), a cross-border e-commerce platform for his home country (Bangladesh).

Research Interests

        ✓ (Outcome & Intention Based) Video-Text Retrieval
        ✓ Video/Image + Text to Text Generation
        ✓ Natural Language Understanding 📖
        ✓ Computer Vision 👀

Recent News

Aug 16, 2024 Finished my summer internship at Scale.AI as an MLR Intern. I worked on text2SQL, RLHF, AutoEval, Many-Shot text2SQL, and RAG. For more details, please read my LinkedIn post.
Jul 22, 2024 Reviewed a paper for BMC Bioinformatics.
Jul 21, 2024 Reviewed another paper for Scientific Reports, a reputable journal from Nature.
May 28, 2024 Started my summer internship at Scale.AI (San Francisco, CA) as a Machine Learning Research Intern. I will be working on the text2SQL team.
Apr 28, 2024 Published a list of Research Helpers that I use in my research. I will keep updating the list as I find new tools. If you have any suggestions, please let me know.

WRITE ✍️  on Machine Learning, NLP, Vision, Multimodal AI

Featured Publications

Check out Google Scholar for a full list of my publications.

  1. HU at SemEval-2024 Task 8A: Can Contrastive Learning Learn Embeddings to Detect Machine-Generated Text?
    Shubhashis Roy Dipta and Sadat Shahriar
    SemEval 2024, 2024
  2. Semantically-informed Hierarchical Event Modeling
    Shubhashis Roy Dipta, Mehdi Rezaee, and Francis Ferraro
    In Proceedings of the 12th Joint Conference on Lexical and Computational Semantics (*SEM 2023), Jul 2023
  3. SEMal: Accurate protein malonylation site predictor using structural and evolutionary information
    Shubhashis Roy Dipta, Ghazaleh Taherzadeh, Md Wakil Ahmad, Md Easin Arafat, Swakkhar Shatabda, and Abdollah Dehzangi
    Computers in Biology and Medicine, Jul 2020