Shubhashis Roy Dipta

PhD ResearcherMultimodal (Text + Vision) GenerationNLP


Shubhashis Roy Dipta


University of Maryland, Baltimore County
Ph.D. in CS
Spring 2021 - Present
Advisor: Dr. Frank Ferraro
Grade: 4.00/4.00
Publications: See Here (From 2023)
University of Maryland, Baltimore County
M.Sc. in CS
Spring 2021 - Spring 2023
Awards: Phi Kappa Phi
Grade: 4.00/4.00
Morgan State University
Remote Research Work
2017 - 2019
Advisor: Dr. Iman Dehzangi
Publications: 4 Journals (Genes, Elsevier, IEEE Access, Springer)
2017 - 2021
Military Institute of Science & Technology
B.Sc. in CS
Spring 2013 - Fall 2017
Advisor: Dr. Wali Mohammad Abdullah
Grade: 3.51/4.00

Professional Services

Reviewer of SRW at NAACL 2024 (2 Papers)
Reviewer of SemEval-2024 (4 papers)
Reviewer of Scientific Reports, Nature (Jan 2024)
Reviewer of Plant Methods (Jan 2024)
Reviewer of W-NUT workshop at EACL 2024
Reviewer of Plant Methods (Dec 2023)
Reviewer of Computational and Structural Biotechnology Journal (Mar 2023)
Secondary Reviewer of *SEM 2023

I am a Computer Science PhD Researcher under Dr. Frank Ferarro at the University of Maryland, Baltimore County (UMBC). My research combines Natural Language Processing (NLP) and Computer Vision (CV).

My primary research focuses on multimodal event reasoning, understanding, and generation. My previous work, a hierarchical variational autoencoder for event representation learning, has applications in text summarization, question answering, and counterfactual reasoning (Published in *SEM 2023, ACL).

Actively looking for Research Collaboration in Natural Language Processing or Multimodal (Language + Vision) Work. I am greatly enthusiastic about participating in shared tasks. Please contact me if you want to collaborate.

I have a strong background in ML programming, including PyTorch, 🤗 HuggingFace, NLTK, Spacy, Matplotlib, Seaborn, and more. I’ve excelled in machine learning competitions (Kaggle top-70 🥉) and coding competitions (ACM ICPC 8th out of 300+ teams) and more. I was the founder of UniShopr, a cross-border e-commerce for my home country (Bangladesh).

Research Interest

        ✓ Multimodal Generation (Language 📖 + Vision 👀)
        ✓ Natural Language Understanding 📖
        ✓ Computer Vision 👀

Recent News

Apr 28, 2024 Published a list of Research Helpers that I use in my research. I will keep updating the list as I find new tools. If you have any suggestions, please let me know.
Mar 19, 2024 Our paper “HU at SemEval-2024 Task 8A: Can Contrastive Learning Learn Embeddings to Detect Machine-Generated Text?” got published in SemEval 2024 (co-located with NAACL 2023). The code is available here.
Mar 19, 2024 Our paper “UMBCLU at SemEval-2024 Task 1A and 1C: Semantic Textual Relatedness with and without machine translation” got published in SemEval 2024 (co-located with NAACL 2023). The code is available here.
Feb 29, 2024 Published a (awesome) list for academic ML competitions, AKA, Shared Task, Challenge. Contributions are welcome!!
Feb 17, 2024 Published a chat-bot with GPT4Vision using OPENAI API for our language understanding lab. As vision support is still not available in official playground, this demo will help our lab members to explore the vision capability of GPT4. Anyone can use the code to deploy their own vision enabled chat-bot by changing the API key.

WRITE ✍️  on Machine Learning, NLP, Vision, Multimodal AI

Featured Publications

Check out Google Scholar for a full list of my publications.

  1. HU at SemEval-2024 Task 8A: Can Contrastive Learning Learn Embeddings to Detect Machine-Generated Text?
    Shubhashis Roy Dipta, and Sadat Shahriar
    SemEval 2024, 2024
  2. Semantically-informed Hierarchical Event Modeling
    Shubhashis Roy DiptaMehdi Rezaee, and Francis Ferraro
    In Proceedings of the 12th Joint Conference on Lexical and Computational Semantics (*SEM 2023), Jul 2023
  3. SEMal: Accurate protein malonylation site predictor using structural and evolutionary information
    Shubhashis Roy DiptaGhazaleh Taherzadeh, Md Wakil Ahmad, Md Easin Arafat, Swakkhar Shatabda, and Abdollah Dehzangi
    Computers in biology and medicine, Jul 2020