Shubhashis Roy Dipta

sroydip1@umbc.edu

Resume CV Google Scholar

Affiliations

Amazon Science

Applied Research Scientist Intern

Summer 2025

Scale.AI

Machine Learning Research Intern

Summer 2024

University of Maryland, Baltimore County

Ph.D. in CS

Spring 2021 - Present

Grade: 4.00/4.00

Publications: See Here (From 2023)

University of Maryland, Baltimore County

M.Sc. in CS

Spring 2021 - Spring 2023

Grade: 4.00/4.00

Morgan State University

Volunteer Researcher (Remote)

2017 - 2019

Publications: 4 Journals (Genes, Elsevier, IEEE Access, Springer)

UniShopr.com

Founder

2017 - 2021

Military Institute of Science & Technology

B.Sc. in CS

Spring 2013 - Fall 2017

Advisor: Dr. Wali Mohammad Abdullah

Grade: 3.51/4.00

Professional Services

Reviewer of ELVM workshop at CVPR 2025

Reviewer of ACL 2025 (6 papers)

Reviewer of W-NUT workshop at NAACL 2025

Reviewer of TrustNLP Workshop at NAACL 2025 (2 Papers)

Reviewer of NAACL Industry Track 2025 (1 Paper)

Reviewer of COLING 2025 (2 Papers)

Reviewer of BMC Bioinformatics (July 2024)

Reviewer of Scientific Reports (July 2024)

Reviewer of SRW at NAACL 2024 (2 Papers)

Reviewer of SemEval-2024 (4 papers)

Reviewer of Scientific Reports, Nature (Jan 2024)

Reviewer of Plant Methods (Jan 2024)

Reviewer of W-NUT workshop at EACL 2024

Reviewer of Plant Methods (Dec 2023)

Reviewer of Computational and Structural Biotechnology Journal (Mar 2023)

Secondary Reviewer of *SEM 2023

Shubhashis is a Computer Science PhD Researcher under Dr. Frank Ferraro at the University of Maryland, Baltimore County (UMBC). His research combines Natural Language Processing (NLP) and Computer Vision (CV).

His research broadly focuses on Decomposition-based Reasoning using text or vision data. Currently, he is working on question-based decomposition to assess the scientific feasibility of a given task.

Over the years, he has worked on Outcome and Intention based Decomposition and how it can be used for video-text retrieval. This research has applications in video/image retrieval systems, especially where no text metadata is available for the video (e.g., most videos on the internet, social media, and surveillance footage). His previous work, a hierarchical variational autoencoder for Event Representation Learning, has applications in text summarization, question answering, and counterfactual reasoning (published in *SEM 2023, ACL).

In Summer 2025, he is interning at Amazon Science as an Applied Scientist. He is researching the optimization of Alexa conversation latency, cost, and performance. He is mentored by Dr. Daniel Bis and managed by Dr. Lichao Wang.

In Summer 2024, he interned at Scale.AI as a Machine Learning Researcher. He explored how RLHF can improve text2SQL generation (currently under ARR review). He also worked on Many-Shot text2SQL and text2SQL AutoEval using SLM. He was mentored by Vijay Kalmath and managed by Dr. Adrian Lam.

If you are interested in collaborating, please email me with a short description of your research interest.

Shubhashis has a strong background in ML programming, including PyTorch, 🤗 HuggingFace, NLTK, Spacy, Matplotlib, Seaborn, and more. He has excelled in machine learning competitions (Kaggle top-70 🥉) and coding competitions (ACM ICPC 8th out of 300+ teams). He was the founder of UniShopr (2018-2022), a cross-border e-commerce platform for his home country (Bangladesh).

Research Interest

        ✓ Decomposition-based Reasoning using Multimodal Data
        ✓ (Outcome & Intention Based) Video-Text Retrieval
        ✓ Natural Language Understanding

Recent News

Jun 14, 2025	🥳 New Pre-print Alert: Q2E: Query-to-Event Decomposition for Zero-Shot Multilingual Text-to-Video Retrieval
May 19, 2025	Started my summer internship at Amazon in the Alexa Conversational AI team. I am working on optimizing Alexa conversation latency, cost, and performance.
May 16, 2025	Served as a judge for the National Round of the Bangladesh AI Olympiad 2025. You can solve the problem here.
Jan 23, 2025	My summer internship project at Scale.AI has been featured on the Scale.AI blog.
Dec 20, 2024	Accepted a position as an Applied Research Scientist Intern at Amazon Science. Excited to join the team and work on cutting-edge research in NLP and ML.

Featured Publications

Check out Google Scholar for a full list of my publications.

arXiv

Q2E: Query-to-Event Decomposition for Zero-Shot Multilingual Text-to-Video Retrieval

Shubhashis Roy Dipta, and Francis Ferraro

2025


@misc{dipta2025q2equerytoeventdecompositionzeroshot,
  title = {Q2E: Query-to-Event Decomposition for Zero-Shot Multilingual Text-to-Video Retrieval},
  author = {Dipta, Shubhashis Roy and Ferraro, Francis},
  year = {2025},
  publisher = {arXiv},
  archiveprefix = {arXiv},
  primaryclass = {cs.CL},
  url = {https://arxiv.org/abs/2506.10202},
  dimensions = {true},
}

NAACL
HU at SemEval-2024 Task 8A: Can Contrastive Learning Learn Embeddings to Detect Machine-Generated Text?

Shubhashis Roy Dipta, and Sadat Shahriar

SemEval 2024, 2024


This paper describes our system developed for SemEval-2024 Task 8, "Multigenerator, Multidomain, and Multilingual Black-Box Machine-Generated Text Detection." Machine-generated texts have been one of the main concerns due to the use of large language models (LLM) in fake text generation, phishing, cheating in exams, or even plagiarizing copyright materials. A lot of systems have been developed to detect machine-generated text. Nonetheless, the majority of these systems rely on the text-generating model, a limitation that is impractical in real-world scenarios, as it’s often impossible to know which specific model the user has used for text generation. In this work, we propose a single model based on contrastive learning, which uses 40% of the baseline’s parameters (149M vs. 355M) but shows a comparable performance on the test dataset (21st out of 137 participants). Our key finding is that even without an ensemble of multiple models, a single base model can have comparable performance with the help of data augmentation and contrastive learning.
@article{dipta2024hu,
  title = {HU at SemEval-2024 Task 8A: Can Contrastive Learning Learn Embeddings to Detect Machine-Generated Text?},
  author = {Dipta, Shubhashis Roy and Shahriar, Sadat},
  journal = {SemEval 2024},
  year = {2024},
  url = {https://arxiv.org/abs/2402.11815},
  dimensions = {true},
}

ACL
Semantically-informed Hierarchical Event Modeling

Shubhashis Roy Dipta, Mehdi Rezaee, and Francis Ferraro

In Proceedings of the 12th Joint Conference on Lexical and Computational Semantics (*SEM 2023), Jul 2023


Prior work has shown that coupling sequential latent variable models with semantic ontological knowledge can improve the representational capabilities of event modeling approaches. In this work, we present a novel, doubly hierarchical, semi-supervised event modeling framework that provides structural hierarchy while also accounting for ontological hierarchy. Our approach consists of multiple layers of structured latent variables, where each successive layer compresses and abstracts the previous layers. We guide this compression through the injection of structured ontological knowledge that is defined at the type level of events: importantly, our model allows for partial injection of semantic knowledge and it does not depend on observing instances at any particular level of the semantic ontology. Across two different datasets and four different evaluation metrics, we demonstrate that our approach is able to out-perform the previous state-of-the-art approaches by up to 8.5%, demonstrating the benefits of structured and semantic hierarchical knowledge for event modeling.
@inproceedings{roy-dipta-etal-2023-semantically,
  title = {Semantically-informed Hierarchical Event Modeling},
  author = {Roy Dipta, Shubhashis and Rezaee, Mehdi and Ferraro, Francis},
  booktitle = {Proceedings of the 12th Joint Conference on Lexical and Computational Semantics (*SEM 2023)},
  month = jul,
  year = {2023},
  address = {Toronto, Canada},
  publisher = {Association for Computational Linguistics},
  url = {https://aclanthology.org/2023.starsem-1.31},
  doi = {10.18653/v1/2023.starsem-1.31},
  pages = {353--369},
  dimensions = {true}
}

Shubhashis Roy Dipta

Affiliations

Amazon Science

Applied Research Scientist Intern

Summer 2025

Manager: Dr. Lichao Wang

Mentor: Dr. Daniel Bis

Scale.AI

Machine Learning Research Intern

Summer 2024

Manager: Dr. Adrian Lam

Mentor: Vijay Kalmath

University of Maryland, Baltimore County

Ph.D. in CS

Spring 2021 - Present

Advisor: Dr. Frank Ferraro

Grade: 4.00/4.00

Publications: See Here (From 2023)

University of Maryland, Baltimore County

M.Sc. in CS

Spring 2021 - Spring 2023

Awards: Phi Kappa Phi

Grade: 4.00/4.00

Morgan State University

Volunteer Researcher (Remote)

2017 - 2019

Advisor: Dr. Iman Dehzangi

Publications: 4 Journals (Genes, Elsevier, IEEE Access, Springer)

UniShopr.com

Founder

2017 - 2021

Military Institute of Science & Technology

B.Sc. in CS

Spring 2013 - Fall 2017

Advisor: Dr. Wali Mohammad Abdullah

Grade: 3.51/4.00

I WRITE ✍️ on Machine Learning, NLP, Vision, Multimodal AI
