Shubhashis Roy Dipta
PhD Researcher ⎟ Multimodal (Text + Vision) Retrieval, Generation ⎟ NLP
Shubhashis Roy Dipta
sroydip1@umbc.edu
Affiliations
University of Maryland, Baltimore County
Ph.D. in CS
Spring 2021 - Present
Advisor: Dr. Frank Ferraro
Grade: 4.00/4.00
Publications: See Here (From 2023)
University of Maryland, Baltimore County
M.Sc. in CS
Spring 2021 - Spring 2023
Awards: Phi Kappa Phi
Grade: 4.00/4.00
Morgan State University
Volunteer Researcher (Remote)
2017 - 2019
Advisor: Dr. Iman Dehzangi
Publications: 4 Journals (Genes, Elsevier, IEEE Access, Springer)
Military Institute of Science & Technology
B.Sc. in CS
Spring 2013 - Fall 2017
Advisor: Dr. Wali Mohammad Abdullah
Grade: 3.51/4.00
Professional Services
Reviewer of COLING 2025 (2 Papers)
Reviewer of BMC Bioinformatics (July 2024)
Reviewer of Scientific Reports (July 2024)
Reviewer of SRW at NAACL 2024 (2 Papers)
Reviewer of SemEval-2024 (4 papers)
Reviewer of Scientific Reports, Nature (Jan 2024)
Reviewer of Plant Methods (Jan 2024)
Reviewer of W-NUT workshop at EACL 2024
Reviewer of Plant Methods (Dec 2023)
Reviewer of Computational and Structural Biotechnology Journal (Mar 2023)
Secondary Reviewer of *SEM 2023
Shubhashis is a Computer Science PhD Researcher under Dr. Frank Ferarro at the University of Maryland, Baltimore County (UMBC). His research combines Natural Language Processing (NLP) and Computer Vision (CV).
His current research focuses on outcome and intention based video-text retrieval. This research has applications in video/image retrieval system, especially where there is no text available for the video (e.g., most of the videos on the internet, social media, and surveillance videos).
He is also interested about multimodal event reasoning, understanding, and generation. His previous work, a hierarchical variational autoencoder for event representation learning, has applications in text summarization, question answering, and counterfactual reasoning (Published in *SEM 2023, ACL).
In Summer 2024, he interned at Scale.AI as a Machine Learning Researcher. He explored how RLHF can improve the text2SQL generation (currently under ARR review). He also worked on Many-Shot text2SQL and text2SQL AutoEval using SLM. He was mentored by Vijay Kalmath and managed by Dr. Adrian Lam.
Actively looking for Research Collaboration in Natural Language Processing or Multimodal (Language + Vision) Work. Please contact me if you want to collaborate.
Shubhashis has a strong background in ML programming, including PyTorch, 🤗 HuggingFace, NLTK, Spacy, Matplotlib, Seaborn, and more. He has excelled in machine learning competitions (Kaggle top-70 🥉) and coding competitions (ACM ICPC 8th out of 300+ teams) and more. He was the founder of UniShopr (2018-2022), a cross-border e-commerce for his home country (Bangladesh).
Research Interest
✓ (Outcome & Intention Based) Video-Text Retrieval
✓ Video/Image + Text to Text Generation
✓ Natural Language Understanding 📖
✓ Computer Vision 👀
Recent News (See All)
Dec 20, 2024 | Accepted a position as a Applied Research Scientist Intern at Amazon Science. Excited to join the team and work on cutting-edge research in NLP and ML. |
---|---|
Nov 16, 2024 | Attended EMNLP-2024 in Florida, Miami. Excited to meet researchers from all over the world. |
Nov 7, 2024 | Reviewed 2 papers in COLING 2025. |
Aug 16, 2024 | Finished my summer internship at Scale.AI as a MLR Intern. I have worked on text2SQL, RLHF, AutoEval, Many-Shot text2SQL, RAG. For more details, please read my linkedin post. |
Jul 22, 2024 | Reviewed a paper in BMC Bioinformatics |
Featured Publications
Check out Google Scholar for a full list of my publications.