BORR: Bangladesh Open Research Repository

Results for "Md Farhan Ishmam"

9 results

From image to language: A critical analysis of Visual Question Answering (VQA) approaches, challenges, and opportunities

Verified

Md Farhan Ishmam, Md Sakib Hossain Shovon, M. F. Mridha, Nilanjan Dey

Journal: Information Fusion | Year: 2024 | Citations: 81

The multimodal task of Visual Question Answering (VQA), encompassing elements of Computer Vision (CV) and Natural Language Processing (NLP), aims to generate answers to questions on any visual input. Over time, the scope of VQA has expanded from datasets focusing on an extensive collection of natural...

Physical Sciences > Computer Science > Computer Vision and Pattern Recognition

Visual Robustness Benchmark for Visual Question Answering (VQA)

Verified

Md Farhan Ishmam, Ishmam Tashdeed, Talukder Asir Saadat, Md. Hamjajul Ashmafee et al.

Year: 2025 | Citations: 6

Can Visual Question Answering (VQA) systems maintain their performance when deployed in the real world? Or are they susceptible to realistic corruption effects, e.g., image blur, which can be detrimental in sensitive applications such as medical VQA? While linguistic robustness has been thoroughly e...

Physical Sciences > Computer Science > Computer Vision and Pattern Recognition

ChitroJera: A Regionally Relevant Visual Question Answering Dataset for Bangla

Verified

Deeparghya Dutta Barua, Md Sakib Ul Rahman Sourove, Md Fahim, Fabiha Haider et al.

Journal: Lecture Notes in Computer Science | Year: 2025 | Citations: 3

Physical Sciences > Computer Science > Computer Vision and Pattern Recognition

A Smart Approach for Early Detection of DDoS Attacks: Artificial Neural Network and Random Forest Hybridization

Verified

Ishmam Ahmed Ongshu, Ahmed Wasif Reza, Md. Emad Uddin Aksir, Md Aftab Alam et al.

Journal: Procedia Computer Science | Year: 2025 | Citations: 2

Advances in networking technology have made Distributed Denial of Service (DDoS) attacks a real danger to today’s networks. Using logical reasoning, the network flow circumstances may be classified as an attack or a routine state to mimic DDoS detection. This research builds an Artificial Intelligen...

Physical Sciences > Computer Science > Computer Networks and Communications | Open Access

From Image to Language: A Critical Analysis of Visual Question Answering (VQA) Approaches, Challenges, and Opportunities

Verified

Md Farhan Ishmam, Md Sakib Hossain Shovon, M. F. Mridha, Nilanjan Dey

Journal: arXiv (Cornell University) | Year: 2023 | Citations: 2

The multimodal task of Visual Question Answering (VQA), encompassing elements of Computer Vision (CV) and Natural Language Processing (NLP), aims to generate answers to questions on any visual input. Over time, the scope of VQA has expanded from datasets focusing on an extensive collection of natural...

Physical Sciences > Computer Science > Computer Vision and Pattern Recognition | Open Access

BanglaProtha: Evaluating Vision Language Models in Underrepresented Long-tail Cultural Contexts

Verified

Md Fahim, Md Sakib Ul Rahman, Akm Moshiur Rahman, Md Farhan Ishmam et al.

Year: 2026 | Citations: 1

The advanced multimodal processing of current vision language models (VLMs) has prompted rigorous benchmarking across multicultural settings, revealing a clear inclination toward Western culture. While the bias likely stems from the predominance of Western-centric images in the VLM pretraining data,...

Social Sciences > Cultural Studies > Language and cultural evolution

Anti-obesity therapeutics potential of plant genetic resources of Bangladesh and their conservation at Bangladesh Agricultural University Botanical Garden

Verified

A K M Sarwar, Md Riyadh Arefin, M Farhan Ishmam, M Ashrafuzzaman

Year: 2026

Obesity, a global health issue affecting 650 million people, leads to chronic diseases and health impairments. Anti-obesity drugs are expensive and may cause side effects, raising significant concerns. One hundred eighty-eight medicinal plant species from 157 genera and 62 families in Bangladesh exh...

Health Sciences > Medicine > Pharmacology | Open Access

R-MMA: Enhancing Vision-Language Models with Recurrent Adapters for Few-Shot and Cross-Domain Generalization

Verified

Md Fahim, Md Farhan Ishmam, Mir Sazzat Hossain, M Ashraful Amin et al.

Year: 2026

Pre-trained vision-language models (VLMs) such as CLIP exhibit strong generalization but struggle with few-shot adaptation due to the trade-off between gaining task-specific knowledge and preserving general performance. While multimodal adapters add trainable modules that improve alignment and excel...

Physical Sciences > Computer Science > Computer Vision and Pattern Recognition

Enhancing Vision Language Corruption Robustness using Cross-Distribution & Prompted Denoisers

Verified

Sameer Shafayet Latif, Sadab Shiper, K. M. Rahiduzzaman Kiran, Md Farhan Ishmam et al.

Year: 2026

While the current generation of Vision Language Models (VLMs) has excelled in ideal conditions, their performance drops significantly when exposed to realistic multimodal corruptions, such as blurry images and grammatically incorrect text. Our work addresses this by establishing a novel multimodal c...

Physical Sciences > Computer Science > Artificial Intelligence