Results for “"Md Farhan Ishmam"”

9 results

From image to language: A critical analysis of Visual Question Answering (VQA) approaches, challenges, and opportunities

Verified

Md Farhan Ishmam, Md Sakib Hossain Shovon, M. F. Mridha, Nilanjan Dey

Journal: Information FusionYear: 2024Citations: 81

The multimodal task of Visual Question Answering (VQA) encompassing elements of Computer Vision (CV) and Natural Language Processing (NLP), aims to generate answers to questions on any visual input. Over time, the scope of VQA has expanded from datasets focusing on an extensive collection of natural...

Results for “"Md Farhan Ishmam"”

From image to language: A critical analysis of Visual Question Answering (VQA) approaches, challenges, and opportunities

Visual Robustness Benchmark for Visual Question Answering (VQA)

ChitroJera: A Regionally Relevant Visual Question Answering Dataset for Bangla

A Smart Approach for Early Detection of DDoS Attacks: Artificial Neural Network and Random Forest Hybridization

From Image to Language: A Critical Analysis of Visual Question Answering (VQA) Approaches, Challenges, and Opportunities

BanglaProtha: Evaluating Vision Language Models in Underrepresented Long-tail Cultural Contexts

Anti-obesity therapeutics potential of plant genetic resources of Bangladesh and their conservation at Bangladesh Agricultural University Botanical Garden

R-MMA: Enhancing Vision-Language Models with Recurrent Adapters for Few-Shot and Cross-Domain Generalization

Enhancing Vision Language Corruption Robustness using Cross-Distribution & Prompted Denoisers