Liang Zheng, Hengheng Zhang, Shaoyan Sun, Manmohan Chandraker et al.
This paper presents a novel large-scale dataset and comprehensive baselines for end-to-end pedestrian detection and person recognition in raw video frames. Our baselines address three issues: the performance of various combinations of detectors and recognizers, mechanisms for pedestrian detection to...
Fuchen Long, Ting Yao, Zhaofan Qiu, Xinmei Tian et al.
Temporally localizing actions in a video is a fundamental challenge in video understanding. Most existing approaches have often drawn inspiration from image object detection and extended the advances, e.g., SSD and Faster R-CNN, to produce temporal locations of an action in a 1D sequence. Neverthele...
Zhaofan Qiu, Ting Yao, Chong‐Wah Ngo, Xinmei Tian et al.
Convolutional Neural Networks (CNN) have been regarded as a powerful class of models for visual recognition problems. Nevertheless, the convolutional filters in these networks are local operations while ignoring the large-range dependency. Such drawback becomes even worse particularly for video reco...
Kai Su, Dongdong Yu, Zhenqi Xu, Xin Geng et al.
Multi-person pose estimation is an important but challenging problem in computer vision. Although current approaches have achieved significant progress by fusing the multi-scale feature maps, they pay little attention to enhancing the channel-wise and spatial information of the feature maps. In this...
Niluthpol Chowdhury Mithun, Nafi Ur Rashid, S. M. Mahbubur Rahman
Detection and classification of vehicles are two of the most challenging tasks of a video-based intelligent transportation system. Traditional detection and classification methods are computationally highly expensive and become unsuccessful in many cases such as occlusion among the vehicles and when...
Nazir Saleheen, Amin Ahsan Ali, Syed Monowar Hossain, Hillol Sarker et al.
Recent researches have demonstrated the feasibility of detecting smoking from wearable sensors, but their performance on real-life smoking lapse detection is unknown. In this paper, we propose a new model and evaluate its performance on 61 newly abstinent smokers for detecting a first lapse. We use ...
M. Shamim Kaiser, Khin Lwin, Mufti Mahmud, Donya Hajializadeh et al.
The recent expansion of pervasive computing technology has contributed with novel means to pursue human activities in urban space. The urban dynamics unveiled by these means generate an enormous amount of data. These data are mainly endowed by portable and radio-frequency devices, transportation sys...
Md. Sabbir Ejaz, Md. Rabiul Islam
Recognition from faces is a popular and significant technology in recent years. Face alterations and the presence of different masks make it too much challenging. In the real-world, when a person is uncooperative with the systems such as in video surveillance then masking is further common scenarios...
Muhammad Usama Islam, Hasan Mahmud, Faisal Bin Ashraf, Md. Iqbal Hossain et al.
Musculoskeletal disorder is increasing in humans due to accidents or aging which is a great concern for future world. Physical exercises can reduce this disorder. Yoga is a great medium of physical exercise. For doing yoga a trainer is important who can monitor the perfectness of different yoga pose...
Raihan Bin Islam, Samiha Akhter, Faria Iqbal, Md. Hasnaeen Rizvi Rahman et al.
Object detection, one of the most significant contributions of computer vision and machine learning, plays an immense role in identifying and locating objects in an image or a video. We recognize distinct objects and precisely get their information through object detection, such as their size, shape...
Shakil Ahmed Sumon, Mohammad Raihan Goni, Niyaz Bin Hashem, Md Tanzil Shahria et al.
In this paper, we have explored different strategies to find out the saliency of the features from different pretrained models in detecting violence in videos. A dataset has been created which consists of violent and non-violent videos of different settings. Three ImageNet models; VGG16, VGG19, ResN...
Zillur Rahman, Amit Mazumder Ami, Muhammad Ahsan Ullah
Wrong-way driving is one of the main causes of road accidents and traffic jam all over the world. By detecting wrong-way vehicles, the number of accidents can be minimized and traffic jam can be reduced. With the increasing popularity of real-time traffic management systems and due to the availabili...
Afsana Nowrin, Sharmin Afroz, Md. Sazzadur Rahman, Imtiaz Mahmud et al.
The outbreak of Coronavirus Disease 2019 (Covid-19) had an enormous impact on humanity. Till May 2021, almost 172 million people have been affected globally due to the contagious spread of Covid-19. Although the distribution of vaccines has been started, the worldwide mass distribution is yet to hap...
Md. Bahar Ullah
This paper describes CPU Based YOLO, a real time object detection model to run on Non-GPU computers that may facilitate the users of low configuration computer. There are a lot of well improved algorithms for object detection such as YOLO, Faster R-CNN, Fast R-CNN, R-CNN, Mask R-CNN, R-FCN, SSD, Ret...
Taqi Tahmid, Eklas Hossain
As the problem of urban traffic congestion intensifies, there is a pressing need for the introduction of advanced technology and equipment to improve the state-of-the-art of traffic control. The current methods used such as timers or human control are proved to be inferior to alleviate this crisis. ...