A Semantic Web-Based Approach for Bat Trajectory Reconstruction With Human Keypoint Information

A Semantic Web-Based Approach for Bat Trajectory Reconstruction With Human Keypoint Information

Zechen Jin, Yida Zheng, Jun Liu, Yang Yu
Copyright: © 2024 |Pages: 22
DOI: 10.4018/IJSWIS.338999
Article PDF Download
Open access articles are freely available for download

Abstract

Restoring the trajectory of a bat from a table tennis match video is critical in analyzing a table tennis technique and conducting statistical analysis. However, directly bat location detection in each frame is challenging due to changing shapes caused by varying movement directions and speeds, leading to ambiguity. This paper develops a novel two-stage method. The first stage utilizes YOLO for bat detection in each frame, followed by filtering out erroneous candidate boxes. In the second stage, the authors use a temporal prediction model that integrating human keypoint information and interpolation to reconstruct a complete bat trajectory with minimal errors. The method's effectiveness and performance are evaluated on our video datasets. The evaluation results demonstrate that the proposed method outperforms traditional methods on precision performance metrics. The error screening algorithm improves precision score to nearly 1. In addition, the method has the recall score 22.3% higher than YOLO 's and also 1.4% higher than that of YOLO with cubic spline interpolation.
Article Preview
Top

Optical Flow Method

The optical flow method exploits the temporal variations of pixels in an image sequence and the correlation between consecutive frames to calculate the motion information of objects between adjacent frames, which is based on the correspondence between the previous frame and the current frame. Optical flow can be used to estimate and analyze the motion of objects in a sequence, with the existing methods divided into traditional and deep learning algorithms. Lucas et al. (1981) proposed the Lucas-Kanade sparse optical flow algorithm (Bruhn et al., 2005), which exploits brightness constancy, temporal persistence, and spatial consistency. Bouguet introduced an improved Lucas-Kanade algorithm (Bouguet et al., 2001) based on pyramid hierarchies, overcoming the issues of tracking fast-moving objects and affine transformations. Another traditional approach is the dense optical flow, such as Farnebäck’s method (Farnebäck, 2003), which approximates the neighborhood information of each pixel using polynomials and calculates the displacement for all points in the image. However, the trade-off between the accuracy and speed of this method limits its practical application. Deep learning (Behera et al., 2023; Li et al., 2022; Tembhurne et al., 2022; Zhou, 2022) has yielded promising results in optical flow estimation in recent years. For instance, FlowNets and RAFT (Boyer et al., 2009; Dosovitskiy et al., 2015; Ilg et al., 2017) utilized convolutional neural networks to predict optical flow for each pixel in the image and achieved significant advancements in real-time estimation algorithms. However, for the bats in table tennis training or competition videos, their positions constantly change during the motion, causing rapid transformations and resulting in much motion blur. Therefore, extracting features from such scenarios is too challenging.

Complete Article List

Search this Journal:
Reset
Volume 20: 1 Issue (2024)
Volume 19: 1 Issue (2023)
Volume 18: 4 Issues (2022): 2 Released, 2 Forthcoming
Volume 17: 4 Issues (2021)
Volume 16: 4 Issues (2020)
Volume 15: 4 Issues (2019)
Volume 14: 4 Issues (2018)
Volume 13: 4 Issues (2017)
Volume 12: 4 Issues (2016)
Volume 11: 4 Issues (2015)
Volume 10: 4 Issues (2014)
Volume 9: 4 Issues (2013)
Volume 8: 4 Issues (2012)
Volume 7: 4 Issues (2011)
Volume 6: 4 Issues (2010)
Volume 5: 4 Issues (2009)
Volume 4: 4 Issues (2008)
Volume 3: 4 Issues (2007)
Volume 2: 4 Issues (2006)
Volume 1: 4 Issues (2005)
View Complete Journal Contents Listing