BadminSense: Enabling Fine-Grained Badminton Stroke Evaluation on a Single Smartwatch

cs.HC cs.AI Taizhou Chen, Kai Chen, Xingyu Liu, Pingchuan Ke, Zhida Sun · Mar 23, 2026
Local to this browser
What it does
BadminSense is a smartwatch-based system for fine-grained badminton stroke evaluation that aims to provide amateur players with professional-quality coaching feedback without requiring expensive external equipment. The system uses a single...
Why it matters
The system uses a single commercial smartwatch on the dominant wrist to segment and classify four stroke types, predict stroke quality on a 5-point Likert scale, and estimate shuttle impact location on the racket string area. The key...
Main concern
BadminSense presents a well-motivated and technically sound contribution to sports HCI, achieving promising offline results: 91. 43% stroke classification accuracy, 0.
Community signal
0
0 up · 0 down
Sign in to vote with arrows
AI Review AI reviewed
Plain-language introduction

BadminSense is a smartwatch-based system for fine-grained badminton stroke evaluation that aims to provide amateur players with professional-quality coaching feedback without requiring expensive external equipment. The system uses a single commercial smartwatch on the dominant wrist to segment and classify four stroke types, predict stroke quality on a 5-point Likert scale, and estimate shuttle impact location on the racket string area. The key innovation is enabling fine-grained quality assessment beyond simple activity recognition, targeting the gap between basic fitness tracking and professional coaching.

Critical review
Verdict
Bottom line

BadminSense presents a well-motivated and technically sound contribution to sports HCI, achieving promising offline results: 91.43% stroke classification accuracy, 0.438 MAE for quality rating (5-point scale), and 12.9% average impact location error. However, the work is limited by a small dataset (12 participants, 848 strokes), lack of longitudinal validation, and several unverified claims about generalizability. The formative methodology is solid, but the evaluation lacks quantitative validation against expert ground truth during real gameplay.

“Our evaluations show that BadminSense achieves a stroke classification accuracy of 91.43%, an average quality rating error of 0.438, and an average impact location estimation error of 12.9%”
BadminSense paper · Abstract and Section 5.3, 5.4, 5.5
What holds up

The formative exploration with badminton players is rigorous and well-documented, yielding actionable design requirements (DR1-DR4) and implementation insights. The ink-marking methodology for impact location ground truth is clever and reproducible. The two-step stroke segmentation using IMU+acoustic fusion achieves 99.41% accuracy with 0.23% false positive rate. The choice of stroke-specific evaluation strategies (I1) is technically justified and shows thoughtful feature engineering. The dataset is open-sourced, supporting reproducibility.

“The results show that our approach achieves an average stroke segmentation accuracy of 99.41% with a false positive rate of 0.23%”
BadminSense paper · Section 4.2 and 5.2.2
“We present a formative exploration that identifies requirements and guidelines for designing a badminton skill analysis and training system for a smartwatch”
BadminSense paper · Section 3.4 and 1
Main concerns

Several limitations undermine the paper's claims. First, the 'user-independent' evaluation uses leave-3-user-out cross-validation (only 9 train/3 test), which is insufficient for robust generalization claims with $N=12$ total participants. Second, the comparison to Silent Impact (Park et al., UIST 2024) is misleading: BadminSense claims 91.43% accuracy versus Silent Impact's 88.2%, but Silent Impact uses the PASSIVE arm (non-racket arm), which is a significantly harder sensing problem than BadminSense's dominant-arm placement. The paper also lacks validation that the 0.438 rating error (on a 1-5 scale) is perceptually meaningful - does a 0.5-point difference matter to learners? Finally, the rule-based feedback generation (I3) was not evaluated against actual coaching advice quality.

“we adopt a leave-3-user-out training strategy, where we randomly pick the data from 9 users for training and test on the remaining 3 users”
BadminSense paper · Section 5.3.2
“We propose Silent Impact, a novel and user-friendly system that analyzes tennis shots using a sensor placed on the passive arm...achieving a classification accuracy of 88.2%”
Evidence and comparison

The empirical evidence supports basic feasibility but has gaps. The ICC reliability scores for ratings (0.790-0.865) establish reasonable inter-rater agreement. However, the paper overstates novelty: wrist-worn stroke classification was demonstrated by Anand et al. (2017) and Ganser et al. (2021) prior to this work. The quality rating task is novel but lacks a validated baseline - what do 'good' vs 'poor' strokes look like in sensor space? The 12.9% impact location error is reported on normalized coordinates, but without knowing the physical racket dimensions, this is hard to interpret. The comparison to racket-mounted sensors (SmartDampener) is fair but the comparison to Silent Impact is inappropriate due to different anatomical placement.

“The results indicated good reliability for fold 1 (ICC = 0.824, 95% Confidence Interval = [0.79, 0.85]), fold 2 (ICC = 0.865, 95% Confidence Interval = [0.84, 0.89]), and fold 3 (ICC = 0.790, 95% Confidence Interval = [0.75, 0.82])”
BadminSense paper · Section 4.3.2
“Ganser et al. classified 5 tennis stroke types through a data-driven approach using a wrist-worn IMU sensor”
BadminSense paper · Section 2.2
Reproducibility

The paper demonstrates good open science practices: the dataset (848 strokes with labels) is publicly available at https://github.com/taizhouchen/BadminSense_Dataset. However, critical implementation details are missing: exact SVM hyperparameters for classification, the specific SVR kernel and $C$ values for quality rating, and threshold values ($21$ for peak detection is mentioned but not justified). The 'heartbeat-based network synchronization' for temporal alignment is described but not validated. The system requires a Windows server with Python/Scikit-learn - no docker/containerization is provided. Perhaps most critically, the real-time latency is reported as 'approximately one minute' for post-session analysis, but no per-stroke processing latency is given, making real-time feasibility assessment impossible. Left-handed player support is explicitly excluded, limiting generalizability.

“After finishing the session, he taps the stop button on the smartwatch to trigger the data upload and analysis process. Approximately one minute later, the smartwatch sends a notification indicating that the analysis is completed”
BadminSense paper · Section 5.7
“Currently, the development and evaluation of BadminSense only involve right-handed players”
BadminSense paper · Section 8.3
“we empirically set the window size to 2000ms and squared the sensor values before applying a local maximum threshold, which was empirically set to 21”
BadminSense paper · Section 5.2.1
Abstract

Evaluating badminton performance often requires expert coaching, which is rarely accessible for amateur players. We present adminSense, a smartwatch-based system for fine-grained badminton performance analysis using wearable sensing. Through interviews with experienced badminton players, we identified four system design requirements with three implementation insights that guide the development of BadminSense. We then collected a badminton strokes dataset on 12 experienced badminton amateurs and annotated it with fine-grained labels, including stroke type, expert-assessed stroke rating, and shuttle impact location. Built on this dataset, BadminSense segments and classifies strokes, predicts stroke quality, and estimates shuttle impact location using vibration signal from an off-the-shelf smartwatch. Our evaluations show that

Challenge the Review

Pick a starting point or write your own. Challenges run in the background, so you can keep reading while the AI investigates.

No challenges yet. Disagree with the review? Ask the AI to revisit a specific claim.