Detecting arbitrary keypoints on limbs and skis with sparse partly correct segmentation masks

Ludwig, Katja; Kienzle, Daniel; Lorenz, Julian; Lienhart, Rainer

Detecting arbitrary keypoints on limbs and skis with sparse partly correct segmentation masks

Katja Ludwig, Daniel Kienzle, Julian Lorenz, Rainer Lienhart

Analyses based on the body posture are crucial for top- class athletes in many sports disciplines. If at all, coaches label only the most important keypoints, since manual anno- tations are very costly. This paper proposes a method to de- tect arbitrary keypoints on the limbs and skis of professional ski jumpers that requires a few, only partly correct segmen- tation masks during training. Our model is based on the Vision Transformer architecture with a special design for the input tokens to query for the desired keypoints. Since we use segmentation masks only to generate ground truth labels for the freely selectable keypoints, partly correct seg- mentation masks are sufﬁcient for our training procedure. Hence, there is no need for costly hand-annotated segmen- tation masks. We analyze different training techniques for freely selected and standard keypoints, including pseudo la- bels, and show in our experiments that only a few partly cor- rect segmentation masks are sufﬁcient forAnalyses based on the body posture are crucial for top- class athletes in many sports disciplines. If at all, coaches label only the most important keypoints, since manual anno- tations are very costly. This paper proposes a method to de- tect arbitrary keypoints on the limbs and skis of professional ski jumpers that requires a few, only partly correct segmen- tation masks during training. Our model is based on the Vision Transformer architecture with a special design for the input tokens to query for the desired keypoints. Since we use segmentation masks only to generate ground truth labels for the freely selectable keypoints, partly correct seg- mentation masks are sufﬁcient for our training procedure. Hence, there is no need for costly hand-annotated segmen- tation masks. We analyze different training techniques for freely selected and standard keypoints, including pseudo la- bels, and show in our experiments that only a few partly cor- rect segmentation masks are sufﬁcient for learning to detect arbitrary keypoints on limbs and skis.…

Metadaten
Author:	Katja Ludwig GND, Daniel Kienzle GND, Julian Lorenz GND, Rainer Lienhart GND
URN:	urn:nbn:de:bvb:384-opus4-994525
Frontdoor URL	https://opus.bibliothek.uni-augsburg.de/opus4/99452
ISBN:	979-8-3503-2056-5OPAC
Parent Title (English):	IEEE/CVF Winter Conference on Applications of Computer Vision Workshops (WACVW), Jan. 3 2023 to Jan. 7 2023, Waikoloa, HI, USA
Publisher:	IEEE
Place of publication:	Piscataway, NJ
Type:	Conference Proceeding
Language:	English
Year of first Publication:	2023
Publishing Institution:	Universität Augsburg
Release Date:	2022/11/18
First Page:	1
Last Page:	10
DOI:	https://doi.org/10.1109/WACVW58289.2023.00051
Institutes:	Fakultät für Angewandte Informatik
	Fakultät für Angewandte Informatik / Institut für Informatik
	Fakultät für Angewandte Informatik / Institut für Informatik / Lehrstuhl für Maschinelles Lernen und Maschinelles Sehen
Dewey Decimal Classification:	0 Informatik, Informationswissenschaft, allgemeine Werke / 00 Informatik, Wissen, Systeme / 004 Datenverarbeitung; Informatik
Licence (German):	Deutsches Urheberrecht

Open Access

Detecting arbitrary keypoints on limbs and skis with sparse partly correct segmentation masks

Download full text files

Export metadata

Statistics

Additional Services