TY  - JOUR
A1  - Loh, Frank
A1  - Poignée, Fabian
A1  - Wamser, Florian
A1  - Leidinger, Ferdinand
A1  - Hoßfeld, Tobias
T1  - Uplink vs. Downlink: Machine Learning-Based Quality Prediction for HTTP Adaptive Video Streaming
JF  - Sensors
N2  - Streaming video is responsible for the bulk of Internet traffic these days. For this reason, Internet providers and network operators try to make predictions and assessments about the streaming quality for an end user. Current monitoring solutions are based on a variety of different machine learning approaches. The challenge for providers and operators nowadays is that existing approaches require large amounts of data. In this work, the most relevant quality of experience metrics, i.e., the initial playback delay, the video streaming quality, video quality changes, and video rebuffering events, are examined using a voluminous data set of more than 13,000 YouTube video streaming runs that were collected with the native YouTube mobile app. Three Machine Learning models are developed and compared to estimate playback behavior based on uplink request information. The main focus has been on developing a lightweight approach using as few features and as little data as possible, while maintaining state-of-the-art performance.
KW  - HTTP adaptive video streaming
KW  - quality of experience prediction
KW  - machine learning
Y1  - 2021
U6  - http://nbn-resolving.de/urn/resolver.pl?urn:nbn:de:bvb:20-opus-241121
SN  - 1424-8220
VL  - 21
IS  - 12
ER  - 
TY  - JOUR
A1  - Schlör, Daniel
A1  - Ring, Markus
A1  - Hotho, Andreas
T1  - iNALU: Improved Neural Arithmetic Logic Unit
JF  - Frontiers in Artificial Intelligence
N2  - Neural networks have to capture mathematical relationships in order to learn various tasks. They approximate these relations implicitly and therefore often do not generalize well. The recently proposed Neural Arithmetic Logic Unit (NALU) is a novel neural architecture which is able to explicitly represent the mathematical relationships by the units of the network to learn operations such as summation, subtraction or multiplication. Although NALUs have been shown to perform well on various downstream tasks, an in-depth analysis reveals practical shortcomings by design, such as the inability to multiply or divide negative input values or training stability issues for deeper networks. We address these issues and propose an improved model architecture. We evaluate our model empirically in various settings from learning basic arithmetic operations to more complex functions. Our experiments indicate that our model solves stability issues and outperforms the original NALU model in means of arithmetic precision and convergence.
KW  - neural networks
KW  - machine learning
KW  - arithmetic calculations
KW  - neural architecture
KW  - experimental evaluation
Y1  - 2020
U6  - http://nbn-resolving.de/urn/resolver.pl?urn:nbn:de:bvb:20-opus-212301
SN  - 2624-8212
VL  - 3
ER  - 
TY  - JOUR
A1  - Krenzer, Adrian
A1  - Makowski, Kevin
A1  - Hekalo, Amar
A1  - Fitting, Daniel
A1  - Troya, Joel
A1  - Zoller, Wolfram G.
A1  - Hann, Alexander
A1  - Puppe, Frank
T1  - Fast machine learning annotation in the medical domain: a semi-automated video annotation tool for gastroenterologists
JF  - BioMedical Engineering OnLine
N2  - Background
Machine learning, especially deep learning, is becoming more and more relevant in research and development in the medical domain. For all the supervised deep learning applications, data is the most critical factor in securing successful implementation and sustaining the progress of the machine learning model. Especially gastroenterological data, which often involves endoscopic videos, are cumbersome to annotate. Domain experts are needed to interpret and annotate the videos. To support those domain experts, we generated a framework. With this framework, instead of annotating every frame in the video sequence, experts are just performing key annotations at the beginning and the end of sequences with pathologies, e.g., visible polyps. Subsequently, non-expert annotators supported by machine learning add the missing annotations for the frames in-between.
Methods
In our framework, an expert reviews the video and annotates a few video frames to verify the object’s annotations for the non-expert. In a second step, a non-expert has visual confirmation of the given object and can annotate all following and preceding frames with AI assistance. After the expert has finished, relevant frames will be selected and passed on to an AI model. This information allows the AI model to detect and mark the desired object on all following and preceding frames with an annotation. Therefore, the non-expert can adjust and modify the AI predictions and export the results, which can then be used to train the AI model.
Results
Using this framework, we were able to reduce workload of domain experts on average by a factor of 20 on our data. This is primarily due to the structure of the framework, which is designed to minimize the workload of the domain expert. Pairing this framework with a state-of-the-art semi-automated AI model enhances the annotation speed further. Through a prospective study with 10 participants, we show that semi-automated annotation using our tool doubles the annotation speed of non-expert annotators compared to a well-known state-of-the-art annotation tool.
Conclusion
In summary, we introduce a framework for fast expert annotation for gastroenterologists, which reduces the workload of the domain expert considerably while maintaining a very high annotation quality. The framework incorporates a semi-automated annotation system utilizing trained object detection models. The software and framework are open-source.
KW  - object detection
KW  - machine learning
KW  - deep learning
KW  - annotation
KW  - endoscopy
KW  - gastroenterology
KW  - automation
Y1  - 2022
U6  - http://nbn-resolving.de/urn/resolver.pl?urn:nbn:de:bvb:20-opus-300231
VL  - 21
IS  - 1
ER  - 
TY  - JOUR
A1  - Pfitzner, Christian
A1  - May, Stefan
A1  - Nüchter, Andreas
T1  - Body weight estimation for dose-finding and health monitoring of lying, standing and walking patients based on RGB-D data
JF  - Sensors
N2  - This paper describes the estimation of the body weight of a person in front of an RGB-D camera. A survey of different methods for body weight estimation based on depth sensors is given. First, an estimation of people standing in front of a camera is presented. Second, an approach based on a stream of depth images is used to obtain the body weight of a person walking towards a sensor. The algorithm first extracts features from a point cloud and forwards them to an artificial neural network (ANN) to obtain an estimation of body weight. Besides the algorithm for the estimation, this paper further presents an open-access dataset based on measurements from a trauma room in a hospital as well as data from visitors of a public event. In total, the dataset contains 439 measurements. The article illustrates the efficiency of the approach with experiments with persons lying down in a hospital, standing persons, and walking persons. Applicable scenarios for the presented algorithm are body weight-related dosing of emergency patients.
KW  - RGB-D
KW  - human body weight
KW  - image processing
KW  - kinect
KW  - machine learning
KW  - perception
KW  - segmentation
KW  - sensor fusion
KW  - stroke
KW  - thermal camera
Y1  - 2018
U6  - http://nbn-resolving.de/urn/resolver.pl?urn:nbn:de:bvb:20-opus-176642
VL  - 18
IS  - 5
ER  -