The search result changed since you submitted your search request. Documents might be displayed in a different sort order.
  • search hit 2 of 279
Back to Result List

iNALU: Improved Neural Arithmetic Logic Unit

Please always quote using this URN: urn:nbn:de:bvb:20-opus-212301
  • Neural networks have to capture mathematical relationships in order to learn various tasks. They approximate these relations implicitly and therefore often do not generalize well. The recently proposed Neural Arithmetic Logic Unit (NALU) is a novel neural architecture which is able to explicitly represent the mathematical relationships by the units of the network to learn operations such as summation, subtraction or multiplication. Although NALUs have been shown to perform well on various downstream tasks, an in-depth analysis reveals practicalNeural networks have to capture mathematical relationships in order to learn various tasks. They approximate these relations implicitly and therefore often do not generalize well. The recently proposed Neural Arithmetic Logic Unit (NALU) is a novel neural architecture which is able to explicitly represent the mathematical relationships by the units of the network to learn operations such as summation, subtraction or multiplication. Although NALUs have been shown to perform well on various downstream tasks, an in-depth analysis reveals practical shortcomings by design, such as the inability to multiply or divide negative input values or training stability issues for deeper networks. We address these issues and propose an improved model architecture. We evaluate our model empirically in various settings from learning basic arithmetic operations to more complex functions. Our experiments indicate that our model solves stability issues and outperforms the original NALU model in means of arithmetic precision and convergence.show moreshow less

Download full text files

Export metadata

Additional Services

Share in Twitter Search Google Scholar Statistics
Metadaten
Author: Daniel Schlör, Markus Ring, Andreas Hotho
URN:urn:nbn:de:bvb:20-opus-212301
Document Type:Journal article
Faculties:Fakultät für Mathematik und Informatik / Institut für Informatik
Language:English
Parent Title (English):Frontiers in Artificial Intelligence
ISSN:2624-8212
Year of Completion:2020
Volume:3
Article Number:71
Source:Frontiers in Artificial Intelligence 2020, 3:71. doi: 10.3389/frai.2020.00071
DOI:https://doi.org/10.3389/frai.2020.00071
Dewey Decimal Classification:0 Informatik, Informationswissenschaft, allgemeine Werke / 00 Informatik, Wissen, Systeme / 004 Datenverarbeitung; Informatik
Tag:arithmetic calculations; experimental evaluation; machine learning; neural architecture; neural networks
Release Date:2021/03/10
Date of first Publication:2020/09/29
Open-Access-Publikationsfonds / Förderzeitraum 2020
Licence (German):License LogoCC BY: Creative-Commons-Lizenz: Namensnennung 4.0 International