Bitte benutzen Sie diese Kennung, um auf die Ressource zu verweisen: http://doi.org/10.25358/openscience-6840
Autoren: Hartmann, David
Franzen, Daniel
Brodehl, Sebastian
Titel: Studying the evolution of neural activation patterns during training of feed-forward ReLU networks
Online-Publikationsdatum: 24-Mär-2022
Erscheinungsdatum: 2021
Sprache des Dokuments: Englisch
Zusammenfassung/Abstract: The ability of deep neural networks to form powerful emergent representations of complex statistical patterns in data is as remarkable as imperfectly understood. For deep ReLU networks, these are encoded in the mixed discrete–continuous structure of linear weight matrices and non-linear binary activations. Our article develops a new technique for instrumenting such networks to efficiently record activation statistics, such as information content (entropy) and similarity of patterns, in real-world training runs. We then study the evolution of activation patterns during training for networks of different architecture using different training and initialization strategies. As a result, we see characteristic- and general-related as well as architecture-related behavioral patterns: in particular, most architectures form bottom-up structure, with the exception of highly tuned state-of-the-art architectures and methods (PyramidNet and FixUp), where layers appear to converge more simultaneously. We also observe intermediate dips in entropy in conventional CNNs that are not visible in residual networks. A reference implementation is provided under a free license1.
DDC-Sachgruppe: 004 Informatik
004 Data processing
Veröffentlichende Institution: Johannes Gutenberg-Universität Mainz
Organisationseinheit: FB 08 Physik, Mathematik u. Informatik
Veröffentlichungsort: Mainz
ROR: https://ror.org/023b0x485
DOI: http://doi.org/10.25358/openscience-6840
Version: Published version
Publikationstyp: Zeitschriftenaufsatz
Nutzungsrechte: CC BY
Informationen zu den Nutzungsrechten: https://creativecommons.org/licenses/by/4.0/
Zeitschrift: Frontiers in artificial intelligence
4
Seitenzahl oder Artikelnummer: 642374
Verlag: Frontiers Media
Verlagsort: Lausanne
Erscheinungsdatum: 2021
ISSN: 2624-8212
DOI der Originalveröffentlichung: 10.3389/frai.2021.642374
Enthalten in den Sammlungen:JGU-Publikationen

Dateien zu dieser Ressource:
  Datei Beschreibung GrößeFormat
Miniaturbild
studying_the_evolution_of_neu-20220322111055686.pdf2.75 MBAdobe PDFÖffnen/Anzeigen