Analysis of anomalies in random permutations using recurrent neural networks
- Abstract: This paper is about detecting the difference between fully-random and semi-random shuffleing data sets, with the use of unsupervised learning algorithms. Because of the limits of the k-means algorithm alone, a recurrent autoencoder is used for feature extraction to improve the results of k-means. In the next step the autoencoder alone is used for clustering. Introduction: In the last years, machine learning has been used more and more in different areas and it is also appropriate for for pattern recognition in data. Random data is characterized through the missing of defined patterns. Permutations without repetitions have the highest amount of entropy for a sequence of its length, which is similar to random data according to Andrei Kolmogorov, who states that random data have the highest amount of information and can’t be compressed. Therefore, this paper analyses the difference between random permutations and good shuffled permutations, which have some remaining patterns left. This is done via a recurrent autoencoder.
Verfasserangaben: | Fabian Fries, Ernst Georg Haffner |
---|---|
URN: | urn:nbn:de:hbz:tr5-746 |
Dokumentart: | Arbeitspapier |
Sprache: | Englisch |
Datum des OPUS-Uploads: | 11.08.2022 |
Datum der Erstveröffentlichung: | 15.08.2022 |
Veröffentlichende Hochschule: | Hochschule Trier |
Datum der Freischaltung: | 15.08.2022 |
Freies Schlagwort / Tag: | anomalies in permutations; recurrent neural networks |
GND-Schlagwort: | Maschinelles Lernen; Mustererkennung; Datensatz; Permutation; Algorithmus; k-Means-Algorithmus; Neuronales Netz |
Seitenzahl: | 4 |
Erste Seite: | 1 |
Letzte Seite: | 4 |
Einrichtungen: | FB Technik |
DDC-Klassifikation: | 0 Informatik, Informationswissenschaft, allgemeine Werke / 00 Informatik, Wissen, Systeme / 000 Informatik, Informationswissenschaft, allgemeine Werke |
Lizenz (Deutsch): | Creative Commons - CC BY - Namensnennung 4.0 International |