Skip to main content

Principal Component Analysis and Randomness Test for Big Data Analysis

Practical Applications of RMT-Based Technique

  • Book
  • © 2023

Overview

  • Presents a practical method to use PCA and randomness measure based on the RMT formula
  • Proposes a new and universal approach of big data analysis irrelevant to the details of data types or fields
  • Uses real-world data to derive practical results for stock market forecasts and computer security

Part of the book series: Evolutionary Economics and Social Complexity Science (EESCS, volume 25)

This is a preview of subscription content, log in via an institution to check access.

Access this book

eBook USD 89.00
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Hardcover Book USD 119.99
Price excludes VAT (USA)
  • Durable hardcover edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Other ways to access

Licence this eBook for your library

Institutional subscriptions

Table of contents (6 chapters)

Keywords

About this book

This book presents the novel approach of analyzing large-sized rectangular-shaped numerical data (so-called big data). The essence of this approach is to grasp the "meaning" of the data instantly, without getting into the details of individual data. Unlike conventional approaches of principal component analysis, randomness tests, and visualization methods, the authors' approach has the benefits of universality and simplicity of data analysis, regardless of data types, structures, or specific field of science.

First, mathematical preparation is described. The RMT-PCA and the RMT-test utilize the cross-correlation matrix of time series, XXT, where X represents a rectangular matrix of N rows and L columns and XT represents the transverse matrix of X. Because C is symmetric, namely, CT, it can be converted to a diagonal matrix of eigenvalues by a similarity transformation SCS-1 = SCST using an orthogonal matrix S. When N is significantly large, the histogram of the eigenvalue distribution can be compared to the theoretical formula derived in the context of the random matrix theory (RMT, in abbreviation).

Then the RMT-PCA applied to high-frequency stock prices in Japanese and American markets is dealt with. This approach proves its effectiveness in extracting "trendy" business sectors of the financial market over the prescribed time scale. In this case, X consists of N stock- prices of length L, and the correlation matrix C is an N by N square matrix, whose element at the i-th row and j-th column is the inner product of the price time series of the length L of the i-th stock and the j-th stock of the equal length L.

Next, the RMT-test is applied to measure randomness of various random number generators, including algorithmically generated random numbers and physically generated random numbers.

The book concludes by demonstrating two applications of the RMT-test: (1) a comparison of hash functions, and (2) stock prediction by means of randomness, including a new index of off-randomness related to market decline.

Authors and Affiliations

  • Organization for the Strategic Coordination of Research and Intellectual Properties (OSRI), Meiji University, Tokyo, Japan

    Mieko Tanaka-Yamawaki

  • Department of Mathematical Sciences Based on Modeling and Analysis, School of Interdisciplinary Mathematical Sciences, Meiji University, Tokyo, Japan

    Yumihiko Ikura

About the authors

Mieko Tanaka-Yamawaki, former professor, Tottori University

Yumihiko Ikura, Meiji University

Bibliographic Information

  • Book Title: Principal Component Analysis and Randomness Test for Big Data Analysis

  • Book Subtitle: Practical Applications of RMT-Based Technique

  • Authors: Mieko Tanaka-Yamawaki, Yumihiko Ikura

  • Series Title: Evolutionary Economics and Social Complexity Science

  • DOI: https://doi.org/10.1007/978-981-19-3967-9

  • Publisher: Springer Singapore

  • eBook Packages: Economics and Finance, Economics and Finance (R0)

  • Copyright Information: The Editor(s) (if applicable) and The Author(s), under exclusive license to Springer Nature Singapore Pte Ltd. 2023

  • Hardcover ISBN: 978-981-19-3966-2Published: 24 May 2023

  • Softcover ISBN: 978-981-19-3969-3Due: 24 June 2023

  • eBook ISBN: 978-981-19-3967-9Published: 23 May 2023

  • Series ISSN: 2198-4204

  • Series E-ISSN: 2198-4212

  • Edition Number: 1

  • Number of Pages: VII, 152

  • Number of Illustrations: 1 b/w illustrations

  • Topics: Institutional/Evolutionary Economics, Big Data, Statistics, general

Publish with us